scitex 2.14.0__py3-none-any.whl
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- scitex/.mcp.json +46 -0
- scitex/__init__.py +268 -0
- scitex/__main__.py +45 -0
- scitex/__version__.py +16 -0
- scitex/_install_guide.py +262 -0
- scitex/_mcp_tools/__init__.py +36 -0
- scitex/_mcp_tools/audio.py +134 -0
- scitex/_mcp_tools/canvas.py +128 -0
- scitex/_mcp_tools/capture.py +174 -0
- scitex/_mcp_tools/diagram.py +106 -0
- scitex/_mcp_tools/plt.py +350 -0
- scitex/_mcp_tools/scholar.py +373 -0
- scitex/_mcp_tools/stats.py +206 -0
- scitex/_mcp_tools/template.py +66 -0
- scitex/_mcp_tools/ui.py +80 -0
- scitex/_mcp_tools/writer.py +220 -0
- scitex/_optional_deps.py +333 -0
- scitex/ai/README.md +295 -0
- scitex/ai/__init__.py +74 -0
- scitex/ai/_gen_ai/_Anthropic.py +173 -0
- scitex/ai/_gen_ai/_BaseGenAI.py +337 -0
- scitex/ai/_gen_ai/_DeepSeek.py +183 -0
- scitex/ai/_gen_ai/_Google.py +162 -0
- scitex/ai/_gen_ai/_Groq.py +95 -0
- scitex/ai/_gen_ai/_Llama.py +144 -0
- scitex/ai/_gen_ai/_OpenAI.py +235 -0
- scitex/ai/_gen_ai/_PARAMS.py +552 -0
- scitex/ai/_gen_ai/_Perplexity.py +207 -0
- scitex/ai/_gen_ai/__init__.py +44 -0
- scitex/ai/_gen_ai/_calc_cost.py +78 -0
- scitex/ai/_gen_ai/_format_output_func.py +185 -0
- scitex/ai/_gen_ai/_genai_factory.py +71 -0
- scitex/ai/activation/__init__.py +8 -0
- scitex/ai/activation/_define.py +11 -0
- scitex/ai/classification/Classifier.py +131 -0
- scitex/ai/classification/CrossValidationExperiment.py +368 -0
- scitex/ai/classification/README.md +251 -0
- scitex/ai/classification/__init__.py +46 -0
- scitex/ai/classification/examples/timeseries_cv_demo.py +416 -0
- scitex/ai/classification/examples/verify_multi/config.json +13 -0
- scitex/ai/classification/examples/verify_multi/multi_task_comparison.md +20 -0
- scitex/ai/classification/examples/verify_multi/multi_task_validation.json +103 -0
- scitex/ai/classification/examples/verify_multi/paper_export/README.md +20 -0
- scitex/ai/classification/examples/verify_multi/paper_export/raw_results.json +174 -0
- scitex/ai/classification/examples/verify_multi/task1/metadata.json +56 -0
- scitex/ai/classification/examples/verify_multi/task1/paper_export/README.md +20 -0
- scitex/ai/classification/examples/verify_multi/task1/paper_export/raw_results.json +93 -0
- scitex/ai/classification/examples/verify_multi/task1/paper_export/summary_table.tex +15 -0
- scitex/ai/classification/examples/verify_multi/task1/report.md +31 -0
- scitex/ai/classification/examples/verify_multi/task1/validation_report.json +47 -0
- scitex/ai/classification/examples/verify_multi/task2/metadata.json +56 -0
- scitex/ai/classification/examples/verify_multi/task2/paper_export/README.md +20 -0
- scitex/ai/classification/examples/verify_multi/task2/paper_export/raw_results.json +93 -0
- scitex/ai/classification/examples/verify_multi/task2/paper_export/summary_table.tex +15 -0
- scitex/ai/classification/examples/verify_multi/task2/report.md +31 -0
- scitex/ai/classification/examples/verify_multi/task2/validation_report.json +47 -0
- scitex/ai/classification/examples/verify_test/metadata.json +62 -0
- scitex/ai/classification/examples/verify_test/paper_export/README.md +20 -0
- scitex/ai/classification/examples/verify_test/paper_export/raw_results.json +131 -0
- scitex/ai/classification/examples/verify_test/paper_export/summary_table.tex +15 -0
- scitex/ai/classification/examples/verify_test/report.md +31 -0
- scitex/ai/classification/examples/verify_test/validation_report.json +52 -0
- scitex/ai/classification/reporters/_BaseClassificationReporter.py +283 -0
- scitex/ai/classification/reporters/_ClassificationReporter.py +758 -0
- scitex/ai/classification/reporters/_MultiClassificationReporter.py +403 -0
- scitex/ai/classification/reporters/_SingleClassificationReporter.py +1778 -0
- scitex/ai/classification/reporters/__init__.py +11 -0
- scitex/ai/classification/reporters/reporter_utils/_Plotter.py +1122 -0
- scitex/ai/classification/reporters/reporter_utils/__init__.py +72 -0
- scitex/ai/classification/reporters/reporter_utils/aggregation.py +439 -0
- scitex/ai/classification/reporters/reporter_utils/data_models.py +321 -0
- scitex/ai/classification/reporters/reporter_utils/reporting.py +1223 -0
- scitex/ai/classification/reporters/reporter_utils/storage.py +224 -0
- scitex/ai/classification/reporters/reporter_utils/validation.py +382 -0
- scitex/ai/classification/timeseries/README.md +313 -0
- scitex/ai/classification/timeseries/_TimeSeriesBlockingSplit.py +643 -0
- scitex/ai/classification/timeseries/_TimeSeriesCalendarSplit.py +732 -0
- scitex/ai/classification/timeseries/_TimeSeriesMetadata.py +139 -0
- scitex/ai/classification/timeseries/_TimeSeriesSlidingWindowSplit.py +1640 -0
- scitex/ai/classification/timeseries/_TimeSeriesSlidingWindowSplit_v01-not-using-n_splits.py +1609 -0
- scitex/ai/classification/timeseries/_TimeSeriesStrategy.py +83 -0
- scitex/ai/classification/timeseries/_TimeSeriesStratifiedSplit.py +697 -0
- scitex/ai/classification/timeseries/__init__.py +37 -0
- scitex/ai/classification/timeseries/_normalize_timestamp.py +439 -0
- scitex/ai/classification/timeseries/run_all.sh +30 -0
- scitex/ai/clustering/__init__.py +11 -0
- scitex/ai/clustering/_pca.py +114 -0
- scitex/ai/clustering/_umap.py +375 -0
- scitex/ai/feature_extraction/__init__.py +58 -0
- scitex/ai/feature_extraction/vit.py +148 -0
- scitex/ai/feature_selection/__init__.py +30 -0
- scitex/ai/feature_selection/feature_selection.py +359 -0
- scitex/ai/loss/_L1L2Losses.py +34 -0
- scitex/ai/loss/__init__.py +12 -0
- scitex/ai/loss/multi_task_loss.py +47 -0
- scitex/ai/metrics/__init__.py +56 -0
- scitex/ai/metrics/_calc_bacc.py +61 -0
- scitex/ai/metrics/_calc_bacc_from_conf_mat.py +38 -0
- scitex/ai/metrics/_calc_clf_report.py +78 -0
- scitex/ai/metrics/_calc_conf_mat.py +95 -0
- scitex/ai/metrics/_calc_feature_importance.py +179 -0
- scitex/ai/metrics/_calc_mcc.py +61 -0
- scitex/ai/metrics/_calc_pre_rec_auc.py +116 -0
- scitex/ai/metrics/_calc_roc_auc.py +112 -0
- scitex/ai/metrics/_calc_seizure_prediction_metrics.py +505 -0
- scitex/ai/metrics/_calc_silhouette_score.py +501 -0
- scitex/ai/metrics/_normalize_labels.py +83 -0
- scitex/ai/optim/MIGRATION.md +43 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/LICENSE +201 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/README.md +80 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/__init__.py +0 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/ranger/__init__.py +3 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/ranger/ranger.py +204 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/ranger/ranger2020.py +235 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/ranger/ranger913A.py +212 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/ranger/rangerqh.py +184 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/ranger-init.jpg +0 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/ranger-with-gc-options.jpg +0 -0
- scitex/ai/optim/Ranger_Deep_Learning_Optimizer/setup.py +24 -0
- scitex/ai/optim/__init__.py +13 -0
- scitex/ai/optim/_get_set.py +31 -0
- scitex/ai/optim/_optimizers.py +71 -0
- scitex/ai/plt/__init__.py +65 -0
- scitex/ai/plt/_plot_feature_importance.py +321 -0
- scitex/ai/plt/_plot_learning_curve.py +330 -0
- scitex/ai/plt/_plot_optuna_study.py +226 -0
- scitex/ai/plt/_plot_pre_rec_curve.py +284 -0
- scitex/ai/plt/_plot_roc_curve.py +253 -0
- scitex/ai/plt/_stx_conf_mat.py +662 -0
- scitex/ai/sampling/undersample.py +44 -0
- scitex/ai/sk/__init__.py +11 -0
- scitex/ai/sk/_clf.py +58 -0
- scitex/ai/sk/_to_sktime.py +100 -0
- scitex/ai/sklearn/__init__.py +26 -0
- scitex/ai/sklearn/clf.py +58 -0
- scitex/ai/sklearn/to_sktime.py +100 -0
- scitex/ai/training/_EarlyStopping.py +149 -0
- scitex/ai/training/_LearningCurveLogger.py +552 -0
- scitex/ai/training/__init__.py +7 -0
- scitex/ai/utils/__init__.py +22 -0
- scitex/ai/utils/_check_params.py +49 -0
- scitex/ai/utils/_default_dataset.py +46 -0
- scitex/ai/utils/_format_samples_for_sktime.py +26 -0
- scitex/ai/utils/_label_encoder.py +134 -0
- scitex/ai/utils/_merge_labels.py +22 -0
- scitex/ai/utils/_sliding_window_data_augmentation.py +11 -0
- scitex/ai/utils/_under_sample.py +51 -0
- scitex/ai/utils/_verify_n_gpus.py +16 -0
- scitex/ai/utils/grid_search.py +148 -0
- scitex/audio/README.md +129 -0
- scitex/audio/__init__.py +392 -0
- scitex/audio/__main__.py +125 -0
- scitex/audio/_cross_process_lock.py +139 -0
- scitex/audio/_mcp/__init__.py +4 -0
- scitex/audio/_mcp/handlers.py +366 -0
- scitex/audio/_mcp/tool_schemas.py +203 -0
- scitex/audio/_tts.py +334 -0
- scitex/audio/engines/__init__.py +44 -0
- scitex/audio/engines/base.py +275 -0
- scitex/audio/engines/elevenlabs_engine.py +146 -0
- scitex/audio/engines/gtts_engine.py +162 -0
- scitex/audio/engines/pyttsx3_engine.py +131 -0
- scitex/audio/mcp_server.py +293 -0
- scitex/benchmark/__init__.py +42 -0
- scitex/benchmark/benchmark.py +407 -0
- scitex/benchmark/monitor.py +380 -0
- scitex/benchmark/profiler.py +300 -0
- scitex/bridge/__init__.py +121 -0
- scitex/bridge/_figrecipe.py +277 -0
- scitex/bridge/_helpers.py +150 -0
- scitex/bridge/_plt_vis.py +542 -0
- scitex/bridge/_protocol.py +283 -0
- scitex/bridge/_stats_plt.py +272 -0
- scitex/bridge/_stats_vis.py +281 -0
- scitex/browser/README.md +167 -0
- scitex/browser/__init__.py +139 -0
- scitex/browser/auth/__init__.py +35 -0
- scitex/browser/auth/google.py +386 -0
- scitex/browser/automation/CookieHandler.py +215 -0
- scitex/browser/automation/__init__.py +11 -0
- scitex/browser/collaboration/__init__.py +62 -0
- scitex/browser/collaboration/auth_helpers.py +96 -0
- scitex/browser/collaboration/collaborative_agent.py +138 -0
- scitex/browser/collaboration/credential_manager.py +188 -0
- scitex/browser/collaboration/interactive_panel.py +400 -0
- scitex/browser/collaboration/persistent_browser.py +170 -0
- scitex/browser/collaboration/shared_session.py +392 -0
- scitex/browser/collaboration/standard_interactions.py +247 -0
- scitex/browser/collaboration/visual_feedback.py +181 -0
- scitex/browser/core/BrowserMixin.py +321 -0
- scitex/browser/core/ChromeProfileManager.py +438 -0
- scitex/browser/core/__init__.py +18 -0
- scitex/browser/debugging/__init__.py +74 -0
- scitex/browser/debugging/_browser_logger.py +644 -0
- scitex/browser/debugging/_failure_capture.py +379 -0
- scitex/browser/debugging/_highlight_element.py +152 -0
- scitex/browser/debugging/_show_grid.py +153 -0
- scitex/browser/debugging/_sync_session.py +260 -0
- scitex/browser/debugging/_test_monitor.py +293 -0
- scitex/browser/debugging/_visual_cursor.py +443 -0
- scitex/browser/docs/ABOUT_PLAYWRIGHT.md +49 -0
- scitex/browser/interaction/__init__.py +24 -0
- scitex/browser/interaction/click_center.py +150 -0
- scitex/browser/interaction/click_with_fallbacks.py +203 -0
- scitex/browser/interaction/close_popups.py +511 -0
- scitex/browser/interaction/fill_with_fallbacks.py +209 -0
- scitex/browser/pdf/__init__.py +16 -0
- scitex/browser/pdf/click_download_for_chrome_pdf_viewer.py +201 -0
- scitex/browser/pdf/detect_chrome_pdf_viewer.py +195 -0
- scitex/browser/remote/CaptchaHandler.py +447 -0
- scitex/browser/remote/ZenRowsAPIClient.py +341 -0
- scitex/browser/remote/ZenRowsBrowserManager.py +596 -0
- scitex/browser/remote/__init__.py +11 -0
- scitex/browser/stealth/HumanBehavior.py +339 -0
- scitex/browser/stealth/StealthManager.py +993 -0
- scitex/browser/stealth/__init__.py +9 -0
- scitex/canvas/README.md +386 -0
- scitex/canvas/__init__.py +340 -0
- scitex/canvas/_mcp/__init__.py +4 -0
- scitex/canvas/_mcp/handlers.py +372 -0
- scitex/canvas/_mcp/tool_schemas.py +219 -0
- scitex/canvas/backend/__init__.py +56 -0
- scitex/canvas/backend/_export.py +166 -0
- scitex/canvas/backend/_parser.py +188 -0
- scitex/canvas/backend/_render.py +385 -0
- scitex/canvas/canvas.py +434 -0
- scitex/canvas/docs/CANVAS_ARCHITECTURE.md +307 -0
- scitex/canvas/editor/__init__.py +26 -0
- scitex/canvas/editor/_dearpygui_editor.py +1976 -0
- scitex/canvas/editor/_defaults.py +300 -0
- scitex/canvas/editor/_flask_editor.py +37 -0
- scitex/canvas/editor/_mpl_editor.py +246 -0
- scitex/canvas/editor/_qt_editor.py +1096 -0
- scitex/canvas/editor/_tkinter_editor.py +523 -0
- scitex/canvas/editor/edit/__init__.py +47 -0
- scitex/canvas/editor/edit/backend_detector.py +109 -0
- scitex/canvas/editor/edit/bundle_resolver.py +248 -0
- scitex/canvas/editor/edit/editor_launcher.py +292 -0
- scitex/canvas/editor/edit/manual_handler.py +53 -0
- scitex/canvas/editor/edit/panel_loader.py +246 -0
- scitex/canvas/editor/edit/path_resolver.py +67 -0
- scitex/canvas/editor/flask_editor/__init__.py +21 -0
- scitex/canvas/editor/flask_editor/_bbox.py +1340 -0
- scitex/canvas/editor/flask_editor/_core.py +1688 -0
- scitex/canvas/editor/flask_editor/_plotter.py +601 -0
- scitex/canvas/editor/flask_editor/_renderer.py +854 -0
- scitex/canvas/editor/flask_editor/_utils.py +80 -0
- scitex/canvas/editor/flask_editor/static/css/base/reset.css +41 -0
- scitex/canvas/editor/flask_editor/static/css/base/typography.css +16 -0
- scitex/canvas/editor/flask_editor/static/css/base/variables.css +85 -0
- scitex/canvas/editor/flask_editor/static/css/components/buttons.css +217 -0
- scitex/canvas/editor/flask_editor/static/css/components/context-menu.css +93 -0
- scitex/canvas/editor/flask_editor/static/css/components/dropdown.css +57 -0
- scitex/canvas/editor/flask_editor/static/css/components/forms.css +112 -0
- scitex/canvas/editor/flask_editor/static/css/components/modal.css +59 -0
- scitex/canvas/editor/flask_editor/static/css/components/sections.css +212 -0
- scitex/canvas/editor/flask_editor/static/css/features/canvas.css +176 -0
- scitex/canvas/editor/flask_editor/static/css/features/element-inspector.css +190 -0
- scitex/canvas/editor/flask_editor/static/css/features/loading.css +59 -0
- scitex/canvas/editor/flask_editor/static/css/features/overlay.css +45 -0
- scitex/canvas/editor/flask_editor/static/css/features/panel-grid.css +95 -0
- scitex/canvas/editor/flask_editor/static/css/features/selection.css +101 -0
- scitex/canvas/editor/flask_editor/static/css/features/statistics.css +138 -0
- scitex/canvas/editor/flask_editor/static/css/index.css +31 -0
- scitex/canvas/editor/flask_editor/static/css/layout/container.css +7 -0
- scitex/canvas/editor/flask_editor/static/css/layout/controls.css +56 -0
- scitex/canvas/editor/flask_editor/static/css/layout/preview.css +78 -0
- scitex/canvas/editor/flask_editor/static/js/alignment/axis.js +314 -0
- scitex/canvas/editor/flask_editor/static/js/alignment/basic.js +107 -0
- scitex/canvas/editor/flask_editor/static/js/alignment/distribute.js +54 -0
- scitex/canvas/editor/flask_editor/static/js/canvas/canvas.js +172 -0
- scitex/canvas/editor/flask_editor/static/js/canvas/dragging.js +258 -0
- scitex/canvas/editor/flask_editor/static/js/canvas/resize.js +48 -0
- scitex/canvas/editor/flask_editor/static/js/canvas/selection.js +71 -0
- scitex/canvas/editor/flask_editor/static/js/core/api.js +287 -0
- scitex/canvas/editor/flask_editor/static/js/core/state.js +143 -0
- scitex/canvas/editor/flask_editor/static/js/core/utils.js +245 -0
- scitex/canvas/editor/flask_editor/static/js/dev/element-inspector.js +992 -0
- scitex/canvas/editor/flask_editor/static/js/editor/bbox.js +339 -0
- scitex/canvas/editor/flask_editor/static/js/editor/element-drag.js +286 -0
- scitex/canvas/editor/flask_editor/static/js/editor/overlay.js +371 -0
- scitex/canvas/editor/flask_editor/static/js/editor/preview.js +293 -0
- scitex/canvas/editor/flask_editor/static/js/main.js +426 -0
- scitex/canvas/editor/flask_editor/static/js/shortcuts/context-menu.js +152 -0
- scitex/canvas/editor/flask_editor/static/js/shortcuts/keyboard.js +265 -0
- scitex/canvas/editor/flask_editor/static/js/ui/controls.js +184 -0
- scitex/canvas/editor/flask_editor/static/js/ui/download.js +57 -0
- scitex/canvas/editor/flask_editor/static/js/ui/help.js +100 -0
- scitex/canvas/editor/flask_editor/static/js/ui/theme.js +34 -0
- scitex/canvas/editor/flask_editor/templates/__init__.py +123 -0
- scitex/canvas/editor/flask_editor/templates/_html.py +852 -0
- scitex/canvas/editor/flask_editor/templates/_scripts.py +4933 -0
- scitex/canvas/editor/flask_editor/templates/_styles.py +1658 -0
- scitex/canvas/io/__init__.py +105 -0
- scitex/canvas/io/_bundle.py +1121 -0
- scitex/canvas/io/_canvas.py +230 -0
- scitex/canvas/io/_data.py +208 -0
- scitex/canvas/io/_directory.py +205 -0
- scitex/canvas/io/_export.py +463 -0
- scitex/canvas/io/_load.py +168 -0
- scitex/canvas/io/_panel.py +432 -0
- scitex/canvas/io/_save.py +131 -0
- scitex/canvas/mcp_server.py +151 -0
- scitex/canvas/model/__init__.py +118 -0
- scitex/canvas/model/_annotations.py +116 -0
- scitex/canvas/model/_axes.py +153 -0
- scitex/canvas/model/_figure.py +139 -0
- scitex/canvas/model/_guides.py +105 -0
- scitex/canvas/model/_plot.py +124 -0
- scitex/canvas/model/_plot_types.py +682 -0
- scitex/canvas/model/_styles.py +246 -0
- scitex/canvas/utils/__init__.py +75 -0
- scitex/canvas/utils/_defaults.py +334 -0
- scitex/canvas/utils/_validate.py +198 -0
- scitex/capture/README.md +284 -0
- scitex/capture/TODO.md +40 -0
- scitex/capture/__init__.py +109 -0
- scitex/capture/__main__.py +24 -0
- scitex/capture/_mcp/__init__.py +11 -0
- scitex/capture/_mcp/handlers.py +438 -0
- scitex/capture/_mcp/tool_schemas.py +270 -0
- scitex/capture/capture.py +820 -0
- scitex/capture/cli.py +208 -0
- scitex/capture/gif.py +340 -0
- scitex/capture/grid.py +487 -0
- scitex/capture/mcp_server.py +977 -0
- scitex/capture/powershell/capture_all_desktops.ps1 +79 -0
- scitex/capture/powershell/capture_all_monitors.ps1 +63 -0
- scitex/capture/powershell/capture_single_monitor.ps1 +77 -0
- scitex/capture/powershell/capture_url.ps1 +170 -0
- scitex/capture/powershell/capture_window_by_handle.ps1 +129 -0
- scitex/capture/powershell/detect_monitors_and_desktops.ps1 +143 -0
- scitex/capture/powershell/enumerate_virtual_desktops.ps1 +60 -0
- scitex/capture/session.py +71 -0
- scitex/capture/utils.py +670 -0
- scitex/cli/__init__.py +64 -0
- scitex/cli/audio.py +331 -0
- scitex/cli/browser.py +310 -0
- scitex/cli/capture.py +325 -0
- scitex/cli/cloud.py +764 -0
- scitex/cli/config.py +224 -0
- scitex/cli/convert.py +425 -0
- scitex/cli/main.py +235 -0
- scitex/cli/mcp.py +336 -0
- scitex/cli/repro.py +251 -0
- scitex/cli/resource.py +258 -0
- scitex/cli/scholar/__init__.py +89 -0
- scitex/cli/scholar/_fetch.py +347 -0
- scitex/cli/scholar/_jobs.py +306 -0
- scitex/cli/scholar/_library.py +152 -0
- scitex/cli/scholar/_utils.py +35 -0
- scitex/cli/security.py +103 -0
- scitex/cli/stats.py +343 -0
- scitex/cli/template.py +254 -0
- scitex/cli/tex.py +304 -0
- scitex/cli/web.py +405 -0
- scitex/cli/writer.py +329 -0
- scitex/cloud/__init__.py +134 -0
- scitex/cloud/_matplotlib_hook.py +146 -0
- scitex/compat/__init__.py +74 -0
- scitex/config/README.md +313 -0
- scitex/config/_PriorityConfig.py +292 -0
- scitex/config/_ScitexConfig.py +319 -0
- scitex/config/__init__.py +56 -0
- scitex/config/_paths.py +325 -0
- scitex/config/default.yaml +112 -0
- scitex/context/__init__.py +9 -0
- scitex/context/_suppress_output.py +39 -0
- scitex/cv/__init__.py +87 -0
- scitex/cv/_draw.py +236 -0
- scitex/cv/_filters.py +217 -0
- scitex/cv/_io.py +157 -0
- scitex/cv/_transform.py +184 -0
- scitex/datetime/__init__.py +46 -0
- scitex/datetime/_linspace.py +100 -0
- scitex/datetime/_normalize_timestamp.py +306 -0
- scitex/db/README.md +281 -0
- scitex/db/_BaseMixins/_BaseBackupMixin.py +32 -0
- scitex/db/_BaseMixins/_BaseBatchMixin.py +33 -0
- scitex/db/_BaseMixins/_BaseBlobMixin.py +83 -0
- scitex/db/_BaseMixins/_BaseConnectionMixin.py +43 -0
- scitex/db/_BaseMixins/_BaseImportExportMixin.py +37 -0
- scitex/db/_BaseMixins/_BaseIndexMixin.py +31 -0
- scitex/db/_BaseMixins/_BaseMaintenanceMixin.py +31 -0
- scitex/db/_BaseMixins/_BaseQueryMixin.py +54 -0
- scitex/db/_BaseMixins/_BaseRowMixin.py +34 -0
- scitex/db/_BaseMixins/_BaseSchemaMixin.py +44 -0
- scitex/db/_BaseMixins/_BaseTableMixin.py +68 -0
- scitex/db/_BaseMixins/_BaseTransactionMixin.py +50 -0
- scitex/db/_BaseMixins/__init__.py +30 -0
- scitex/db/__init__.py +49 -0
- scitex/db/__main__.py +62 -0
- scitex/db/_check_health.py +365 -0
- scitex/db/_delete_duplicates.py +39 -0
- scitex/db/_inspect.py +368 -0
- scitex/db/_inspect_optimized.py +305 -0
- scitex/db/_postgresql/_PostgreSQL.py +125 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_BackupMixin.py +166 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_BatchMixin.py +82 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_BlobMixin.py +233 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_ConnectionMixin.py +90 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_ImportExportMixin.py +57 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_IndexMixin.py +64 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_MaintenanceMixin.py +172 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_QueryMixin.py +108 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_RowMixin.py +77 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_SchemaMixin.py +126 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_TableMixin.py +176 -0
- scitex/db/_postgresql/_PostgreSQLMixins/_TransactionMixin.py +55 -0
- scitex/db/_postgresql/_PostgreSQLMixins/__init__.py +34 -0
- scitex/db/_postgresql/__init__.py +6 -0
- scitex/db/_sqlite3/README.md +191 -0
- scitex/db/_sqlite3/README_v01.md +126 -0
- scitex/db/_sqlite3/_SQLite3.py +208 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_ArrayMixin.py +580 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_ArrayMixin_v01-need-_hash-col.py +522 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_BatchMixin.py +245 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_BlobMixin.py +281 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_ColumnMixin.py +555 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_ConnectionMixin.py +122 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_GitMixin.py +555 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_ImportExportMixin.py +78 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_IndexMixin.py +34 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_MaintenanceMixin.py +177 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_QueryMixin.py +109 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_RowMixin.py +115 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_TableMixin.py +236 -0
- scitex/db/_sqlite3/_SQLite3Mixins/_TransactionMixin.py +71 -0
- scitex/db/_sqlite3/_SQLite3Mixins/__init__.py +43 -0
- scitex/db/_sqlite3/__init__.py +7 -0
- scitex/db/_sqlite3/_delete_duplicates.py +285 -0
- scitex/decorators/BLUEPRINT.md +101 -0
- scitex/decorators/README.md +179 -0
- scitex/decorators/__init__.py +90 -0
- scitex/decorators/_auto_order.py +172 -0
- scitex/decorators/_batch_fn.py +134 -0
- scitex/decorators/_cache_disk.py +39 -0
- scitex/decorators/_cache_disk_async.py +50 -0
- scitex/decorators/_cache_mem.py +12 -0
- scitex/decorators/_combined.py +104 -0
- scitex/decorators/_converters.py +303 -0
- scitex/decorators/_deprecated.py +201 -0
- scitex/decorators/_not_implemented.py +30 -0
- scitex/decorators/_numpy_fn.py +101 -0
- scitex/decorators/_pandas_fn.py +138 -0
- scitex/decorators/_preserve_doc.py +19 -0
- scitex/decorators/_signal_fn.py +111 -0
- scitex/decorators/_timeout.py +55 -0
- scitex/decorators/_torch_fn.py +155 -0
- scitex/decorators/_wrap.py +39 -0
- scitex/decorators/_xarray_fn.py +108 -0
- scitex/dev/__init__.py +20 -0
- scitex/dev/_analyze_code_flow.py +282 -0
- scitex/dev/_pyproject.py +405 -0
- scitex/dev/_reload.py +59 -0
- scitex/dev/cv/__init__.py +60 -0
- scitex/dev/cv/_compose.py +330 -0
- scitex/dev/cv/_title_card.py +351 -0
- scitex/dev/plt/__init__.py +271 -0
- scitex/dev/plt/data/mpl/PLOTTING_FUNCTIONS.yaml +90 -0
- scitex/dev/plt/data/mpl/PLOTTING_SIGNATURES.yaml +1571 -0
- scitex/dev/plt/data/mpl/PLOTTING_SIGNATURES_DETAILED.yaml +6262 -0
- scitex/dev/plt/data/mpl/SIGNATURES_FLATTENED.yaml +1274 -0
- scitex/dev/plt/data/mpl/dir_ax.txt +459 -0
- scitex/dev/plt/demo_plotters/__init__.py +0 -0
- scitex/dev/plt/demo_plotters/plot_mpl_axhline.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_axhspan.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_axvline.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_axvspan.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_bar.py +29 -0
- scitex/dev/plt/demo_plotters/plot_mpl_barh.py +29 -0
- scitex/dev/plt/demo_plotters/plot_mpl_boxplot.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_contour.py +31 -0
- scitex/dev/plt/demo_plotters/plot_mpl_contourf.py +31 -0
- scitex/dev/plt/demo_plotters/plot_mpl_errorbar.py +30 -0
- scitex/dev/plt/demo_plotters/plot_mpl_eventplot.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_fill.py +30 -0
- scitex/dev/plt/demo_plotters/plot_mpl_fill_between.py +31 -0
- scitex/dev/plt/demo_plotters/plot_mpl_hexbin.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_hist.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_hist2d.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_imshow.py +29 -0
- scitex/dev/plt/demo_plotters/plot_mpl_pcolormesh.py +31 -0
- scitex/dev/plt/demo_plotters/plot_mpl_pie.py +29 -0
- scitex/dev/plt/demo_plotters/plot_mpl_plot.py +29 -0
- scitex/dev/plt/demo_plotters/plot_mpl_quiver.py +31 -0
- scitex/dev/plt/demo_plotters/plot_mpl_scatter.py +28 -0
- scitex/dev/plt/demo_plotters/plot_mpl_stackplot.py +31 -0
- scitex/dev/plt/demo_plotters/plot_mpl_stem.py +29 -0
- scitex/dev/plt/demo_plotters/plot_mpl_step.py +29 -0
- scitex/dev/plt/demo_plotters/plot_mpl_violinplot.py +28 -0
- scitex/dev/plt/demo_plotters/plot_sns_barplot.py +29 -0
- scitex/dev/plt/demo_plotters/plot_sns_boxplot.py +29 -0
- scitex/dev/plt/demo_plotters/plot_sns_heatmap.py +28 -0
- scitex/dev/plt/demo_plotters/plot_sns_histplot.py +29 -0
- scitex/dev/plt/demo_plotters/plot_sns_kdeplot.py +29 -0
- scitex/dev/plt/demo_plotters/plot_sns_lineplot.py +31 -0
- scitex/dev/plt/demo_plotters/plot_sns_scatterplot.py +29 -0
- scitex/dev/plt/demo_plotters/plot_sns_stripplot.py +29 -0
- scitex/dev/plt/demo_plotters/plot_sns_swarmplot.py +29 -0
- scitex/dev/plt/demo_plotters/plot_sns_violinplot.py +29 -0
- scitex/dev/plt/demo_plotters/plot_stx_bar.py +29 -0
- scitex/dev/plt/demo_plotters/plot_stx_barh.py +29 -0
- scitex/dev/plt/demo_plotters/plot_stx_box.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_boxplot.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_conf_mat.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_contour.py +31 -0
- scitex/dev/plt/demo_plotters/plot_stx_ecdf.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_errorbar.py +30 -0
- scitex/dev/plt/demo_plotters/plot_stx_fill_between.py +31 -0
- scitex/dev/plt/demo_plotters/plot_stx_fillv.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_heatmap.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_image.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_imshow.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_joyplot.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_kde.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_line.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_mean_ci.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_mean_std.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_median_iqr.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_raster.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_rectangle.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_scatter.py +29 -0
- scitex/dev/plt/demo_plotters/plot_stx_shaded_line.py +29 -0
- scitex/dev/plt/demo_plotters/plot_stx_violin.py +28 -0
- scitex/dev/plt/demo_plotters/plot_stx_violinplot.py +28 -0
- scitex/dev/plt/mpl/get_dir_ax.py +46 -0
- scitex/dev/plt/mpl/get_signatures.py +176 -0
- scitex/dev/plt/mpl/get_signatures_details.py +522 -0
- scitex/dev/plt/plot_mpl_axhline.py +28 -0
- scitex/dev/plt/plot_mpl_axhspan.py +28 -0
- scitex/dev/plt/plot_mpl_axvline.py +28 -0
- scitex/dev/plt/plot_mpl_axvspan.py +28 -0
- scitex/dev/plt/plot_mpl_bar.py +29 -0
- scitex/dev/plt/plot_mpl_barh.py +29 -0
- scitex/dev/plt/plot_mpl_boxplot.py +28 -0
- scitex/dev/plt/plot_mpl_contour.py +31 -0
- scitex/dev/plt/plot_mpl_contourf.py +31 -0
- scitex/dev/plt/plot_mpl_errorbar.py +30 -0
- scitex/dev/plt/plot_mpl_eventplot.py +28 -0
- scitex/dev/plt/plot_mpl_fill.py +30 -0
- scitex/dev/plt/plot_mpl_fill_between.py +31 -0
- scitex/dev/plt/plot_mpl_hexbin.py +28 -0
- scitex/dev/plt/plot_mpl_hist.py +28 -0
- scitex/dev/plt/plot_mpl_hist2d.py +28 -0
- scitex/dev/plt/plot_mpl_imshow.py +29 -0
- scitex/dev/plt/plot_mpl_pcolormesh.py +31 -0
- scitex/dev/plt/plot_mpl_pie.py +29 -0
- scitex/dev/plt/plot_mpl_plot.py +29 -0
- scitex/dev/plt/plot_mpl_quiver.py +31 -0
- scitex/dev/plt/plot_mpl_scatter.py +28 -0
- scitex/dev/plt/plot_mpl_stackplot.py +31 -0
- scitex/dev/plt/plot_mpl_stem.py +29 -0
- scitex/dev/plt/plot_mpl_step.py +29 -0
- scitex/dev/plt/plot_mpl_violinplot.py +28 -0
- scitex/dev/plt/plot_sns_barplot.py +29 -0
- scitex/dev/plt/plot_sns_boxplot.py +29 -0
- scitex/dev/plt/plot_sns_heatmap.py +28 -0
- scitex/dev/plt/plot_sns_histplot.py +29 -0
- scitex/dev/plt/plot_sns_kdeplot.py +29 -0
- scitex/dev/plt/plot_sns_lineplot.py +31 -0
- scitex/dev/plt/plot_sns_scatterplot.py +29 -0
- scitex/dev/plt/plot_sns_stripplot.py +29 -0
- scitex/dev/plt/plot_sns_swarmplot.py +29 -0
- scitex/dev/plt/plot_sns_violinplot.py +29 -0
- scitex/dev/plt/plot_stx_bar.py +29 -0
- scitex/dev/plt/plot_stx_barh.py +29 -0
- scitex/dev/plt/plot_stx_box.py +28 -0
- scitex/dev/plt/plot_stx_boxplot.py +28 -0
- scitex/dev/plt/plot_stx_conf_mat.py +28 -0
- scitex/dev/plt/plot_stx_contour.py +31 -0
- scitex/dev/plt/plot_stx_ecdf.py +28 -0
- scitex/dev/plt/plot_stx_errorbar.py +30 -0
- scitex/dev/plt/plot_stx_fill_between.py +31 -0
- scitex/dev/plt/plot_stx_fillv.py +28 -0
- scitex/dev/plt/plot_stx_heatmap.py +28 -0
- scitex/dev/plt/plot_stx_image.py +28 -0
- scitex/dev/plt/plot_stx_imshow.py +28 -0
- scitex/dev/plt/plot_stx_joyplot.py +28 -0
- scitex/dev/plt/plot_stx_kde.py +28 -0
- scitex/dev/plt/plot_stx_line.py +28 -0
- scitex/dev/plt/plot_stx_mean_ci.py +28 -0
- scitex/dev/plt/plot_stx_mean_std.py +28 -0
- scitex/dev/plt/plot_stx_median_iqr.py +28 -0
- scitex/dev/plt/plot_stx_raster.py +28 -0
- scitex/dev/plt/plot_stx_rectangle.py +28 -0
- scitex/dev/plt/plot_stx_scatter.py +29 -0
- scitex/dev/plt/plot_stx_shaded_line.py +29 -0
- scitex/dev/plt/plot_stx_violin.py +28 -0
- scitex/dev/plt/plot_stx_violinplot.py +28 -0
- scitex/diagram/README.md +197 -0
- scitex/diagram/__init__.py +48 -0
- scitex/diagram/_compile.py +312 -0
- scitex/diagram/_diagram.py +355 -0
- scitex/diagram/_mcp/__init__.py +4 -0
- scitex/diagram/_mcp/handlers.py +400 -0
- scitex/diagram/_mcp/tool_schemas.py +157 -0
- scitex/diagram/_presets.py +173 -0
- scitex/diagram/_schema.py +182 -0
- scitex/diagram/_split.py +278 -0
- scitex/diagram/mcp_server.py +151 -0
- scitex/dict/_DotDict.py +423 -0
- scitex/dict/__init__.py +21 -0
- scitex/dict/_flatten.py +20 -0
- scitex/dict/_listed_dict.py +42 -0
- scitex/dict/_pop_keys.py +30 -0
- scitex/dict/_replace.py +12 -0
- scitex/dict/_safe_merge.py +53 -0
- scitex/dict/_to_str.py +34 -0
- scitex/dsp/README.md +147 -0
- scitex/dsp/__init__.py +86 -0
- scitex/dsp/_crop.py +124 -0
- scitex/dsp/_demo_sig.py +377 -0
- scitex/dsp/_detect_ripples.py +214 -0
- scitex/dsp/_ensure_3d.py +18 -0
- scitex/dsp/_hilbert.py +78 -0
- scitex/dsp/_listen.py +702 -0
- scitex/dsp/_misc.py +30 -0
- scitex/dsp/_mne.py +43 -0
- scitex/dsp/_modulation_index.py +93 -0
- scitex/dsp/_pac.py +337 -0
- scitex/dsp/_psd.py +114 -0
- scitex/dsp/_resample.py +86 -0
- scitex/dsp/_time.py +39 -0
- scitex/dsp/_transform.py +81 -0
- scitex/dsp/_wavelet.py +212 -0
- scitex/dsp/add_noise.py +128 -0
- scitex/dsp/audio.md +22 -0
- scitex/dsp/example.py +262 -0
- scitex/dsp/filt.py +165 -0
- scitex/dsp/norm.py +33 -0
- scitex/dsp/params.py +51 -0
- scitex/dsp/reference.py +60 -0
- scitex/dsp/template.py +26 -0
- scitex/dsp/utils/__init__.py +19 -0
- scitex/dsp/utils/_differential_bandpass_filters.py +148 -0
- scitex/dsp/utils/_ensure_3d.py +18 -0
- scitex/dsp/utils/_ensure_even_len.py +10 -0
- scitex/dsp/utils/_zero_pad.py +62 -0
- scitex/dsp/utils/filter.py +408 -0
- scitex/dsp/utils/pac.py +177 -0
- scitex/dt/__init__.py +45 -0
- scitex/dt/_linspace.py +130 -0
- scitex/dt/_normalize_timestamp.py +435 -0
- scitex/errors.py +130 -0
- scitex/etc/__init__.py +15 -0
- scitex/etc/wait_key.py +34 -0
- scitex/gen/README.md +71 -0
- scitex/gen/_DimHandler.py +196 -0
- scitex/gen/_TimeStamper.py +244 -0
- scitex/gen/__init__.py +170 -0
- scitex/gen/_alternate_kwarg.py +13 -0
- scitex/gen/_cache.py +11 -0
- scitex/gen/_check_host.py +34 -0
- scitex/gen/_ci.py +12 -0
- scitex/gen/_deprecated_close.py +81 -0
- scitex/gen/_deprecated_start.py +28 -0
- scitex/gen/_detect_environment.py +155 -0
- scitex/gen/_detect_notebook_path.py +173 -0
- scitex/gen/_embed.py +82 -0
- scitex/gen/_get_notebook_path.py +276 -0
- scitex/gen/_inspect_module.py +256 -0
- scitex/gen/_is_ipython.py +12 -0
- scitex/gen/_less.py +48 -0
- scitex/gen/_list_packages.py +139 -0
- scitex/gen/_mat2py.py +88 -0
- scitex/gen/_norm.py +214 -0
- scitex/gen/_norm_cache.py +282 -0
- scitex/gen/_paste.py +18 -0
- scitex/gen/_print_config.py +81 -0
- scitex/gen/_shell.py +48 -0
- scitex/gen/_src.py +108 -0
- scitex/gen/_symlink.py +55 -0
- scitex/gen/_symlog.py +27 -0
- scitex/gen/_title2path.py +60 -0
- scitex/gen/_title_case.py +89 -0
- scitex/gen/_to_even.py +84 -0
- scitex/gen/_to_odd.py +34 -0
- scitex/gen/_to_rank.py +39 -0
- scitex/gen/_transpose.py +37 -0
- scitex/gen/_type.py +78 -0
- scitex/gen/_var_info.py +73 -0
- scitex/gen/_wrap.py +17 -0
- scitex/gen/_xml2dict.py +76 -0
- scitex/gen/misc.py +755 -0
- scitex/gen/path.py +0 -0
- scitex/gists/_SigMacro_processFigure_S.py +128 -0
- scitex/gists/_SigMacro_toBlue.py +172 -0
- scitex/gists/__init__.py +15 -0
- scitex/gists/guideline.md +42 -0
- scitex/git/README.md +121 -0
- scitex/git/__init__.py +61 -0
- scitex/git/_branch.py +185 -0
- scitex/git/_clone.py +167 -0
- scitex/git/_commit.py +170 -0
- scitex/git/_constants.py +17 -0
- scitex/git/_init.py +179 -0
- scitex/git/_remote.py +191 -0
- scitex/git/_result.py +70 -0
- scitex/git/_retry.py +113 -0
- scitex/git/_session.py +60 -0
- scitex/git/_types.py +136 -0
- scitex/git/_utils.py +28 -0
- scitex/git/_validation.py +117 -0
- scitex/git/_workflow.py +87 -0
- scitex/io/.gitkeep +0 -0
- scitex/io/README.md +376 -0
- scitex/io/__init__.py +116 -0
- scitex/io/_cache.py +101 -0
- scitex/io/_flush.py +27 -0
- scitex/io/_glob.py +109 -0
- scitex/io/_json2md.py +118 -0
- scitex/io/_load.py +453 -0
- scitex/io/_load_cache.py +303 -0
- scitex/io/_load_configs.py +175 -0
- scitex/io/_load_modules/_H5Explorer.py +352 -0
- scitex/io/_load_modules/_ZarrExplorer.py +114 -0
- scitex/io/_load_modules/__init__.py +41 -0
- scitex/io/_load_modules/_bibtex.py +206 -0
- scitex/io/_load_modules/_canvas.py +166 -0
- scitex/io/_load_modules/_catboost.py +71 -0
- scitex/io/_load_modules/_con.py +31 -0
- scitex/io/_load_modules/_docx.py +42 -0
- scitex/io/_load_modules/_eeg.py +125 -0
- scitex/io/_load_modules/_hdf5.py +73 -0
- scitex/io/_load_modules/_image.py +66 -0
- scitex/io/_load_modules/_joblib.py +19 -0
- scitex/io/_load_modules/_json.py +20 -0
- scitex/io/_load_modules/_markdown.py +103 -0
- scitex/io/_load_modules/_matlab.py +39 -0
- scitex/io/_load_modules/_numpy.py +39 -0
- scitex/io/_load_modules/_optuna.py +112 -0
- scitex/io/_load_modules/_pandas.py +75 -0
- scitex/io/_load_modules/_pdf.py +958 -0
- scitex/io/_load_modules/_pickle.py +24 -0
- scitex/io/_load_modules/_sqlite3.py +16 -0
- scitex/io/_load_modules/_torch.py +25 -0
- scitex/io/_load_modules/_txt.py +164 -0
- scitex/io/_load_modules/_xml.py +49 -0
- scitex/io/_load_modules/_yaml.py +24 -0
- scitex/io/_load_modules/_zarr.py +128 -0
- scitex/io/_metadata.py +47 -0
- scitex/io/_metadata_modules/__init__.py +46 -0
- scitex/io/_metadata_modules/_embed.py +70 -0
- scitex/io/_metadata_modules/_read.py +64 -0
- scitex/io/_metadata_modules/_utils.py +79 -0
- scitex/io/_metadata_modules/embed_metadata_jpeg.py +74 -0
- scitex/io/_metadata_modules/embed_metadata_pdf.py +53 -0
- scitex/io/_metadata_modules/embed_metadata_png.py +26 -0
- scitex/io/_metadata_modules/embed_metadata_svg.py +62 -0
- scitex/io/_metadata_modules/read_metadata_jpeg.py +57 -0
- scitex/io/_metadata_modules/read_metadata_pdf.py +51 -0
- scitex/io/_metadata_modules/read_metadata_png.py +39 -0
- scitex/io/_metadata_modules/read_metadata_svg.py +44 -0
- scitex/io/_mv_to_tmp.py +19 -0
- scitex/io/_path.py +286 -0
- scitex/io/_qr_utils.py +82 -0
- scitex/io/_reload.py +78 -0
- scitex/io/_save.py +497 -0
- scitex/io/_save_modules/__init__.py +130 -0
- scitex/io/_save_modules/_bibtex.py +199 -0
- scitex/io/_save_modules/_canvas.py +355 -0
- scitex/io/_save_modules/_catboost.py +22 -0
- scitex/io/_save_modules/_csv.py +95 -0
- scitex/io/_save_modules/_excel.py +210 -0
- scitex/io/_save_modules/_figure_utils.py +90 -0
- scitex/io/_save_modules/_hdf5.py +262 -0
- scitex/io/_save_modules/_html.py +48 -0
- scitex/io/_save_modules/_image.py +300 -0
- scitex/io/_save_modules/_image_csv.py +497 -0
- scitex/io/_save_modules/_joblib.py +25 -0
- scitex/io/_save_modules/_json.py +115 -0
- scitex/io/_save_modules/_legends.py +91 -0
- scitex/io/_save_modules/_listed_dfs_as_csv.py +56 -0
- scitex/io/_save_modules/_listed_scalars_as_csv.py +42 -0
- scitex/io/_save_modules/_matlab.py +24 -0
- scitex/io/_save_modules/_mp4.py +29 -0
- scitex/io/_save_modules/_numpy.py +55 -0
- scitex/io/_save_modules/_optuna_study_as_csv_and_pngs.py +49 -0
- scitex/io/_save_modules/_pickle.py +45 -0
- scitex/io/_save_modules/_plot_bundle.py +112 -0
- scitex/io/_save_modules/_plot_scitex.py +536 -0
- scitex/io/_save_modules/_plotly.py +27 -0
- scitex/io/_save_modules/_stx_bundle.py +104 -0
- scitex/io/_save_modules/_symlink.py +96 -0
- scitex/io/_save_modules/_tex.py +278 -0
- scitex/io/_save_modules/_text.py +23 -0
- scitex/io/_save_modules/_torch.py +35 -0
- scitex/io/_save_modules/_yaml.py +68 -0
- scitex/io/_save_modules/_zarr.py +206 -0
- scitex/io/bundle/README.md +221 -0
- scitex/io/bundle/_Bundle.py +759 -0
- scitex/io/bundle/__init__.py +142 -0
- scitex/io/bundle/_children.py +208 -0
- scitex/io/bundle/_conversion/__init__.py +15 -0
- scitex/io/bundle/_conversion/_bundle2dict.py +44 -0
- scitex/io/bundle/_conversion/_dict2bundle.py +50 -0
- scitex/io/bundle/_core.py +651 -0
- scitex/io/bundle/_dataclasses/_Axes.py +57 -0
- scitex/io/bundle/_dataclasses/_BBox.py +54 -0
- scitex/io/bundle/_dataclasses/_ColumnDef.py +72 -0
- scitex/io/bundle/_dataclasses/_DataFormat.py +40 -0
- scitex/io/bundle/_dataclasses/_DataInfo.py +135 -0
- scitex/io/bundle/_dataclasses/_DataSource.py +44 -0
- scitex/io/bundle/_dataclasses/_SizeMM.py +38 -0
- scitex/io/bundle/_dataclasses/_Spec.py +325 -0
- scitex/io/bundle/_dataclasses/_SpecRefs.py +45 -0
- scitex/io/bundle/_dataclasses/__init__.py +35 -0
- scitex/io/bundle/_extractors/__init__.py +32 -0
- scitex/io/bundle/_extractors/_extract_bar.py +131 -0
- scitex/io/bundle/_extractors/_extract_line.py +71 -0
- scitex/io/bundle/_extractors/_extract_scatter.py +79 -0
- scitex/io/bundle/_loader.py +134 -0
- scitex/io/bundle/_manifest.py +99 -0
- scitex/io/bundle/_mpl_helpers.py +480 -0
- scitex/io/bundle/_nested.py +726 -0
- scitex/io/bundle/_saver.py +268 -0
- scitex/io/bundle/_storage.py +200 -0
- scitex/io/bundle/_types.py +99 -0
- scitex/io/bundle/_validation.py +436 -0
- scitex/io/bundle/_zip.py +477 -0
- scitex/io/bundle/kinds/__init__.py +45 -0
- scitex/io/bundle/kinds/_figure/__init__.py +19 -0
- scitex/io/bundle/kinds/_figure/_composite.py +345 -0
- scitex/io/bundle/kinds/_plot/__init__.py +25 -0
- scitex/io/bundle/kinds/_plot/_backend/__init__.py +53 -0
- scitex/io/bundle/kinds/_plot/_backend/_export.py +165 -0
- scitex/io/bundle/kinds/_plot/_backend/_parser.py +188 -0
- scitex/io/bundle/kinds/_plot/_backend/_render.py +538 -0
- scitex/io/bundle/kinds/_plot/_dataclasses/_ChannelEncoding.py +46 -0
- scitex/io/bundle/kinds/_plot/_dataclasses/_Encoding.py +82 -0
- scitex/io/bundle/kinds/_plot/_dataclasses/_Theme.py +441 -0
- scitex/io/bundle/kinds/_plot/_dataclasses/_TraceEncoding.py +52 -0
- scitex/io/bundle/kinds/_plot/_dataclasses/__init__.py +47 -0
- scitex/io/bundle/kinds/_plot/_models/_Annotations.py +115 -0
- scitex/io/bundle/kinds/_plot/_models/_Axes.py +152 -0
- scitex/io/bundle/kinds/_plot/_models/_Figure.py +138 -0
- scitex/io/bundle/kinds/_plot/_models/_Guides.py +104 -0
- scitex/io/bundle/kinds/_plot/_models/_Plot.py +123 -0
- scitex/io/bundle/kinds/_plot/_models/_Styles.py +245 -0
- scitex/io/bundle/kinds/_plot/_models/__init__.py +80 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/__init__.py +156 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_bar.py +43 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_box.py +38 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_distribution.py +36 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_errorbar.py +60 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_histogram.py +30 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_image.py +61 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_line.py +57 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_scatter.py +30 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_seaborn.py +121 -0
- scitex/io/bundle/kinds/_plot/_models/_plot_types/_violin.py +36 -0
- scitex/io/bundle/kinds/_plot/_utils/__init__.py +129 -0
- scitex/io/bundle/kinds/_plot/_utils/_auto_layout.py +127 -0
- scitex/io/bundle/kinds/_plot/_utils/_calc_bounds.py +111 -0
- scitex/io/bundle/kinds/_plot/_utils/_const_sizes.py +48 -0
- scitex/io/bundle/kinds/_plot/_utils/_convert_coords.py +77 -0
- scitex/io/bundle/kinds/_plot/_utils/_get_template.py +178 -0
- scitex/io/bundle/kinds/_plot/_utils/_normalize.py +73 -0
- scitex/io/bundle/kinds/_plot/_utils/_plot_layout.py +397 -0
- scitex/io/bundle/kinds/_plot/_utils/_validate.py +197 -0
- scitex/io/bundle/kinds/_shape/__init__.py +141 -0
- scitex/io/bundle/kinds/_stats/__init__.py +56 -0
- scitex/io/bundle/kinds/_stats/_dataclasses/_Stats.py +423 -0
- scitex/io/bundle/kinds/_stats/_dataclasses/__init__.py +48 -0
- scitex/io/bundle/kinds/_table/__init__.py +72 -0
- scitex/io/bundle/kinds/_table/_latex/__init__.py +93 -0
- scitex/io/bundle/kinds/_table/_latex/_editor/__init__.py +11 -0
- scitex/io/bundle/kinds/_table/_latex/_editor/_app.py +725 -0
- scitex/io/bundle/kinds/_table/_latex/_export.py +279 -0
- scitex/io/bundle/kinds/_table/_latex/_figure_exporter.py +153 -0
- scitex/io/bundle/kinds/_table/_latex/_stats_formatter.py +274 -0
- scitex/io/bundle/kinds/_table/_latex/_table_exporter.py +362 -0
- scitex/io/bundle/kinds/_table/_latex/_utils.py +369 -0
- scitex/io/bundle/kinds/_table/_latex/_validator.py +445 -0
- scitex/io/bundle/kinds/_text/__init__.py +77 -0
- scitex/io/bundle/schemas/__init__.py +30 -0
- scitex/io/bundle/schemas/data_info.schema.json +75 -0
- scitex/io/bundle/schemas/encoding.schema.json +90 -0
- scitex/io/bundle/schemas/node.schema.json +145 -0
- scitex/io/bundle/schemas/render_manifest.schema.json +62 -0
- scitex/io/bundle/schemas/stats.schema.json +132 -0
- scitex/io/bundle/schemas/theme.schema.json +141 -0
- scitex/io/utils/__init__.py +21 -0
- scitex/io/utils/h5_to_zarr.py +636 -0
- scitex/linalg/__init__.py +17 -0
- scitex/linalg/_distance.py +63 -0
- scitex/linalg/_geometric_median.py +69 -0
- scitex/linalg/_misc.py +73 -0
- scitex/logging/README.md +374 -0
- scitex/logging/_Tee.py +195 -0
- scitex/logging/__init__.py +216 -0
- scitex/logging/_config.py +161 -0
- scitex/logging/_context.py +105 -0
- scitex/logging/_errors.py +508 -0
- scitex/logging/_formatters.py +158 -0
- scitex/logging/_handlers.py +63 -0
- scitex/logging/_levels.py +27 -0
- scitex/logging/_logger.py +261 -0
- scitex/logging/_print_capture.py +95 -0
- scitex/logging/_warnings.py +261 -0
- scitex/mcp_server.py +91 -0
- scitex/ml/__init__.py +30 -0
- scitex/msword/__init__.py +255 -0
- scitex/msword/profiles.py +357 -0
- scitex/msword/reader.py +753 -0
- scitex/msword/utils.py +289 -0
- scitex/msword/writer.py +362 -0
- scitex/nn/_AxiswiseDropout.py +27 -0
- scitex/nn/_BNet.py +130 -0
- scitex/nn/_BNet_Res.py +164 -0
- scitex/nn/_ChannelGainChanger.py +44 -0
- scitex/nn/_DropoutChannels.py +50 -0
- scitex/nn/_Filters.py +493 -0
- scitex/nn/_FreqGainChanger.py +110 -0
- scitex/nn/_GaussianFilter.py +48 -0
- scitex/nn/_Hilbert.py +111 -0
- scitex/nn/_MNet_1000.py +157 -0
- scitex/nn/_ModulationIndex.py +223 -0
- scitex/nn/_PAC.py +415 -0
- scitex/nn/_PSD.py +39 -0
- scitex/nn/_ResNet1D.py +120 -0
- scitex/nn/_SpatialAttention.py +25 -0
- scitex/nn/_Spectrogram.py +170 -0
- scitex/nn/_SwapChannels.py +50 -0
- scitex/nn/_TransposeLayer.py +19 -0
- scitex/nn/_Wavelet.py +183 -0
- scitex/nn/__init__.py +75 -0
- scitex/os/__init__.py +8 -0
- scitex/os/_mv.py +50 -0
- scitex/parallel/__init__.py +8 -0
- scitex/parallel/_run.py +152 -0
- scitex/path/README.md +77 -0
- scitex/path/__init__.py +51 -0
- scitex/path/_clean.py +58 -0
- scitex/path/_find.py +90 -0
- scitex/path/_get_module_path.py +53 -0
- scitex/path/_get_spath.py +13 -0
- scitex/path/_getsize.py +32 -0
- scitex/path/_increment_version.py +80 -0
- scitex/path/_mk_spath.py +48 -0
- scitex/path/_path.py +12 -0
- scitex/path/_split.py +35 -0
- scitex/path/_symlink.py +343 -0
- scitex/path/_this_path.py +33 -0
- scitex/path/_version.py +102 -0
- scitex/pd/__init__.py +46 -0
- scitex/pd/_find_indi.py +126 -0
- scitex/pd/_find_pval.py +113 -0
- scitex/pd/_force_df.py +154 -0
- scitex/pd/_from_xyz.py +71 -0
- scitex/pd/_get_unique.py +100 -0
- scitex/pd/_ignore_SettingWithCopyWarning.py +34 -0
- scitex/pd/_melt_cols.py +81 -0
- scitex/pd/_merge_columns.py +221 -0
- scitex/pd/_mv.py +63 -0
- scitex/pd/_replace.py +62 -0
- scitex/pd/_round.py +97 -0
- scitex/pd/_slice.py +63 -0
- scitex/pd/_sort.py +91 -0
- scitex/pd/_to_numeric.py +53 -0
- scitex/pd/_to_xy.py +58 -0
- scitex/pd/_to_xyz.py +110 -0
- scitex/plt/README.md +1675 -0
- scitex/plt/__init__.py +803 -0
- scitex/plt/_figrecipe.py +236 -0
- scitex/plt/_figrecipe_integration.py +121 -0
- scitex/plt/_mcp/__init__.py +4 -0
- scitex/plt/_mcp/_handlers_annotation.py +102 -0
- scitex/plt/_mcp/_handlers_figure.py +195 -0
- scitex/plt/_mcp/_handlers_plot.py +252 -0
- scitex/plt/_mcp/_handlers_style.py +219 -0
- scitex/plt/_mcp/handlers.py +74 -0
- scitex/plt/_mcp/tool_schemas.py +497 -0
- scitex/plt/_subplots/_AxesWrapper.py +197 -0
- scitex/plt/_subplots/_AxisWrapper.py +374 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_AdjustmentMixin/__init__.py +36 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_AdjustmentMixin/_labels.py +264 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_AdjustmentMixin/_metadata.py +213 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_AdjustmentMixin/_visual.py +128 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_MatplotlibPlotMixin/__init__.py +60 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_MatplotlibPlotMixin/_base.py +34 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_MatplotlibPlotMixin/_scientific.py +593 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_MatplotlibPlotMixin/_statistical.py +654 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_MatplotlibPlotMixin/_stx_aliases.py +527 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_RawMatplotlibMixin.py +321 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_SeabornMixin/__init__.py +33 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_SeabornMixin/_base.py +152 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_SeabornMixin/_wrappers.py +600 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_TrackingMixin.py +197 -0
- scitex/plt/_subplots/_AxisWrapperMixins/_UnitAwareMixin.py +440 -0
- scitex/plt/_subplots/_AxisWrapperMixins/__init__.py +91 -0
- scitex/plt/_subplots/_FigWrapper.py +475 -0
- scitex/plt/_subplots/_SubplotsWrapper.py +329 -0
- scitex/plt/_subplots/__init__.py +121 -0
- scitex/plt/_subplots/_export_as_csv.py +467 -0
- scitex/plt/_subplots/_export_as_csv_formatters/__init__.py +83 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_annotate.py +72 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_bar.py +138 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_barh.py +58 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_boxplot.py +74 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_contour.py +50 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_contourf.py +62 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_errorbar.py +88 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_eventplot.py +83 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_fill.py +46 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_fill_between.py +46 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_hexbin.py +51 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_hist.py +91 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_hist2d.py +51 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_imshow.py +85 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_imshow2d.py +48 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_matshow.py +50 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_pcolormesh.py +66 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_pie.py +51 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_plot.py +222 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_plot_box.py +98 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_plot_imshow.py +53 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_plot_kde.py +59 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_plot_scatter.py +44 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_quiver.py +61 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_scatter.py +43 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_barplot.py +66 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_boxplot.py +112 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_heatmap.py +80 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_histplot.py +95 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_jointplot.py +76 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_kdeplot.py +91 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_lineplot.py +64 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_pairplot.py +57 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_scatterplot.py +72 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_stripplot.py +80 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_swarmplot.py +80 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_sns_violinplot.py +134 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stackplot.py +62 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stem.py +51 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_step.py +51 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_streamplot.py +56 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_bar.py +84 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_barh.py +85 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_conf_mat.py +75 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_contour.py +54 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_ecdf.py +55 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_errorbar.py +120 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_fillv.py +72 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_heatmap.py +74 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_image.py +109 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_imshow.py +63 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_joyplot.py +84 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_line.py +55 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_mean_ci.py +50 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_mean_std.py +50 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_median_iqr.py +50 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_raster.py +52 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_rectangle.py +129 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_scatter.py +51 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_scatter_hist.py +92 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_shaded_line.py +72 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_stx_violin.py +115 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_text.py +61 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_violin.py +65 -0
- scitex/plt/_subplots/_export_as_csv_formatters/_format_violinplot.py +82 -0
- scitex/plt/_subplots/_export_as_csv_formatters/test_formatters.py +208 -0
- scitex/plt/_subplots/_export_as_csv_formatters/verify_formatters.py +359 -0
- scitex/plt/_subplots/_fonts.py +71 -0
- scitex/plt/_subplots/_mm_layout.py +282 -0
- scitex/plt/_tpl.py +29 -0
- scitex/plt/ax/__init__.py +123 -0
- scitex/plt/ax/_plot/__init__.py +79 -0
- scitex/plt/ax/_plot/_add_fitted_line.py +152 -0
- scitex/plt/ax/_plot/_plot_circular_hist.py +126 -0
- scitex/plt/ax/_plot/_plot_cube.py +57 -0
- scitex/plt/ax/_plot/_plot_statistical_shaded_line.py +255 -0
- scitex/plt/ax/_plot/_stx_conf_mat.py +139 -0
- scitex/plt/ax/_plot/_stx_ecdf.py +113 -0
- scitex/plt/ax/_plot/_stx_fillv.py +57 -0
- scitex/plt/ax/_plot/_stx_heatmap.py +369 -0
- scitex/plt/ax/_plot/_stx_image.py +96 -0
- scitex/plt/ax/_plot/_stx_joyplot.py +135 -0
- scitex/plt/ax/_plot/_stx_raster.py +199 -0
- scitex/plt/ax/_plot/_stx_rectangle.py +70 -0
- scitex/plt/ax/_plot/_stx_scatter_hist.py +133 -0
- scitex/plt/ax/_plot/_stx_shaded_line.py +219 -0
- scitex/plt/ax/_plot/_stx_violin.py +352 -0
- scitex/plt/ax/_style/__init__.py +42 -0
- scitex/plt/ax/_style/_add_marginal_ax.py +46 -0
- scitex/plt/ax/_style/_add_panel.py +92 -0
- scitex/plt/ax/_style/_auto_scale_axis.py +199 -0
- scitex/plt/ax/_style/_extend.py +66 -0
- scitex/plt/ax/_style/_force_aspect.py +39 -0
- scitex/plt/ax/_style/_format_label.py +23 -0
- scitex/plt/ax/_style/_format_units.py +103 -0
- scitex/plt/ax/_style/_hide_spines.py +86 -0
- scitex/plt/ax/_style/_map_ticks.py +184 -0
- scitex/plt/ax/_style/_rotate_labels.py +320 -0
- scitex/plt/ax/_style/_rotate_labels_v01.py +258 -0
- scitex/plt/ax/_style/_sci_note.py +279 -0
- scitex/plt/ax/_style/_set_log_scale.py +334 -0
- scitex/plt/ax/_style/_set_meta.py +291 -0
- scitex/plt/ax/_style/_set_n_ticks.py +37 -0
- scitex/plt/ax/_style/_set_size.py +16 -0
- scitex/plt/ax/_style/_set_supxyt.py +133 -0
- scitex/plt/ax/_style/_set_ticks.py +276 -0
- scitex/plt/ax/_style/_set_xyt.py +130 -0
- scitex/plt/ax/_style/_share_axes.py +274 -0
- scitex/plt/ax/_style/_shift.py +139 -0
- scitex/plt/ax/_style/_show_spines.py +334 -0
- scitex/plt/ax/_style/_style_barplot.py +69 -0
- scitex/plt/ax/_style/_style_boxplot.py +155 -0
- scitex/plt/ax/_style/_style_errorbar.py +82 -0
- scitex/plt/ax/_style/_style_scatter.py +82 -0
- scitex/plt/ax/_style/_style_suptitles.py +76 -0
- scitex/plt/ax/_style/_style_violinplot.py +115 -0
- scitex/plt/color/_PARAMS.py +70 -0
- scitex/plt/color/__init__.py +84 -0
- scitex/plt/color/_add_hue_col.py +42 -0
- scitex/plt/color/_colors.py +204 -0
- scitex/plt/color/_get_colors_from_conf_matap.py +136 -0
- scitex/plt/color/_interpolate.py +29 -0
- scitex/plt/color/_vizualize_colors.py +53 -0
- scitex/plt/docs/FIGURE_ARCHITECTURE.md +315 -0
- scitex/plt/gallery/README.md +75 -0
- scitex/plt/gallery/__init__.py +126 -0
- scitex/plt/gallery/_generate.py +586 -0
- scitex/plt/gallery/_plots.py +594 -0
- scitex/plt/gallery/_registry.py +153 -0
- scitex/plt/gallery.md +170 -0
- scitex/plt/io/__init__.py +51 -0
- scitex/plt/io/_bundle.py +537 -0
- scitex/plt/io/_layered_bundle.py +1478 -0
- scitex/plt/mcp_server.py +231 -0
- scitex/plt/styles/SCITEX_STYLE.yaml +130 -0
- scitex/plt/styles/__init__.py +71 -0
- scitex/plt/styles/_plot_defaults.py +210 -0
- scitex/plt/styles/_plot_postprocess.py +487 -0
- scitex/plt/styles/_postprocess_helpers.py +158 -0
- scitex/plt/styles/_style_loader.py +268 -0
- scitex/plt/styles/presets.py +311 -0
- scitex/plt/utils/__init__.py +114 -0
- scitex/plt/utils/_calc_bacc_from_conf_mat.py +46 -0
- scitex/plt/utils/_calc_nice_ticks.py +101 -0
- scitex/plt/utils/_close.py +73 -0
- scitex/plt/utils/_collect_figure_metadata.py +3384 -0
- scitex/plt/utils/_colorbar.py +156 -0
- scitex/plt/utils/_configure_mpl.py +427 -0
- scitex/plt/utils/_crop.py +297 -0
- scitex/plt/utils/_csv_column_naming.py +474 -0
- scitex/plt/utils/_dimension_viewer.py +425 -0
- scitex/plt/utils/_figure_from_axes_mm.py +407 -0
- scitex/plt/utils/_figure_mm.py +472 -0
- scitex/plt/utils/_get_actual_font.py +57 -0
- scitex/plt/utils/_histogram_utils.py +134 -0
- scitex/plt/utils/_hitmap.py +1643 -0
- scitex/plt/utils/_im2grid.py +70 -0
- scitex/plt/utils/_is_valid_axis.py +82 -0
- scitex/plt/utils/_mk_colorbar.py +65 -0
- scitex/plt/utils/_mk_patches.py +26 -0
- scitex/plt/utils/_scientific_captions.py +701 -0
- scitex/plt/utils/_scitex_config.py +224 -0
- scitex/plt/utils/_units.py +123 -0
- scitex/plt/utils/metadata/__init__.py +61 -0
- scitex/plt/utils/metadata/_artist_extraction.py +119 -0
- scitex/plt/utils/metadata/_axes_metadata.py +93 -0
- scitex/plt/utils/metadata/_collection_artists.py +292 -0
- scitex/plt/utils/metadata/_core.py +207 -0
- scitex/plt/utils/metadata/_csv_column_extraction.py +186 -0
- scitex/plt/utils/metadata/_csv_hash.py +115 -0
- scitex/plt/utils/metadata/_csv_verification.py +95 -0
- scitex/plt/utils/metadata/_data_linkage.py +263 -0
- scitex/plt/utils/metadata/_dimensions.py +242 -0
- scitex/plt/utils/metadata/_editable_export.py +405 -0
- scitex/plt/utils/metadata/_figure_metadata.py +58 -0
- scitex/plt/utils/metadata/_geometry_extraction.py +570 -0
- scitex/plt/utils/metadata/_image_text_artists.py +168 -0
- scitex/plt/utils/metadata/_label_parsing.py +82 -0
- scitex/plt/utils/metadata/_legend_extraction.py +120 -0
- scitex/plt/utils/metadata/_line_artists.py +367 -0
- scitex/plt/utils/metadata/_line_semantic_handling.py +173 -0
- scitex/plt/utils/metadata/_patch_artists.py +211 -0
- scitex/plt/utils/metadata/_plot_content.py +26 -0
- scitex/plt/utils/metadata/_plot_type_detection.py +184 -0
- scitex/plt/utils/metadata/_precision.py +134 -0
- scitex/plt/utils/metadata/_precision_config.py +68 -0
- scitex/plt/utils/metadata/_precision_sections.py +211 -0
- scitex/plt/utils/metadata/_recipe_extraction.py +267 -0
- scitex/plt/utils/metadata/_style_parsing.py +174 -0
- scitex/repro/README_RandomStateManager.md +211 -0
- scitex/repro/_RandomStateManager.py +684 -0
- scitex/repro/__init__.py +66 -0
- scitex/repro/_gen_ID.py +137 -0
- scitex/repro/_gen_timestamp.py +114 -0
- scitex/repro/_hash_array.py +139 -0
- scitex/reproduce/__init__.py +26 -0
- scitex/resource/README.md +81 -0
- scitex/resource/__init__.py +31 -0
- scitex/resource/_get_processor_usages.py +283 -0
- scitex/resource/_get_specs/info.yaml +122 -0
- scitex/resource/_get_specs/specs.yaml +129 -0
- scitex/resource/_get_specs.py +280 -0
- scitex/resource/_log_processor_usages.py +192 -0
- scitex/resource/_utils/__init__.py +31 -0
- scitex/resource/_utils/_get_env_info.py +469 -0
- scitex/resource/limit_ram.py +33 -0
- scitex/rng/__init__.py +32 -0
- scitex/schema/README.md +178 -0
- scitex/schema/__init__.py +222 -0
- scitex/schema/_canvas.py +444 -0
- scitex/schema/_encoding.py +273 -0
- scitex/schema/_figure_elements.py +406 -0
- scitex/schema/_plot.py +1015 -0
- scitex/schema/_stats.py +762 -0
- scitex/schema/_theme.py +360 -0
- scitex/schema/_validation.py +492 -0
- scitex/scholar/.gitignore +23 -0
- scitex/scholar/README.md +122 -0
- scitex/scholar/__init__.py +354 -0
- scitex/scholar/__main__.py +339 -0
- scitex/scholar/_mcp/__init__.py +4 -0
- scitex/scholar/_mcp/all_handlers.py +74 -0
- scitex/scholar/_mcp/handlers.py +1478 -0
- scitex/scholar/_mcp/job_handlers.py +172 -0
- scitex/scholar/_mcp/job_tool_schemas.py +166 -0
- scitex/scholar/_mcp/tool_schemas.py +505 -0
- scitex/scholar/auth/README.md +83 -0
- scitex/scholar/auth/ScholarAuthManager.py +285 -0
- scitex/scholar/auth/__init__.py +21 -0
- scitex/scholar/auth/core/AuthenticationGateway.py +459 -0
- scitex/scholar/auth/core/BrowserAuthenticator.py +468 -0
- scitex/scholar/auth/core/StrategyResolver.py +292 -0
- scitex/scholar/auth/core/__init__.py +20 -0
- scitex/scholar/auth/gateway/_OpenURLLinkFinder.py +110 -0
- scitex/scholar/auth/gateway/_OpenURLResolver.py +201 -0
- scitex/scholar/auth/gateway/__init__.py +38 -0
- scitex/scholar/auth/gateway/_resolve_functions.py +101 -0
- scitex/scholar/auth/providers/BaseAuthenticator.py +167 -0
- scitex/scholar/auth/providers/EZProxyAuthenticator.py +477 -0
- scitex/scholar/auth/providers/OpenAthensAuthenticator.py +436 -0
- scitex/scholar/auth/providers/ShibbolethAuthenticator.py +700 -0
- scitex/scholar/auth/providers/__init__.py +18 -0
- scitex/scholar/auth/providers/_notifications.py +417 -0
- scitex/scholar/auth/session/AuthCacheManager.py +186 -0
- scitex/scholar/auth/session/SessionManager.py +157 -0
- scitex/scholar/auth/session/__init__.py +11 -0
- scitex/scholar/auth/sso/BaseSSOAutomator.py +386 -0
- scitex/scholar/auth/sso/OpenAthensSSOAutomator.py +349 -0
- scitex/scholar/auth/sso/README.md +204 -0
- scitex/scholar/auth/sso/SSOAutomator.py +173 -0
- scitex/scholar/auth/sso/UniversityOfMelbourneSSOAutomator.py +363 -0
- scitex/scholar/auth/sso/__init__.py +15 -0
- scitex/scholar/browser/README.md +60 -0
- scitex/scholar/browser/ScholarBrowserManager.py +680 -0
- scitex/scholar/browser/__init__.py +39 -0
- scitex/scholar/browser/utils/README.md +96 -0
- scitex/scholar/browser/utils/__init__.py +13 -0
- scitex/scholar/browser/utils/click_and_wait.py +204 -0
- scitex/scholar/browser/utils/close_unwanted_pages.py +137 -0
- scitex/scholar/browser/utils/wait_redirects.py +707 -0
- scitex/scholar/citation_graph/README.md +117 -0
- scitex/scholar/citation_graph/__init__.py +28 -0
- scitex/scholar/citation_graph/builder.py +210 -0
- scitex/scholar/citation_graph/database.py +239 -0
- scitex/scholar/citation_graph/example.py +91 -0
- scitex/scholar/citation_graph/models.py +80 -0
- scitex/scholar/cli/README.md +368 -0
- scitex/scholar/cli/_CentralArgumentParser.py +137 -0
- scitex/scholar/cli/_argument_groups.py +222 -0
- scitex/scholar/cli/_doi_operations.py +56 -0
- scitex/scholar/cli/_url_utils.py +219 -0
- scitex/scholar/cli/chrome.py +101 -0
- scitex/scholar/cli/download_pdf.py +236 -0
- scitex/scholar/cli/handlers/__init__.py +23 -0
- scitex/scholar/cli/handlers/bibtex_handler.py +165 -0
- scitex/scholar/cli/handlers/doi_handler.py +63 -0
- scitex/scholar/cli/handlers/project_handler.py +261 -0
- scitex/scholar/cli/open_browser.py +297 -0
- scitex/scholar/cli/open_browser_auto.py +368 -0
- scitex/scholar/cli/open_browser_monitored.py +407 -0
- scitex/scholar/config/PublisherRules.py +132 -0
- scitex/scholar/config/README.md +31 -0
- scitex/scholar/config/ScholarConfig.py +143 -0
- scitex/scholar/config/__init__.py +18 -0
- scitex/scholar/config/_categories/README.md +149 -0
- scitex/scholar/config/_categories/api_keys.yaml +19 -0
- scitex/scholar/config/_categories/auth_gateway.yaml +149 -0
- scitex/scholar/config/_categories/authentication.yaml +24 -0
- scitex/scholar/config/_categories/browser.yaml +24 -0
- scitex/scholar/config/_categories/core.yaml +24 -0
- scitex/scholar/config/_categories/doi_publishers.yaml +53 -0
- scitex/scholar/config/_categories/notifications.yaml +11 -0
- scitex/scholar/config/_categories/pdf_download.yaml +15 -0
- scitex/scholar/config/_categories/publishers_pdf_rules.yaml +184 -0
- scitex/scholar/config/_categories/search_engines.yaml +22 -0
- scitex/scholar/config/_categories/url_finder_openurl.yaml +87 -0
- scitex/scholar/config/_categories/url_finder_selectors.yaml +147 -0
- scitex/scholar/config/core/_CascadeConfig.py +156 -0
- scitex/scholar/config/core/_PathManager.py +535 -0
- scitex/scholar/config/default.yaml +727 -0
- scitex/scholar/core/Paper.py +708 -0
- scitex/scholar/core/Papers.py +711 -0
- scitex/scholar/core/README.md +109 -0
- scitex/scholar/core/Scholar.py +1857 -0
- scitex/scholar/core/__init__.py +53 -0
- scitex/scholar/core/journal_normalizer.py +538 -0
- scitex/scholar/core/oa_cache.py +289 -0
- scitex/scholar/core/open_access.py +466 -0
- scitex/scholar/data/.gitkeep +0 -0
- scitex/scholar/data/README.md +44 -0
- scitex/scholar/data/bib_files/bibliography.bib +1952 -0
- scitex/scholar/data/bib_files/neurovista.bib +277 -0
- scitex/scholar/data/bib_files/neurovista_enriched.bib +441 -0
- scitex/scholar/data/bib_files/neurovista_enriched_enriched.bib +441 -0
- scitex/scholar/data/bib_files/neurovista_processed.bib +338 -0
- scitex/scholar/data/bib_files/openaccess.bib +89 -0
- scitex/scholar/data/bib_files/pac-seizure_prediction_enriched.bib +2178 -0
- scitex/scholar/data/bib_files/pac.bib +698 -0
- scitex/scholar/data/bib_files/pac_enriched.bib +1061 -0
- scitex/scholar/data/bib_files/pac_processed.bib +0 -0
- scitex/scholar/data/bib_files/pac_titles.txt +75 -0
- scitex/scholar/data/bib_files/paywalled.bib +98 -0
- scitex/scholar/data/bib_files/related-papers-by-coauthors.bib +58 -0
- scitex/scholar/data/bib_files/related-papers-by-coauthors_enriched.bib +87 -0
- scitex/scholar/data/bib_files/seizure_prediction.bib +694 -0
- scitex/scholar/data/bib_files/seizure_prediction_processed.bib +0 -0
- scitex/scholar/data/bib_files/test_complete_enriched.bib +437 -0
- scitex/scholar/data/bib_files/test_final_enriched.bib +437 -0
- scitex/scholar/data/bib_files/test_seizure.bib +46 -0
- scitex/scholar/data/impact_factor/JCR_IF_2022.xlsx +0 -0
- scitex/scholar/data/impact_factor/JCR_IF_2024.db +0 -0
- scitex/scholar/data/impact_factor/JCR_IF_2024.xlsx +0 -0
- scitex/scholar/data/impact_factor/JCR_IF_2024_v01.db +0 -0
- scitex/scholar/data/impact_factor.db +0 -0
- scitex/scholar/docs/00_SUMMARY.md +127 -0
- scitex/scholar/docs/01_STORAGE_ARCHITECTURE.md +118 -0
- scitex/scholar/docs/02_PIPELINE_ORGANIZATION.md +251 -0
- scitex/scholar/docs/03_PIPELINE_IMPLEMENTATION_PLAN.md +425 -0
- scitex/scholar/docs/04_PIPELINE_IMPLEMENTATION_COMPLETE.md +324 -0
- scitex/scholar/docs/05_DETAILS_FOR_DEVELOPERS.md +464 -0
- scitex/scholar/docs/06_IMPACT_FACTOR_CITATION_INTEGRATION.md +295 -0
- scitex/scholar/docs/bibfile.bib +66 -0
- scitex/scholar/docs/from_agents/OPENATHENS_SECURITY.md +101 -0
- scitex/scholar/docs/from_agents/feature-requests/scholar-openathens-authentication.md +107 -0
- scitex/scholar/docs/from_agents/scholar_enhancements_summary.md +66 -0
- scitex/scholar/docs/from_user/crawl4ai.md +15 -0
- scitex/scholar/docs/from_user/medium_article_on_logined_page_for_zenrows_1.md +363 -0
- scitex/scholar/docs/from_user/medium_article_on_logined_page_for_zenrows_2.md +1496 -0
- scitex/scholar/docs/from_user/papers.bib +698 -0
- scitex/scholar/docs/from_user/renamed-async_functions.md +970 -0
- scitex/scholar/docs/from_user/suggestions.md +97 -0
- scitex/scholar/docs/sample_data/PAYWALLED.json +84 -0
- scitex/scholar/docs/sample_data/openaccess.json +74 -0
- scitex/scholar/docs/sample_data/papers-enriched.bib +675 -0
- scitex/scholar/docs/sample_data/papers-partial-enriched.bib +601 -0
- scitex/scholar/docs/sample_data/test_papers.bib +18 -0
- scitex/scholar/docs/template.py +123 -0
- scitex/scholar/docs/zenrows_official/FAQ.md +434 -0
- scitex/scholar/docs/zenrows_official/captcha_integration.md +698 -0
- scitex/scholar/docs/zenrows_official/final_url.md +204 -0
- scitex/scholar/docs/zenrows_official/with_playwright.md +336 -0
- scitex/scholar/examples/00_config.py +270 -0
- scitex/scholar/examples/01_auth.py +165 -0
- scitex/scholar/examples/02_browser.py +169 -0
- scitex/scholar/examples/03_01-engine.py +486 -0
- scitex/scholar/examples/03_02-engine-for-bibtex.py +166 -0
- scitex/scholar/examples/04_01-url.py +218 -0
- scitex/scholar/examples/04_02-url-for-bibtex.py +283 -0
- scitex/scholar/examples/04_02-url-for-dois.py +305 -0
- scitex/scholar/examples/05_download_pdf.py +225 -0
- scitex/scholar/examples/06_find_and_download.py +217 -0
- scitex/scholar/examples/06_parse_bibtex.py +256 -0
- scitex/scholar/examples/07_storage_integration.py +253 -0
- scitex/scholar/examples/99_fullpipeline-for-bibtex.py +349 -0
- scitex/scholar/examples/99_fullpipeline-for-one-entry.py +293 -0
- scitex/scholar/examples/99_maintenance.py +134 -0
- scitex/scholar/examples/README.md +18 -0
- scitex/scholar/examples/SUGGESTIONS.md +865 -0
- scitex/scholar/examples/dev.py +38 -0
- scitex/scholar/examples/zotero_integration.py +246 -0
- scitex/scholar/impact_factor/ImpactFactorEngine.py +202 -0
- scitex/scholar/impact_factor/README.md +135 -0
- scitex/scholar/impact_factor/__init__.py +20 -0
- scitex/scholar/impact_factor/estimation/ImpactFactorEstimationEngine.py +0 -0
- scitex/scholar/impact_factor/estimation/README.md +351 -0
- scitex/scholar/impact_factor/estimation/__init__.py +39 -0
- scitex/scholar/impact_factor/estimation/build_database.py +0 -0
- scitex/scholar/impact_factor/estimation/core/__init__.py +24 -0
- scitex/scholar/impact_factor/estimation/core/cache_manager.py +535 -0
- scitex/scholar/impact_factor/estimation/core/calculator.py +389 -0
- scitex/scholar/impact_factor/estimation/core/journal_matcher.py +473 -0
- scitex/scholar/impact_factor/jcr/ImpactFactorJCREngine.py +323 -0
- scitex/scholar/impact_factor/jcr/README.md +84 -0
- scitex/scholar/impact_factor/jcr/TODO.md +28 -0
- scitex/scholar/impact_factor/jcr/build_database.py +268 -0
- scitex/scholar/integration/MIGRATION.md +138 -0
- scitex/scholar/integration/README.md +464 -0
- scitex/scholar/integration/__init__.py +59 -0
- scitex/scholar/integration/base.py +494 -0
- scitex/scholar/integration/mendeley/__init__.py +22 -0
- scitex/scholar/integration/mendeley/exporter.py +164 -0
- scitex/scholar/integration/mendeley/importer.py +236 -0
- scitex/scholar/integration/mendeley/linker.py +79 -0
- scitex/scholar/integration/mendeley/mapper.py +215 -0
- scitex/scholar/integration/zotero/README.md +336 -0
- scitex/scholar/integration/zotero/__init__.py +27 -0
- scitex/scholar/integration/zotero/__main__.py +247 -0
- scitex/scholar/integration/zotero/exporter.py +355 -0
- scitex/scholar/integration/zotero/importer.py +374 -0
- scitex/scholar/integration/zotero/linker.py +411 -0
- scitex/scholar/integration/zotero/mapper.py +291 -0
- scitex/scholar/jobs/_Job.py +266 -0
- scitex/scholar/jobs/_JobManager.py +458 -0
- scitex/scholar/jobs/__init__.py +74 -0
- scitex/scholar/jobs/_errors.py +346 -0
- scitex/scholar/jobs/_executors.py +373 -0
- scitex/scholar/mcp_server.py +365 -0
- scitex/scholar/metadata_engines/.combined-SemanticScholarSource/_SemanticScholarSource.py +289 -0
- scitex/scholar/metadata_engines/.combined-SemanticScholarSource/_SemanticScholarSourceEnhanced.py +229 -0
- scitex/scholar/metadata_engines/README.md +42 -0
- scitex/scholar/metadata_engines/ScholarEngine.py +574 -0
- scitex/scholar/metadata_engines/__init__.py +28 -0
- scitex/scholar/metadata_engines/individual/ArXivEngine.py +378 -0
- scitex/scholar/metadata_engines/individual/CrossRefEngine.py +264 -0
- scitex/scholar/metadata_engines/individual/CrossRefLocalEngine.py +303 -0
- scitex/scholar/metadata_engines/individual/OpenAlexEngine.py +340 -0
- scitex/scholar/metadata_engines/individual/PubMedEngine.py +312 -0
- scitex/scholar/metadata_engines/individual/SemanticScholarEngine.py +431 -0
- scitex/scholar/metadata_engines/individual/URLDOIEngine.py +399 -0
- scitex/scholar/metadata_engines/individual/_BaseDOIEngine.py +458 -0
- scitex/scholar/metadata_engines/individual/__init__.py +7 -0
- scitex/scholar/metadata_engines/utils/_PubMedConverter.py +465 -0
- scitex/scholar/metadata_engines/utils/_URLDOIExtractor.py +296 -0
- scitex/scholar/metadata_engines/utils/__init__.py +32 -0
- scitex/scholar/metadata_engines/utils/_metadata2bibtex.py +106 -0
- scitex/scholar/metadata_engines/utils/_standardize_metadata.py +375 -0
- scitex/scholar/pdf_download/README.md +51 -0
- scitex/scholar/pdf_download/ScholarPDFDownloader.py +704 -0
- scitex/scholar/pdf_download/__init__.py +5 -0
- scitex/scholar/pdf_download/strategies/__init__.py +44 -0
- scitex/scholar/pdf_download/strategies/chrome_pdf_viewer.py +368 -0
- scitex/scholar/pdf_download/strategies/direct_download.py +127 -0
- scitex/scholar/pdf_download/strategies/manual_download_fallback.py +167 -0
- scitex/scholar/pdf_download/strategies/manual_download_utils.py +989 -0
- scitex/scholar/pdf_download/strategies/open_access_download.py +204 -0
- scitex/scholar/pdf_download/strategies/response_body.py +196 -0
- scitex/scholar/pipelines/README.md +172 -0
- scitex/scholar/pipelines/ScholarPipelineBibTeX.py +355 -0
- scitex/scholar/pipelines/ScholarPipelineMetadataParallel.py +269 -0
- scitex/scholar/pipelines/ScholarPipelineMetadataSingle.py +403 -0
- scitex/scholar/pipelines/ScholarPipelineParallel.py +462 -0
- scitex/scholar/pipelines/ScholarPipelineSearchParallel.py +873 -0
- scitex/scholar/pipelines/ScholarPipelineSearchSingle.py +553 -0
- scitex/scholar/pipelines/ScholarPipelineSingle.py +188 -0
- scitex/scholar/pipelines/SearchQueryParser.py +270 -0
- scitex/scholar/pipelines/__init__.py +55 -0
- scitex/scholar/pipelines/_single_steps.py +430 -0
- scitex/scholar/search_engines/ScholarSearchEngine.py +278 -0
- scitex/scholar/search_engines/_BaseSearchEngine.py +180 -0
- scitex/scholar/search_engines/__init__.py +0 -0
- scitex/scholar/search_engines/individual/ArXivSearchEngine.py +194 -0
- scitex/scholar/search_engines/individual/CrossRefSearchEngine.py +159 -0
- scitex/scholar/search_engines/individual/OpenAlexSearchEngine.py +177 -0
- scitex/scholar/search_engines/individual/PubMedSearchEngine.py +186 -0
- scitex/scholar/search_engines/individual/SemanticScholarSearchEngine.py +158 -0
- scitex/scholar/search_engines/individual/__init__.py +0 -0
- scitex/scholar/storage/BibTeXHandler.py +1151 -0
- scitex/scholar/storage/PaperIO.py +500 -0
- scitex/scholar/storage/ScholarLibrary.py +203 -0
- scitex/scholar/storage/_BibTeXValidator.py +571 -0
- scitex/scholar/storage/_DeduplicationManager.py +575 -0
- scitex/scholar/storage/_LibraryCacheManager.py +697 -0
- scitex/scholar/storage/_LibraryManager.py +1725 -0
- scitex/scholar/storage/__init__.py +42 -0
- scitex/scholar/url_finder/README.md +74 -0
- scitex/scholar/url_finder/ScholarURLFinder.py +364 -0
- scitex/scholar/url_finder/__init__.py +7 -0
- scitex/scholar/url_finder/docs/CORE_URL_TYPES.md +187 -0
- scitex/scholar/url_finder/docs/URL_SCHEMA.md +223 -0
- scitex/scholar/url_finder/strategies/__init__.py +33 -0
- scitex/scholar/url_finder/strategies/find_pdf_urls_by_direct_links.py +261 -0
- scitex/scholar/url_finder/strategies/find_pdf_urls_by_dropdown.py +67 -0
- scitex/scholar/url_finder/strategies/find_pdf_urls_by_href.py +200 -0
- scitex/scholar/url_finder/strategies/find_pdf_urls_by_navigation.py +254 -0
- scitex/scholar/url_finder/strategies/find_pdf_urls_by_publisher_patterns.py +158 -0
- scitex/scholar/url_finder/strategies/find_pdf_urls_by_zotero_translators.py +164 -0
- scitex/scholar/url_finder/strategies/find_supplementary_urls_by_href.py +70 -0
- scitex/scholar/url_finder/translators/__init__.py +31 -0
- scitex/scholar/url_finder/translators/core/__init__.py +6 -0
- scitex/scholar/url_finder/translators/core/base.py +50 -0
- scitex/scholar/url_finder/translators/core/patterns.py +40 -0
- scitex/scholar/url_finder/translators/core/registry.py +372 -0
- scitex/scholar/url_finder/translators/individual/BOFiP_Impots.py +200 -0
- scitex/scholar/url_finder/translators/individual/Baidu_Scholar.py +229 -0
- scitex/scholar/url_finder/translators/individual/Bangkok_Post.py +152 -0
- scitex/scholar/url_finder/translators/individual/Baruch_Foundation.py +197 -0
- scitex/scholar/url_finder/translators/individual/Beobachter.py +193 -0
- scitex/scholar/url_finder/translators/individual/Bezneng_Gajit.py +203 -0
- scitex/scholar/url_finder/translators/individual/BibLaTeX.py +329 -0
- scitex/scholar/url_finder/translators/individual/BibTeX.py +330 -0
- scitex/scholar/url_finder/translators/individual/Biblio_com.py +222 -0
- scitex/scholar/url_finder/translators/individual/Bibliontology_RDF.py +228 -0
- scitex/scholar/url_finder/translators/individual/CLASE.py +23 -0
- scitex/scholar/url_finder/translators/individual/COBISS.py +23 -0
- scitex/scholar/url_finder/translators/individual/COinS.py +23 -0
- scitex/scholar/url_finder/translators/individual/CQ_Press.py +23 -0
- scitex/scholar/url_finder/translators/individual/CROSBI.py +23 -0
- scitex/scholar/url_finder/translators/individual/CSL_JSON.py +23 -0
- scitex/scholar/url_finder/translators/individual/CSV.py +23 -0
- scitex/scholar/url_finder/translators/individual/CalMatters.py +23 -0
- scitex/scholar/url_finder/translators/individual/Calisphere.py +23 -0
- scitex/scholar/url_finder/translators/individual/Camara_Brasileira_do_Livro_ISBN.py +290 -0
- scitex/scholar/url_finder/translators/individual/CanLII.py +151 -0
- scitex/scholar/url_finder/translators/individual/Canada_com.py +127 -0
- scitex/scholar/url_finder/translators/individual/Canadian_Letters_and_Images.py +152 -0
- scitex/scholar/url_finder/translators/individual/Canadiana_ca.py +195 -0
- scitex/scholar/url_finder/translators/individual/Cascadilla_Proceedings_Project.py +202 -0
- scitex/scholar/url_finder/translators/individual/Central_and_Eastern_European_Online_Library_Journals.py +172 -0
- scitex/scholar/url_finder/translators/individual/Champlain_Society_Collection.py +172 -0
- scitex/scholar/url_finder/translators/individual/Chicago_Journal_of_Theoretical_Computer_Science.py +210 -0
- scitex/scholar/url_finder/translators/individual/Christian_Science_Monitor.py +118 -0
- scitex/scholar/url_finder/translators/individual/Columbia_University_Press.py +232 -0
- scitex/scholar/url_finder/translators/individual/Common_Place.py +245 -0
- scitex/scholar/url_finder/translators/individual/Cornell_LII.py +165 -0
- scitex/scholar/url_finder/translators/individual/Cornell_University_Press.py +156 -0
- scitex/scholar/url_finder/translators/individual/CourtListener.py +173 -0
- scitex/scholar/url_finder/translators/individual/DAI_Zenon.py +160 -0
- scitex/scholar/url_finder/translators/individual/FULL_DOWNLOAD_LOG.txt +2 -0
- scitex/scholar/url_finder/translators/individual/FULL_DOWNLOAD_LOG_colored.txt +1 -0
- scitex/scholar/url_finder/translators/individual/__init__.py +25 -0
- scitex/scholar/url_finder/translators/individual/abc_news_australia.py +23 -0
- scitex/scholar/url_finder/translators/individual/access_engineering.py +23 -0
- scitex/scholar/url_finder/translators/individual/access_medicine.py +255 -0
- scitex/scholar/url_finder/translators/individual/access_science.py +23 -0
- scitex/scholar/url_finder/translators/individual/acls_humanities_ebook.py +23 -0
- scitex/scholar/url_finder/translators/individual/aclweb.py +23 -0
- scitex/scholar/url_finder/translators/individual/acm.py +11 -0
- scitex/scholar/url_finder/translators/individual/acm_digital_library.py +355 -0
- scitex/scholar/url_finder/translators/individual/acs.py +23 -0
- scitex/scholar/url_finder/translators/individual/acs_publications.py +23 -0
- scitex/scholar/url_finder/translators/individual/adam_matthew_digital.py +23 -0
- scitex/scholar/url_finder/translators/individual/ads_bibcode.py +23 -0
- scitex/scholar/url_finder/translators/individual/aea_web.py +23 -0
- scitex/scholar/url_finder/translators/individual/agencia_del_isbn.py +23 -0
- scitex/scholar/url_finder/translators/individual/agris.py +23 -0
- scitex/scholar/url_finder/translators/individual/ahval_news.py +23 -0
- scitex/scholar/url_finder/translators/individual/aip.py +23 -0
- scitex/scholar/url_finder/translators/individual/air_university_journals.py +23 -0
- scitex/scholar/url_finder/translators/individual/airiti.py +25 -0
- scitex/scholar/url_finder/translators/individual/alexander_street_press.py +23 -0
- scitex/scholar/url_finder/translators/individual/all_africa.py +23 -0
- scitex/scholar/url_finder/translators/individual/allafrica.py +23 -0
- scitex/scholar/url_finder/translators/individual/alsharekh.py +23 -0
- scitex/scholar/url_finder/translators/individual/alternet.py +23 -0
- scitex/scholar/url_finder/translators/individual/aluka.py +23 -0
- scitex/scholar/url_finder/translators/individual/amazon.py +23 -0
- scitex/scholar/url_finder/translators/individual/american_archive_public_broadcasting.py +23 -0
- scitex/scholar/url_finder/translators/individual/american_institute_aeronautics_astronautics.py +23 -0
- scitex/scholar/url_finder/translators/individual/american_prospect.py +23 -0
- scitex/scholar/url_finder/translators/individual/ams_journals.py +23 -0
- scitex/scholar/url_finder/translators/individual/ams_mathscinet.py +23 -0
- scitex/scholar/url_finder/translators/individual/ams_mathscinet_legacy.py +23 -0
- scitex/scholar/url_finder/translators/individual/ancestry_com_us_federal_census.py +23 -0
- scitex/scholar/url_finder/translators/individual/ancestry_us_federal_census.py +23 -0
- scitex/scholar/url_finder/translators/individual/annual_reviews.py +23 -0
- scitex/scholar/url_finder/translators/individual/antikvarium_hu.py +23 -0
- scitex/scholar/url_finder/translators/individual/aosic.py +25 -0
- scitex/scholar/url_finder/translators/individual/apa_psycnet.py +23 -0
- scitex/scholar/url_finder/translators/individual/apn_ru.py +23 -0
- scitex/scholar/url_finder/translators/individual/aps.py +23 -0
- scitex/scholar/url_finder/translators/individual/aps_physics.py +115 -0
- scitex/scholar/url_finder/translators/individual/aquadocs.py +23 -0
- scitex/scholar/url_finder/translators/individual/archeion.py +23 -0
- scitex/scholar/url_finder/translators/individual/archiv_fuer_sozialgeschichte.py +23 -0
- scitex/scholar/url_finder/translators/individual/archive_ouverte_aosic.py +25 -0
- scitex/scholar/url_finder/translators/individual/archive_ouverte_en_sciences_de_l_information_et_de_la_communication___aosic_.py +27 -0
- scitex/scholar/url_finder/translators/individual/archives_canada.py +23 -0
- scitex/scholar/url_finder/translators/individual/ariana_news.py +23 -0
- scitex/scholar/url_finder/translators/individual/art_institute_of_chicago.py +23 -0
- scitex/scholar/url_finder/translators/individual/artefacts_canada.py +23 -0
- scitex/scholar/url_finder/translators/individual/artfl_encyclopedie.py +23 -0
- scitex/scholar/url_finder/translators/individual/artforum.py +152 -0
- scitex/scholar/url_finder/translators/individual/artnet.py +23 -0
- scitex/scholar/url_finder/translators/individual/artnews.py +23 -0
- scitex/scholar/url_finder/translators/individual/artstor.py +23 -0
- scitex/scholar/url_finder/translators/individual/arxiv.py +39 -0
- scitex/scholar/url_finder/translators/individual/arxiv_org.py +122 -0
- scitex/scholar/url_finder/translators/individual/asce.py +23 -0
- scitex/scholar/url_finder/translators/individual/asco_meeting_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/astis.py +23 -0
- scitex/scholar/url_finder/translators/individual/atlanta_journal_constitution.py +145 -0
- scitex/scholar/url_finder/translators/individual/ats_international_journal.py +23 -0
- scitex/scholar/url_finder/translators/individual/atypon.py +23 -0
- scitex/scholar/url_finder/translators/individual/atypon_journals.py +253 -0
- scitex/scholar/url_finder/translators/individual/austlii_and_nzlii.py +251 -0
- scitex/scholar/url_finder/translators/individual/australian_dictionary_of_biography.py +124 -0
- scitex/scholar/url_finder/translators/individual/bailii.py +122 -0
- scitex/scholar/url_finder/translators/individual/bbc.py +229 -0
- scitex/scholar/url_finder/translators/individual/bbc_genome.py +180 -0
- scitex/scholar/url_finder/translators/individual/beck_online.py +23 -0
- scitex/scholar/url_finder/translators/individual/biblioteca_nacional_de_maestros.py +144 -0
- scitex/scholar/url_finder/translators/individual/bibliotheque_archives_nationale_quebec_pistard.py +134 -0
- scitex/scholar/url_finder/translators/individual/bibliotheque_archives_nationales_quebec.py +187 -0
- scitex/scholar/url_finder/translators/individual/bibliotheque_et_archives_nationale_du_quebec__pistard_.py +23 -0
- scitex/scholar/url_finder/translators/individual/bibliotheque_et_archives_nationales_du_quebec.py +23 -0
- scitex/scholar/url_finder/translators/individual/bibliotheque_nationale_de_france.py +23 -0
- scitex/scholar/url_finder/translators/individual/bibliotheque_nationale_france.py +163 -0
- scitex/scholar/url_finder/translators/individual/bibsys.py +162 -0
- scitex/scholar/url_finder/translators/individual/bioconductor.py +184 -0
- scitex/scholar/url_finder/translators/individual/biomed_central.py +151 -0
- scitex/scholar/url_finder/translators/individual/bioone.py +23 -0
- scitex/scholar/url_finder/translators/individual/biorxiv.py +146 -0
- scitex/scholar/url_finder/translators/individual/blaetter.py +23 -0
- scitex/scholar/url_finder/translators/individual/blogger.py +148 -0
- scitex/scholar/url_finder/translators/individual/bloomberg.py +168 -0
- scitex/scholar/url_finder/translators/individual/bloomsbury_food_library.py +135 -0
- scitex/scholar/url_finder/translators/individual/bluesky.py +129 -0
- scitex/scholar/url_finder/translators/individual/bnf_isbn.py +118 -0
- scitex/scholar/url_finder/translators/individual/bocc.py +247 -0
- scitex/scholar/url_finder/translators/individual/boe.py +196 -0
- scitex/scholar/url_finder/translators/individual/bookmarks.py +23 -0
- scitex/scholar/url_finder/translators/individual/bookshop_org.py +23 -0
- scitex/scholar/url_finder/translators/individual/boston_review.py +23 -0
- scitex/scholar/url_finder/translators/individual/bosworth_toller_anglo_saxon_dictionary.py +23 -0
- scitex/scholar/url_finder/translators/individual/bosworth_toller_s_anglo_saxon_dictionary_online.py +23 -0
- scitex/scholar/url_finder/translators/individual/bracero_history_archive.py +23 -0
- scitex/scholar/url_finder/translators/individual/brill.py +25 -0
- scitex/scholar/url_finder/translators/individual/brukerhandboken.py +23 -0
- scitex/scholar/url_finder/translators/individual/bryn_mawr_classical_review.py +23 -0
- scitex/scholar/url_finder/translators/individual/bundesgesetzblatt.py +23 -0
- scitex/scholar/url_finder/translators/individual/business_standard.py +178 -0
- scitex/scholar/url_finder/translators/individual/cabi___cab_abstracts.py +23 -0
- scitex/scholar/url_finder/translators/individual/cabi_cab_abstracts.py +176 -0
- scitex/scholar/url_finder/translators/individual/cairn.py +23 -0
- scitex/scholar/url_finder/translators/individual/cairn_info.py +23 -0
- scitex/scholar/url_finder/translators/individual/cambridge.py +25 -0
- scitex/scholar/url_finder/translators/individual/cambridge_core.py +178 -0
- scitex/scholar/url_finder/translators/individual/cambridge_engage_preprints.py +23 -0
- scitex/scholar/url_finder/translators/individual/caod.py +224 -0
- scitex/scholar/url_finder/translators/individual/cbc.py +281 -0
- scitex/scholar/url_finder/translators/individual/ccfr__bnf_.py +23 -0
- scitex/scholar/url_finder/translators/individual/ccfr_bnf.py +223 -0
- scitex/scholar/url_finder/translators/individual/cell_press.py +150 -0
- scitex/scholar/url_finder/translators/individual/cern_document_server.py +23 -0
- scitex/scholar/url_finder/translators/individual/ceur_workshop_proceedings.py +23 -0
- scitex/scholar/url_finder/translators/individual/cff.py +23 -0
- scitex/scholar/url_finder/translators/individual/cff_references.py +23 -0
- scitex/scholar/url_finder/translators/individual/champlain_society___collection.py +23 -0
- scitex/scholar/url_finder/translators/individual/chronicling_america.py +23 -0
- scitex/scholar/url_finder/translators/individual/cia_world_factbook.py +163 -0
- scitex/scholar/url_finder/translators/individual/cinii_research.py +23 -0
- scitex/scholar/url_finder/translators/individual/citavi_5_xml.py +23 -0
- scitex/scholar/url_finder/translators/individual/citeseer.py +23 -0
- scitex/scholar/url_finder/translators/individual/citizen_lab.py +23 -0
- scitex/scholar/url_finder/translators/individual/civilization_ca.py +23 -0
- scitex/scholar/url_finder/translators/individual/clacso.py +23 -0
- scitex/scholar/url_finder/translators/individual/climate_change_and_human_health_literature_portal.py +23 -0
- scitex/scholar/url_finder/translators/individual/climate_change_human_health.py +23 -0
- scitex/scholar/url_finder/translators/individual/clinical_key.py +23 -0
- scitex/scholar/url_finder/translators/individual/clinicaltrials_gov.py +23 -0
- scitex/scholar/url_finder/translators/individual/cnki.py +23 -0
- scitex/scholar/url_finder/translators/individual/code4lib_journal.py +23 -0
- scitex/scholar/url_finder/translators/individual/colorado_state_legislature.py +23 -0
- scitex/scholar/url_finder/translators/individual/computer_history_museum_archive.py +23 -0
- scitex/scholar/url_finder/translators/individual/copernicus.py +23 -0
- scitex/scholar/url_finder/translators/individual/crossref_rest.py +618 -0
- scitex/scholar/url_finder/translators/individual/crossref_unixref_xml.py +23 -0
- scitex/scholar/url_finder/translators/individual/csiro_publishing.py +23 -0
- scitex/scholar/url_finder/translators/individual/csiro_publishing_dup.py +23 -0
- scitex/scholar/url_finder/translators/individual/current_affairs.py +112 -0
- scitex/scholar/url_finder/translators/individual/dabi.py +205 -0
- scitex/scholar/url_finder/translators/individual/dagens_nyheter.py +25 -0
- scitex/scholar/url_finder/translators/individual/dagstuhl.py +147 -0
- scitex/scholar/url_finder/translators/individual/dagstuhl_research_online_publication_server.py +23 -0
- scitex/scholar/url_finder/translators/individual/dar_almandumah.py +160 -0
- scitex/scholar/url_finder/translators/individual/dart_europe.py +172 -0
- scitex/scholar/url_finder/translators/individual/data_gov.py +145 -0
- scitex/scholar/url_finder/translators/individual/databrary.py +144 -0
- scitex/scholar/url_finder/translators/individual/datacite_json.py +278 -0
- scitex/scholar/url_finder/translators/individual/dataverse.py +190 -0
- scitex/scholar/url_finder/translators/individual/daum_news.py +133 -0
- scitex/scholar/url_finder/translators/individual/dblp.py +151 -0
- scitex/scholar/url_finder/translators/individual/dblp_computer_science_bibliography.py +104 -0
- scitex/scholar/url_finder/translators/individual/dbpia.py +102 -0
- scitex/scholar/url_finder/translators/individual/de_gruyter_brill.py +23 -0
- scitex/scholar/url_finder/translators/individual/defense_technical_information_center.py +114 -0
- scitex/scholar/url_finder/translators/individual/dejure_org.py +23 -0
- scitex/scholar/url_finder/translators/individual/delpher.py +224 -0
- scitex/scholar/url_finder/translators/individual/demographic_research.py +160 -0
- scitex/scholar/url_finder/translators/individual/denik_cz.py +155 -0
- scitex/scholar/url_finder/translators/individual/depatisnet.py +145 -0
- scitex/scholar/url_finder/translators/individual/der_freitag.py +214 -0
- scitex/scholar/url_finder/translators/individual/der_spiegel.py +189 -0
- scitex/scholar/url_finder/translators/individual/desiring_god.py +23 -0
- scitex/scholar/url_finder/translators/individual/deutsche_fotothek.py +23 -0
- scitex/scholar/url_finder/translators/individual/deutsche_nationalbibliothek.py +23 -0
- scitex/scholar/url_finder/translators/individual/dhistory.py +23 -0
- scitex/scholar/url_finder/translators/individual/dialnet.py +23 -0
- scitex/scholar/url_finder/translators/individual/die_zeit.py +23 -0
- scitex/scholar/url_finder/translators/individual/digibib_net.py +25 -0
- scitex/scholar/url_finder/translators/individual/digital_humanities_quarterly.py +23 -0
- scitex/scholar/url_finder/translators/individual/digital_spy.py +23 -0
- scitex/scholar/url_finder/translators/individual/digizeitschriften.py +25 -0
- scitex/scholar/url_finder/translators/individual/dimensions.py +23 -0
- scitex/scholar/url_finder/translators/individual/dlibra.py +23 -0
- scitex/scholar/url_finder/translators/individual/doaj.py +23 -0
- scitex/scholar/url_finder/translators/individual/doi.py +23 -0
- scitex/scholar/url_finder/translators/individual/doi_content_negotiation.py +23 -0
- scitex/scholar/url_finder/translators/individual/douban.py +23 -0
- scitex/scholar/url_finder/translators/individual/dpla.py +200 -0
- scitex/scholar/url_finder/translators/individual/dreier_neuerscheinungsdienst.py +23 -0
- scitex/scholar/url_finder/translators/individual/drugbank_ca.py +23 -0
- scitex/scholar/url_finder/translators/individual/dryad.py +23 -0
- scitex/scholar/url_finder/translators/individual/dryad_digital_repository.py +23 -0
- scitex/scholar/url_finder/translators/individual/dspace.py +109 -0
- scitex/scholar/url_finder/translators/individual/dspace_intermediate_metadata.py +23 -0
- scitex/scholar/url_finder/translators/individual/duke_university_press_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/e_periodica_switzerland.py +23 -0
- scitex/scholar/url_finder/translators/individual/eastview.py +23 -0
- scitex/scholar/url_finder/translators/individual/ebrary.py +25 -0
- scitex/scholar/url_finder/translators/individual/ebsco_discovery_layer.py +23 -0
- scitex/scholar/url_finder/translators/individual/ebscohost.py +25 -0
- scitex/scholar/url_finder/translators/individual/edinburgh_university_press_journals.py +23 -0
- scitex/scholar/url_finder/translators/individual/education_week.py +23 -0
- scitex/scholar/url_finder/translators/individual/eidr.py +23 -0
- scitex/scholar/url_finder/translators/individual/el_comercio__peru_.py +23 -0
- scitex/scholar/url_finder/translators/individual/el_pais.py +23 -0
- scitex/scholar/url_finder/translators/individual/electronic_colloquium_on_computational_complexity.py +25 -0
- scitex/scholar/url_finder/translators/individual/elibrary_ru.py +23 -0
- scitex/scholar/url_finder/translators/individual/elicit.py +23 -0
- scitex/scholar/url_finder/translators/individual/elife.py +25 -0
- scitex/scholar/url_finder/translators/individual/elsevier_health.py +23 -0
- scitex/scholar/url_finder/translators/individual/elsevier_health_journals.py +25 -0
- scitex/scholar/url_finder/translators/individual/elsevier_pure.py +23 -0
- scitex/scholar/url_finder/translators/individual/embedded_metadata.py +23 -0
- scitex/scholar/url_finder/translators/individual/emedicine.py +23 -0
- scitex/scholar/url_finder/translators/individual/emerald.py +25 -0
- scitex/scholar/url_finder/translators/individual/emerald_insight.py +25 -0
- scitex/scholar/url_finder/translators/individual/emja.py +23 -0
- scitex/scholar/url_finder/translators/individual/encyclopedia_of_chicago.py +23 -0
- scitex/scholar/url_finder/translators/individual/encyclopedia_of_korean_culture.py +23 -0
- scitex/scholar/url_finder/translators/individual/endnote_xml.py +23 -0
- scitex/scholar/url_finder/translators/individual/engineering_village.py +23 -0
- scitex/scholar/url_finder/translators/individual/envidat.py +23 -0
- scitex/scholar/url_finder/translators/individual/epa_national_library_catalog.py +23 -0
- scitex/scholar/url_finder/translators/individual/epicurious.py +25 -0
- scitex/scholar/url_finder/translators/individual/eprint_iacr.py +23 -0
- scitex/scholar/url_finder/translators/individual/eric.py +23 -0
- scitex/scholar/url_finder/translators/individual/erudit.py +23 -0
- scitex/scholar/url_finder/translators/individual/espacenet.py +23 -0
- scitex/scholar/url_finder/translators/individual/etatar_ru.py +23 -0
- scitex/scholar/url_finder/translators/individual/euclid.py +23 -0
- scitex/scholar/url_finder/translators/individual/eur_lex.py +23 -0
- scitex/scholar/url_finder/translators/individual/eurasianet.py +23 -0
- scitex/scholar/url_finder/translators/individual/eurogamerusgamer.py +25 -0
- scitex/scholar/url_finder/translators/individual/europe_pmc.py +23 -0
- scitex/scholar/url_finder/translators/individual/evernote.py +23 -0
- scitex/scholar/url_finder/translators/individual/f1000_research.py +23 -0
- scitex/scholar/url_finder/translators/individual/fachportal_padagogik.py +25 -0
- scitex/scholar/url_finder/translators/individual/factiva.py +23 -0
- scitex/scholar/url_finder/translators/individual/failed_architecture.py +23 -0
- scitex/scholar/url_finder/translators/individual/fairfax_australia.py +23 -0
- scitex/scholar/url_finder/translators/individual/fao_publications.py +23 -0
- scitex/scholar/url_finder/translators/individual/fatcat.py +23 -0
- scitex/scholar/url_finder/translators/individual/faz_net.py +23 -0
- scitex/scholar/url_finder/translators/individual/feb_web_ru.py +23 -0
- scitex/scholar/url_finder/translators/individual/figshare.py +23 -0
- scitex/scholar/url_finder/translators/individual/financial_times.py +23 -0
- scitex/scholar/url_finder/translators/individual/finna.py +23 -0
- scitex/scholar/url_finder/translators/individual/fishpond_co_nz.py +23 -0
- scitex/scholar/url_finder/translators/individual/flickr.py +23 -0
- scitex/scholar/url_finder/translators/individual/foreign_affairs.py +23 -0
- scitex/scholar/url_finder/translators/individual/foreign_policy.py +23 -0
- scitex/scholar/url_finder/translators/individual/fr_online_de.py +23 -0
- scitex/scholar/url_finder/translators/individual/freecite.py +23 -0
- scitex/scholar/url_finder/translators/individual/freepatentsonline.py +23 -0
- scitex/scholar/url_finder/translators/individual/frieze.py +23 -0
- scitex/scholar/url_finder/translators/individual/frontiers.py +84 -0
- scitex/scholar/url_finder/translators/individual/gale_databases.py +25 -0
- scitex/scholar/url_finder/translators/individual/galegdc.py +23 -0
- scitex/scholar/url_finder/translators/individual/galegroup.py +23 -0
- scitex/scholar/url_finder/translators/individual/gallica.py +23 -0
- scitex/scholar/url_finder/translators/individual/game_studies.py +23 -0
- scitex/scholar/url_finder/translators/individual/gamespot.py +23 -0
- scitex/scholar/url_finder/translators/individual/gamestar_gamepro.py +23 -0
- scitex/scholar/url_finder/translators/individual/gasyrlar_awazy.py +23 -0
- scitex/scholar/url_finder/translators/individual/gemeinsamer_bibliotheksverbund_isbn.py +23 -0
- scitex/scholar/url_finder/translators/individual/gene_ontology.py +23 -0
- scitex/scholar/url_finder/translators/individual/github.py +23 -0
- scitex/scholar/url_finder/translators/individual/globes.py +23 -0
- scitex/scholar/url_finder/translators/individual/gmail.py +23 -0
- scitex/scholar/url_finder/translators/individual/gms_german_medical_science.py +111 -0
- scitex/scholar/url_finder/translators/individual/goodreads.py +23 -0
- scitex/scholar/url_finder/translators/individual/google_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/google_patents.py +23 -0
- scitex/scholar/url_finder/translators/individual/google_play.py +23 -0
- scitex/scholar/url_finder/translators/individual/google_presentation.py +23 -0
- scitex/scholar/url_finder/translators/individual/google_research.py +23 -0
- scitex/scholar/url_finder/translators/individual/google_scholar.py +23 -0
- scitex/scholar/url_finder/translators/individual/govinfo.py +23 -0
- scitex/scholar/url_finder/translators/individual/gpo_access_e_cfr.py +23 -0
- scitex/scholar/url_finder/translators/individual/gulag_many_days__many_lives.py +23 -0
- scitex/scholar/url_finder/translators/individual/haaretz.py +23 -0
- scitex/scholar/url_finder/translators/individual/habr.py +23 -0
- scitex/scholar/url_finder/translators/individual/hal.py +23 -0
- scitex/scholar/url_finder/translators/individual/hal_archives_ouvertes.py +23 -0
- scitex/scholar/url_finder/translators/individual/handelszeitung.py +23 -0
- scitex/scholar/url_finder/translators/individual/hanrei_watch.py +23 -0
- scitex/scholar/url_finder/translators/individual/harper_s_magazine.py +23 -0
- scitex/scholar/url_finder/translators/individual/harvard_business_review.py +23 -0
- scitex/scholar/url_finder/translators/individual/harvard_caselaw_access_project.py +23 -0
- scitex/scholar/url_finder/translators/individual/harvard_university_press_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/hathitrust.py +23 -0
- scitex/scholar/url_finder/translators/individual/hcsp.py +23 -0
- scitex/scholar/url_finder/translators/individual/heinonline.py +23 -0
- scitex/scholar/url_finder/translators/individual/heise.py +23 -0
- scitex/scholar/url_finder/translators/individual/herder.py +23 -0
- scitex/scholar/url_finder/translators/individual/highbeam.py +23 -0
- scitex/scholar/url_finder/translators/individual/highwire.py +23 -0
- scitex/scholar/url_finder/translators/individual/highwire2.py +23 -0
- scitex/scholar/url_finder/translators/individual/highwire_2_0.py +23 -0
- scitex/scholar/url_finder/translators/individual/hindawi.py +23 -0
- scitex/scholar/url_finder/translators/individual/hindawi_publishers.py +23 -0
- scitex/scholar/url_finder/translators/individual/hispanic_american_periodical_index.py +23 -0
- scitex/scholar/url_finder/translators/individual/homeland_security_digital_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/hudoc.py +23 -0
- scitex/scholar/url_finder/translators/individual/huff_post.py +23 -0
- scitex/scholar/url_finder/translators/individual/human_rights_watch.py +23 -0
- scitex/scholar/url_finder/translators/individual/ibisworld.py +23 -0
- scitex/scholar/url_finder/translators/individual/idea_alm.py +23 -0
- scitex/scholar/url_finder/translators/individual/idref.py +23 -0
- scitex/scholar/url_finder/translators/individual/ieee_computer_society.py +118 -0
- scitex/scholar/url_finder/translators/individual/ieee_xplore.py +265 -0
- scitex/scholar/url_finder/translators/individual/ietf.py +23 -0
- scitex/scholar/url_finder/translators/individual/ign.py +23 -0
- scitex/scholar/url_finder/translators/individual/imdb.py +23 -0
- scitex/scholar/url_finder/translators/individual/imf.py +23 -0
- scitex/scholar/url_finder/translators/individual/in_these_times.py +23 -0
- scitex/scholar/url_finder/translators/individual/informationssystem_medienpaedagogik.py +23 -0
- scitex/scholar/url_finder/translators/individual/informit_database.py +23 -0
- scitex/scholar/url_finder/translators/individual/infotrac.py +23 -0
- scitex/scholar/url_finder/translators/individual/ingenta_connect.py +23 -0
- scitex/scholar/url_finder/translators/individual/ingentaconnect.py +23 -0
- scitex/scholar/url_finder/translators/individual/inside_higher_ed.py +23 -0
- scitex/scholar/url_finder/translators/individual/insignia_opac.py +23 -0
- scitex/scholar/url_finder/translators/individual/inspire.py +23 -0
- scitex/scholar/url_finder/translators/individual/institute_of_contemporary_art.py +23 -0
- scitex/scholar/url_finder/translators/individual/institute_of_physics.py +23 -0
- scitex/scholar/url_finder/translators/individual/integrum.py +23 -0
- scitex/scholar/url_finder/translators/individual/intellixir.py +23 -0
- scitex/scholar/url_finder/translators/individual/inter_research_science_center.py +116 -0
- scitex/scholar/url_finder/translators/individual/international_nuclear_information_system.py +23 -0
- scitex/scholar/url_finder/translators/individual/internet_archive.py +23 -0
- scitex/scholar/url_finder/translators/individual/internet_archive_scholar.py +23 -0
- scitex/scholar/url_finder/translators/individual/internet_archive_wayback.py +23 -0
- scitex/scholar/url_finder/translators/individual/internet_archive_wayback_machine.py +23 -0
- scitex/scholar/url_finder/translators/individual/invenio_rdm.py +23 -0
- scitex/scholar/url_finder/translators/individual/inveniordm.py +23 -0
- scitex/scholar/url_finder/translators/individual/io_port.py +23 -0
- scitex/scholar/url_finder/translators/individual/iop.py +23 -0
- scitex/scholar/url_finder/translators/individual/ipcc.py +23 -0
- scitex/scholar/url_finder/translators/individual/isidore.py +23 -0
- scitex/scholar/url_finder/translators/individual/istc.py +23 -0
- scitex/scholar/url_finder/translators/individual/j_stage.py +23 -0
- scitex/scholar/url_finder/translators/individual/jahrbuch.py +23 -0
- scitex/scholar/url_finder/translators/individual/japan_times_online.py +23 -0
- scitex/scholar/url_finder/translators/individual/jets.py +23 -0
- scitex/scholar/url_finder/translators/individual/jisc_historical_texts.py +25 -0
- scitex/scholar/url_finder/translators/individual/journal_of_electronic_publishing.py +23 -0
- scitex/scholar/url_finder/translators/individual/journal_of_extension.py +23 -0
- scitex/scholar/url_finder/translators/individual/journal_of_machine_learning_research.py +23 -0
- scitex/scholar/url_finder/translators/individual/journal_of_religion_and_society.py +23 -0
- scitex/scholar/url_finder/translators/individual/jrc_publications_repository.py +23 -0
- scitex/scholar/url_finder/translators/individual/jstage.py +23 -0
- scitex/scholar/url_finder/translators/individual/jstor.py +211 -0
- scitex/scholar/url_finder/translators/individual/juricaf.py +23 -0
- scitex/scholar/url_finder/translators/individual/jurion.py +23 -0
- scitex/scholar/url_finder/translators/individual/juris.py +23 -0
- scitex/scholar/url_finder/translators/individual/jurpc.py +23 -0
- scitex/scholar/url_finder/translators/individual/k10plus_isbn.py +23 -0
- scitex/scholar/url_finder/translators/individual/kanopy.py +23 -0
- scitex/scholar/url_finder/translators/individual/karger.py +23 -0
- scitex/scholar/url_finder/translators/individual/khaama_press.py +23 -0
- scitex/scholar/url_finder/translators/individual/kitapyurdu_com.py +23 -0
- scitex/scholar/url_finder/translators/individual/kommersant.py +23 -0
- scitex/scholar/url_finder/translators/individual/korean_national_library.py +25 -0
- scitex/scholar/url_finder/translators/individual/kstudy.py +23 -0
- scitex/scholar/url_finder/translators/individual/l_annee_philologique.py +23 -0
- scitex/scholar/url_finder/translators/individual/la_croix.py +23 -0
- scitex/scholar/url_finder/translators/individual/la_nacion__argentina_.py +23 -0
- scitex/scholar/url_finder/translators/individual/la_presse.py +23 -0
- scitex/scholar/url_finder/translators/individual/la_republica__peru_.py +23 -0
- scitex/scholar/url_finder/translators/individual/la_times.py +25 -0
- scitex/scholar/url_finder/translators/individual/lagen_nu.py +23 -0
- scitex/scholar/url_finder/translators/individual/landesbibliographie_baden_wurttemberg.py +25 -0
- scitex/scholar/url_finder/translators/individual/lapham_s_quarterly.py +23 -0
- scitex/scholar/url_finder/translators/individual/le_devoir.py +23 -0
- scitex/scholar/url_finder/translators/individual/le_figaro.py +23 -0
- scitex/scholar/url_finder/translators/individual/le_maitron.py +23 -0
- scitex/scholar/url_finder/translators/individual/le_monde.py +23 -0
- scitex/scholar/url_finder/translators/individual/le_monde_diplomatique.py +23 -0
- scitex/scholar/url_finder/translators/individual/legifrance.py +23 -0
- scitex/scholar/url_finder/translators/individual/legislative_insight.py +25 -0
- scitex/scholar/url_finder/translators/individual/lexis_.py +23 -0
- scitex/scholar/url_finder/translators/individual/lexisnexis.py +23 -0
- scitex/scholar/url_finder/translators/individual/libraries_tasmania.py +25 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__aleph_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__amicus_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__aquabrowser_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__bibliocommons_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__blacklight_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__capita_prism_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__dra_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__dynix_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__encore_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__innopac_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__koha_.py +25 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__mango_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__opals_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__pica2_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__pica_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__pika_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__polaris_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__quolto_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__rero_ils_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__sirsi_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__sirsi_elibrary_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__slims_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__tind_ils_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__tinread_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__tlcyouseemore_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__visual_library_2021_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__voyager_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__voyager_7_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_catalog__vtls_.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_genesis.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_hub_discover.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_of_congress_digital_collections.py +23 -0
- scitex/scholar/url_finder/translators/individual/library_of_congress_isbn.py +23 -0
- scitex/scholar/url_finder/translators/individual/libris_isbn.py +23 -0
- scitex/scholar/url_finder/translators/individual/lingbuzz.py +180 -0
- scitex/scholar/url_finder/translators/individual/lippincott_williams_and_wilkins.py +23 -0
- scitex/scholar/url_finder/translators/individual/literary_hub.py +23 -0
- scitex/scholar/url_finder/translators/individual/litres.py +23 -0
- scitex/scholar/url_finder/translators/individual/livejournal.py +23 -0
- scitex/scholar/url_finder/translators/individual/livivo.py +23 -0
- scitex/scholar/url_finder/translators/individual/london_review_of_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/lookus.py +23 -0
- scitex/scholar/url_finder/translators/individual/lulu.py +23 -0
- scitex/scholar/url_finder/translators/individual/lwn_net.py +23 -0
- scitex/scholar/url_finder/translators/individual/lww.py +23 -0
- scitex/scholar/url_finder/translators/individual/mab2.py +23 -0
- scitex/scholar/url_finder/translators/individual/magazines_russ_ru.py +23 -0
- scitex/scholar/url_finder/translators/individual/mailman.py +23 -0
- scitex/scholar/url_finder/translators/individual/mainichi_daily_news.py +23 -0
- scitex/scholar/url_finder/translators/individual/marc.py +23 -0
- scitex/scholar/url_finder/translators/individual/marcxml.py +23 -0
- scitex/scholar/url_finder/translators/individual/mastodon.py +23 -0
- scitex/scholar/url_finder/translators/individual/matbugat_ru.py +23 -0
- scitex/scholar/url_finder/translators/individual/max_planck_institute_for_the_history_of_science_virtual_laboratory_library.py +25 -0
- scitex/scholar/url_finder/translators/individual/mcv.py +23 -0
- scitex/scholar/url_finder/translators/individual/mdpi.py +76 -0
- scitex/scholar/url_finder/translators/individual/mdpi_journals.py +23 -0
- scitex/scholar/url_finder/translators/individual/medes.py +23 -0
- scitex/scholar/url_finder/translators/individual/medium.py +23 -0
- scitex/scholar/url_finder/translators/individual/medline_nbib.py +23 -0
- scitex/scholar/url_finder/translators/individual/medlinenbib.py +23 -0
- scitex/scholar/url_finder/translators/individual/medra.py +23 -0
- scitex/scholar/url_finder/translators/individual/metalib.py +23 -0
- scitex/scholar/url_finder/translators/individual/mets.py +23 -0
- scitex/scholar/url_finder/translators/individual/microbiology_society_journals.py +25 -0
- scitex/scholar/url_finder/translators/individual/microsoft_academic.py +23 -0
- scitex/scholar/url_finder/translators/individual/midas_journals.py +25 -0
- scitex/scholar/url_finder/translators/individual/mikromarc.py +23 -0
- scitex/scholar/url_finder/translators/individual/milli_kutuphane.py +23 -0
- scitex/scholar/url_finder/translators/individual/mit_press_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/mods.py +23 -0
- scitex/scholar/url_finder/translators/individual/mpg_pure.py +23 -0
- scitex/scholar/url_finder/translators/individual/musee_du_louvre.py +23 -0
- scitex/scholar/url_finder/translators/individual/nagoya_university_opac.py +25 -0
- scitex/scholar/url_finder/translators/individual/nasa_ads.py +23 -0
- scitex/scholar/url_finder/translators/individual/nasa_ntrs.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_academies_press.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_agriculture_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_archives_of_australia.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_archives_of_south_africa.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_bureau_of_economic_research.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_diet_library_catalogue.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_gallery_of_art___usa.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_gallery_of_australia.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_library_of_australia__new_catalog_.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_library_of_belarus.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_library_of_norway.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_library_of_poland_isbn.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_post.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_technical_reports_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/national_transportation_library_rosa_p.py +23 -0
- scitex/scholar/url_finder/translators/individual/nature.py +23 -0
- scitex/scholar/url_finder/translators/individual/nature_publishing_group.py +193 -0
- scitex/scholar/url_finder/translators/individual/nber.py +23 -0
- scitex/scholar/url_finder/translators/individual/ncbi_nucleotide.py +23 -0
- scitex/scholar/url_finder/translators/individual/neural_information_processing_systems.py +23 -0
- scitex/scholar/url_finder/translators/individual/new_left_review.py +23 -0
- scitex/scholar/url_finder/translators/individual/new_zealand_herald.py +23 -0
- scitex/scholar/url_finder/translators/individual/newlines_magazine.py +23 -0
- scitex/scholar/url_finder/translators/individual/news_corp_australia.py +23 -0
- scitex/scholar/url_finder/translators/individual/newsbank.py +23 -0
- scitex/scholar/url_finder/translators/individual/newshub_co_nz.py +23 -0
- scitex/scholar/url_finder/translators/individual/newsnettamedia.py +23 -0
- scitex/scholar/url_finder/translators/individual/newspapers_com.py +23 -0
- scitex/scholar/url_finder/translators/individual/noor_digital_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/note_html.py +23 -0
- scitex/scholar/url_finder/translators/individual/note_markdown.py +23 -0
- scitex/scholar/url_finder/translators/individual/notre_dame_philosophical_reviews.py +23 -0
- scitex/scholar/url_finder/translators/individual/npr.py +23 -0
- scitex/scholar/url_finder/translators/individual/nrc_nl.py +23 -0
- scitex/scholar/url_finder/translators/individual/nrc_research_press.py +23 -0
- scitex/scholar/url_finder/translators/individual/ntsb_accident_reports.py +25 -0
- scitex/scholar/url_finder/translators/individual/nypl_menus.py +23 -0
- scitex/scholar/url_finder/translators/individual/nypl_research_catalog.py +23 -0
- scitex/scholar/url_finder/translators/individual/nytimes_com.py +23 -0
- scitex/scholar/url_finder/translators/individual/nzz_ch.py +23 -0
- scitex/scholar/url_finder/translators/individual/oapen.py +23 -0
- scitex/scholar/url_finder/translators/individual/oclc_worldcat_firstsearch.py +23 -0
- scitex/scholar/url_finder/translators/individual/oecd.py +23 -0
- scitex/scholar/url_finder/translators/individual/oecd_ilibrary.py +23 -0
- scitex/scholar/url_finder/translators/individual/ohiolink.py +23 -0
- scitex/scholar/url_finder/translators/individual/open_conf.py +23 -0
- scitex/scholar/url_finder/translators/individual/open_knowledge_repository.py +23 -0
- scitex/scholar/url_finder/translators/individual/open_worldcat.py +23 -0
- scitex/scholar/url_finder/translators/individual/openalex.py +23 -0
- scitex/scholar/url_finder/translators/individual/openalex_json.py +23 -0
- scitex/scholar/url_finder/translators/individual/openedition_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/openedition_journals.py +161 -0
- scitex/scholar/url_finder/translators/individual/openjur.py +23 -0
- scitex/scholar/url_finder/translators/individual/optical_society_of_america.py +23 -0
- scitex/scholar/url_finder/translators/individual/optimization_online.py +23 -0
- scitex/scholar/url_finder/translators/individual/orcid.py +286 -0
- scitex/scholar/url_finder/translators/individual/osa.py +23 -0
- scitex/scholar/url_finder/translators/individual/osf_preprints.py +23 -0
- scitex/scholar/url_finder/translators/individual/osti_energy_citations.py +23 -0
- scitex/scholar/url_finder/translators/individual/ovid.py +23 -0
- scitex/scholar/url_finder/translators/individual/ovid_tagged.py +23 -0
- scitex/scholar/url_finder/translators/individual/oxford.py +148 -0
- scitex/scholar/url_finder/translators/individual/oxford_dictionaries_premium.py +25 -0
- scitex/scholar/url_finder/translators/individual/oxford_english_dictionary.py +23 -0
- scitex/scholar/url_finder/translators/individual/oxford_music_and_art_online.py +23 -0
- scitex/scholar/url_finder/translators/individual/oxford_reference.py +23 -0
- scitex/scholar/url_finder/translators/individual/oxford_university_press.py +23 -0
- scitex/scholar/url_finder/translators/individual/ozon_ru.py +25 -0
- scitex/scholar/url_finder/translators/individual/pajhwok_afghan_news.py +23 -0
- scitex/scholar/url_finder/translators/individual/papers_past.py +23 -0
- scitex/scholar/url_finder/translators/individual/paris_review.py +23 -0
- scitex/scholar/url_finder/translators/individual/pastebin.py +23 -0
- scitex/scholar/url_finder/translators/individual/patents___uspto.py +23 -0
- scitex/scholar/url_finder/translators/individual/pc_gamer.py +23 -0
- scitex/scholar/url_finder/translators/individual/pc_games.py +23 -0
- scitex/scholar/url_finder/translators/individual/peeters.py +23 -0
- scitex/scholar/url_finder/translators/individual/pei_archival_information_network.py +23 -0
- scitex/scholar/url_finder/translators/individual/pep_web.py +23 -0
- scitex/scholar/url_finder/translators/individual/perceiving_systems.py +23 -0
- scitex/scholar/url_finder/translators/individual/perlego.py +23 -0
- scitex/scholar/url_finder/translators/individual/philosopher_s_imprint.py +23 -0
- scitex/scholar/url_finder/translators/individual/philpapers.py +23 -0
- scitex/scholar/url_finder/translators/individual/pkp_catalog_systems.py +23 -0
- scitex/scholar/url_finder/translators/individual/pleade.py +23 -0
- scitex/scholar/url_finder/translators/individual/plos.py +111 -0
- scitex/scholar/url_finder/translators/individual/plos_journals.py +23 -0
- scitex/scholar/url_finder/translators/individual/pnas.py +23 -0
- scitex/scholar/url_finder/translators/individual/polygon.py +25 -0
- scitex/scholar/url_finder/translators/individual/potsdamer_neueste_nachrichten.py +23 -0
- scitex/scholar/url_finder/translators/individual/prc_history_review.py +23 -0
- scitex/scholar/url_finder/translators/individual/preprints_org.py +23 -0
- scitex/scholar/url_finder/translators/individual/primo.py +25 -0
- scitex/scholar/url_finder/translators/individual/primo_2018.py +23 -0
- scitex/scholar/url_finder/translators/individual/primo_normalized_xml.py +23 -0
- scitex/scholar/url_finder/translators/individual/probing_the_past.py +23 -0
- scitex/scholar/url_finder/translators/individual/project_gutenberg.py +23 -0
- scitex/scholar/url_finder/translators/individual/project_muse.py +25 -0
- scitex/scholar/url_finder/translators/individual/promed.py +23 -0
- scitex/scholar/url_finder/translators/individual/proquest.py +23 -0
- scitex/scholar/url_finder/translators/individual/proquest_ebook_central.py +23 -0
- scitex/scholar/url_finder/translators/individual/proquest_policyfile.py +23 -0
- scitex/scholar/url_finder/translators/individual/protein_data_bank.py +23 -0
- scitex/scholar/url_finder/translators/individual/pubfactory_journals.py +25 -0
- scitex/scholar/url_finder/translators/individual/public_record_office_victoria.py +23 -0
- scitex/scholar/url_finder/translators/individual/publications_du_quebec.py +23 -0
- scitex/scholar/url_finder/translators/individual/publications_office_of_the_european_union.py +23 -0
- scitex/scholar/url_finder/translators/individual/pubmed.py +330 -0
- scitex/scholar/url_finder/translators/individual/pubmed_central.py +71 -0
- scitex/scholar/url_finder/translators/individual/pubmed_xml.py +23 -0
- scitex/scholar/url_finder/translators/individual/pubpub.py +23 -0
- scitex/scholar/url_finder/translators/individual/pypi.py +23 -0
- scitex/scholar/url_finder/translators/individual/qatar_digital_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/queensland_state_archives.py +23 -0
- scitex/scholar/url_finder/translators/individual/r_packages.py +23 -0
- scitex/scholar/url_finder/translators/individual/radio_free_europe__radio_liberty.py +23 -0
- scitex/scholar/url_finder/translators/individual/rand.py +23 -0
- scitex/scholar/url_finder/translators/individual/rdf.py +23 -0
- scitex/scholar/url_finder/translators/individual/rechtspraak_nl.py +25 -0
- scitex/scholar/url_finder/translators/individual/redalyc.py +23 -0
- scitex/scholar/url_finder/translators/individual/reddit.py +23 -0
- scitex/scholar/url_finder/translators/individual/referbibix.py +23 -0
- scitex/scholar/url_finder/translators/individual/refworks_tagged.py +23 -0
- scitex/scholar/url_finder/translators/individual/regeringskansliet.py +23 -0
- scitex/scholar/url_finder/translators/individual/repec___econpapers.py +23 -0
- scitex/scholar/url_finder/translators/individual/repec___ideas.py +23 -0
- scitex/scholar/url_finder/translators/individual/repec_ideas.py +23 -0
- scitex/scholar/url_finder/translators/individual/research_square.py +23 -0
- scitex/scholar/url_finder/translators/individual/researchgate.py +23 -0
- scitex/scholar/url_finder/translators/individual/retsinformation.py +23 -0
- scitex/scholar/url_finder/translators/individual/reuters.py +23 -0
- scitex/scholar/url_finder/translators/individual/ris.py +23 -0
- scitex/scholar/url_finder/translators/individual/rock__paper__shotgun.py +23 -0
- scitex/scholar/url_finder/translators/individual/roll_call.py +23 -0
- scitex/scholar/url_finder/translators/individual/rsc.py +23 -0
- scitex/scholar/url_finder/translators/individual/rsc_publishing.py +23 -0
- scitex/scholar/url_finder/translators/individual/russian_state_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/sacramento_bee.py +23 -0
- scitex/scholar/url_finder/translators/individual/sae_papers.py +23 -0
- scitex/scholar/url_finder/translators/individual/safari_books_online.py +23 -0
- scitex/scholar/url_finder/translators/individual/sage.py +23 -0
- scitex/scholar/url_finder/translators/individual/sage_journals.py +183 -0
- scitex/scholar/url_finder/translators/individual/sage_knowledge.py +23 -0
- scitex/scholar/url_finder/translators/individual/saildart.py +23 -0
- scitex/scholar/url_finder/translators/individual/salt_research_archives.py +23 -0
- scitex/scholar/url_finder/translators/individual/sbn_it.py +23 -0
- scitex/scholar/url_finder/translators/individual/scholars_portal_journals.py +23 -0
- scitex/scholar/url_finder/translators/individual/scholia.py +23 -0
- scitex/scholar/url_finder/translators/individual/schweizer_radio_und_fernsehen_srf.py +23 -0
- scitex/scholar/url_finder/translators/individual/scielo.py +23 -0
- scitex/scholar/url_finder/translators/individual/sciencedirect.py +132 -0
- scitex/scholar/url_finder/translators/individual/scinapse.py +110 -0
- scitex/scholar/url_finder/translators/individual/scopus.py +23 -0
- scitex/scholar/url_finder/translators/individual/semantic_scholar.py +23 -0
- scitex/scholar/url_finder/translators/individual/semantics_visual_library.py +25 -0
- scitex/scholar/url_finder/translators/individual/sfu_ipinch.py +23 -0
- scitex/scholar/url_finder/translators/individual/silverchair.py +301 -0
- scitex/scholar/url_finder/translators/individual/sipri.py +23 -0
- scitex/scholar/url_finder/translators/individual/sirs_knowledge_source.py +23 -0
- scitex/scholar/url_finder/translators/individual/slate.py +23 -0
- scitex/scholar/url_finder/translators/individual/slideshare.py +23 -0
- scitex/scholar/url_finder/translators/individual/slub_dresden.py +23 -0
- scitex/scholar/url_finder/translators/individual/sora.py +25 -0
- scitex/scholar/url_finder/translators/individual/springer.py +121 -0
- scitex/scholar/url_finder/translators/individual/springer_link.py +23 -0
- scitex/scholar/url_finder/translators/individual/ssoar.py +23 -0
- scitex/scholar/url_finder/translators/individual/ssrn.py +63 -0
- scitex/scholar/url_finder/translators/individual/stack_exchange.py +23 -0
- scitex/scholar/url_finder/translators/individual/standard_ebooks.py +23 -0
- scitex/scholar/url_finder/translators/individual/stanford_encyclopedia_of_philosophy.py +25 -0
- scitex/scholar/url_finder/translators/individual/stanford_university_press.py +23 -0
- scitex/scholar/url_finder/translators/individual/state_records_office_of_western_australia.py +23 -0
- scitex/scholar/url_finder/translators/individual/state_records_office_wa.py +23 -0
- scitex/scholar/url_finder/translators/individual/stitcher.py +23 -0
- scitex/scholar/url_finder/translators/individual/store_norske_leksikon.py +23 -0
- scitex/scholar/url_finder/translators/individual/stuff_co_nz.py +23 -0
- scitex/scholar/url_finder/translators/individual/substack.py +23 -0
- scitex/scholar/url_finder/translators/individual/sud_ouest.py +23 -0
- scitex/scholar/url_finder/translators/individual/sueddeutsche_de.py +23 -0
- scitex/scholar/url_finder/translators/individual/summon_2.py +23 -0
- scitex/scholar/url_finder/translators/individual/superlib.py +25 -0
- scitex/scholar/url_finder/translators/individual/svenska_dagbladet.py +23 -0
- scitex/scholar/url_finder/translators/individual/sveriges_radio.py +23 -0
- scitex/scholar/url_finder/translators/individual/svt_nyheter.py +23 -0
- scitex/scholar/url_finder/translators/individual/tagesspiegel.py +23 -0
- scitex/scholar/url_finder/translators/individual/talis_aspire.py +23 -0
- scitex/scholar/url_finder/translators/individual/talisprism.py +23 -0
- scitex/scholar/url_finder/translators/individual/tatknigafund.py +23 -0
- scitex/scholar/url_finder/translators/individual/tatpressa_ru.py +23 -0
- scitex/scholar/url_finder/translators/individual/taylor___francis_ebooks.py +23 -0
- scitex/scholar/url_finder/translators/individual/taylor_and_francis_nejm.py +23 -0
- scitex/scholar/url_finder/translators/individual/taylor_francis.py +23 -0
- scitex/scholar/url_finder/translators/individual/taylor_francis_nejm.py +23 -0
- scitex/scholar/url_finder/translators/individual/taz_de.py +23 -0
- scitex/scholar/url_finder/translators/individual/tei.py +23 -0
- scitex/scholar/url_finder/translators/individual/tesis_doctorals_en_xarxa.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_art_newspaper.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_atlantic.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_boston_globe.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_chronicle_of_higher_education.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_daily_beast.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_economic_times___the_times_of_india.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_economist.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_free_dictionary.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_globe_and_mail.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_guardian.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_hamilton_spectator.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_hindu.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_independent.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_intercept.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_met.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_microfinance_gateway.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_nation.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_national_archives__uk_.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_new_republic.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_new_york_review_of_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_new_yorker.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_open_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_straits_times.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_telegraph.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_times_and_sunday_times.py +23 -0
- scitex/scholar/url_finder/translators/individual/the_times_of_israel.py +23 -0
- scitex/scholar/url_finder/translators/individual/themarker.py +23 -0
- scitex/scholar/url_finder/translators/individual/theory_of_computing.py +23 -0
- scitex/scholar/url_finder/translators/individual/thesesfr.py +23 -0
- scitex/scholar/url_finder/translators/individual/thieme.py +23 -0
- scitex/scholar/url_finder/translators/individual/time_com.py +23 -0
- scitex/scholar/url_finder/translators/individual/timesmachine.py +23 -0
- scitex/scholar/url_finder/translators/individual/tony_blair_institute.py +23 -0
- scitex/scholar/url_finder/translators/individual/tony_blair_institute_for_global_change.py +23 -0
- scitex/scholar/url_finder/translators/individual/toronto_star.py +23 -0
- scitex/scholar/url_finder/translators/individual/transportation_research_board.py +23 -0
- scitex/scholar/url_finder/translators/individual/treesearch.py +25 -0
- scitex/scholar/url_finder/translators/individual/trove.py +23 -0
- scitex/scholar/url_finder/translators/individual/tumblr.py +23 -0
- scitex/scholar/url_finder/translators/individual/tv_by_the_numbers.py +23 -0
- scitex/scholar/url_finder/translators/individual/tvnz.py +23 -0
- scitex/scholar/url_finder/translators/individual/twitter.py +23 -0
- scitex/scholar/url_finder/translators/individual/ubiquity_journals.py +23 -0
- scitex/scholar/url_finder/translators/individual/uchicago_vufind.py +23 -0
- scitex/scholar/url_finder/translators/individual/unapi.py +23 -0
- scitex/scholar/url_finder/translators/individual/unesco.py +23 -0
- scitex/scholar/url_finder/translators/individual/university_of_california_press_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/university_of_chicago_press_books.py +25 -0
- scitex/scholar/url_finder/translators/individual/university_of_wisconsin_madison_libraries_catalog.py +23 -0
- scitex/scholar/url_finder/translators/individual/university_press_scholarship.py +23 -0
- scitex/scholar/url_finder/translators/individual/unqualified_dublin_core_rdf.py +23 -0
- scitex/scholar/url_finder/translators/individual/unz_print_archive.py +23 -0
- scitex/scholar/url_finder/translators/individual/upcommons.py +23 -0
- scitex/scholar/url_finder/translators/individual/uptodate_references.py +23 -0
- scitex/scholar/url_finder/translators/individual/us_national_archives_research_catalog.py +23 -0
- scitex/scholar/url_finder/translators/individual/usenix.py +23 -0
- scitex/scholar/url_finder/translators/individual/vanity_fair.py +23 -0
- scitex/scholar/url_finder/translators/individual/verniana.py +23 -0
- scitex/scholar/url_finder/translators/individual/verniana_jules_verne_studies.py +23 -0
- scitex/scholar/url_finder/translators/individual/verso_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/vice.py +23 -0
- scitex/scholar/url_finder/translators/individual/victoria___albert_museum.py +23 -0
- scitex/scholar/url_finder/translators/individual/vimeo.py +23 -0
- scitex/scholar/url_finder/translators/individual/vlex.py +25 -0
- scitex/scholar/url_finder/translators/individual/voxeu.py +23 -0
- scitex/scholar/url_finder/translators/individual/wall_street_journal.py +23 -0
- scitex/scholar/url_finder/translators/individual/wanfang_data.py +23 -0
- scitex/scholar/url_finder/translators/individual/washington_monthly.py +23 -0
- scitex/scholar/url_finder/translators/individual/washington_post.py +23 -0
- scitex/scholar/url_finder/translators/individual/web_of_science.py +25 -0
- scitex/scholar/url_finder/translators/individual/web_of_science_nextgen.py +25 -0
- scitex/scholar/url_finder/translators/individual/web_of_science_tagged.py +23 -0
- scitex/scholar/url_finder/translators/individual/welt_online.py +23 -0
- scitex/scholar/url_finder/translators/individual/westlaw_uk.py +23 -0
- scitex/scholar/url_finder/translators/individual/who.py +23 -0
- scitex/scholar/url_finder/translators/individual/wikidata.py +23 -0
- scitex/scholar/url_finder/translators/individual/wikidata_quickstatements.py +23 -0
- scitex/scholar/url_finder/translators/individual/wikileaks_plusd.py +23 -0
- scitex/scholar/url_finder/translators/individual/wikimedia_commons.py +23 -0
- scitex/scholar/url_finder/translators/individual/wikipedia.py +23 -0
- scitex/scholar/url_finder/translators/individual/wikipedia_citation_templates.py +23 -0
- scitex/scholar/url_finder/translators/individual/wikisource.py +23 -0
- scitex/scholar/url_finder/translators/individual/wikiwand.py +23 -0
- scitex/scholar/url_finder/translators/individual/wiktionary.py +23 -0
- scitex/scholar/url_finder/translators/individual/wildlife_biology_in_practice.py +23 -0
- scitex/scholar/url_finder/translators/individual/wiley.py +187 -0
- scitex/scholar/url_finder/translators/individual/wiley_online_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/wilson_center_digital_archive.py +25 -0
- scitex/scholar/url_finder/translators/individual/winnipeg_free_press.py +23 -0
- scitex/scholar/url_finder/translators/individual/wipo.py +23 -0
- scitex/scholar/url_finder/translators/individual/wired.py +23 -0
- scitex/scholar/url_finder/translators/individual/wiso.py +23 -0
- scitex/scholar/url_finder/translators/individual/womennews.py +23 -0
- scitex/scholar/url_finder/translators/individual/world_bank.py +23 -0
- scitex/scholar/url_finder/translators/individual/world_digital_library.py +23 -0
- scitex/scholar/url_finder/translators/individual/world_history_connected.py +23 -0
- scitex/scholar/url_finder/translators/individual/world_shakespeare_bibliography_online.py +23 -0
- scitex/scholar/url_finder/translators/individual/worldcat_discovery_service.py +23 -0
- scitex/scholar/url_finder/translators/individual/xml_contextobject.py +23 -0
- scitex/scholar/url_finder/translators/individual/yandex_books.py +23 -0
- scitex/scholar/url_finder/translators/individual/ynet.py +23 -0
- scitex/scholar/url_finder/translators/individual/youtube.py +23 -0
- scitex/scholar/url_finder/translators/individual/ypfs.py +23 -0
- scitex/scholar/url_finder/translators/individual/ypsf.py +23 -0
- scitex/scholar/url_finder/translators/individual/zbmath.py +102 -0
- scitex/scholar/url_finder/translators/individual/zenodo.py +23 -0
- scitex/scholar/url_finder/translators/individual/ziponline.py +23 -0
- scitex/scholar/url_finder/translators/individual/zobodat.py +23 -0
- scitex/scholar/url_finder/translators/individual/zotero_org.py +23 -0
- scitex/scholar/url_finder/translators/individual/zotero_rdf.py +23 -0
- scitex/scholar/url_finder/translators/individual/zoterobib.py +23 -0
- scitex/scholar/utils/__init__.py +25 -0
- scitex/scholar/utils/bibtex/__init__.py +9 -0
- scitex/scholar/utils/bibtex/_parse_bibtex.py +71 -0
- scitex/scholar/utils/cleanup/__init__.py +8 -0
- scitex/scholar/utils/cleanup/_cleanup_scholar_processes.py +92 -0
- scitex/scholar/utils/create_demo_movie.sh +49 -0
- scitex/scholar/utils/text/_TextNormalizer.py +480 -0
- scitex/scholar/utils/text/__init__.py +9 -0
- scitex/scholar/utils/validation/DOIValidator.py +292 -0
- scitex/scholar/utils/validation/README.md +169 -0
- scitex/scholar/utils/validation/__init__.py +13 -0
- scitex/scholar/utils/validation/validate_library_dois.py +225 -0
- scitex/scholar/zotero/__init__.py +38 -0
- scitex/security/README.md +255 -0
- scitex/security/__init__.py +34 -0
- scitex/security/cli.py +127 -0
- scitex/security/github.py +371 -0
- scitex/session/README.md +252 -0
- scitex/session/__init__.py +55 -0
- scitex/session/_decorator.py +631 -0
- scitex/session/_lifecycle.py +827 -0
- scitex/session/_manager.py +106 -0
- scitex/session/template.py +29 -0
- scitex/sh/README.md +58 -0
- scitex/sh/__init__.py +93 -0
- scitex/sh/_execute.py +224 -0
- scitex/sh/_security.py +67 -0
- scitex/sh/_types.py +28 -0
- scitex/sh/test_sh.py +72 -0
- scitex/sh/test_sh_simple.py +61 -0
- scitex/stats/README.md +1085 -0
- scitex/stats/__init__.py +350 -0
- scitex/stats/__main__.py +281 -0
- scitex/stats/_figrecipe_integration.py +116 -0
- scitex/stats/_mcp/__init__.py +4 -0
- scitex/stats/_mcp/handlers.py +1191 -0
- scitex/stats/_mcp/tool_schemas.py +384 -0
- scitex/stats/_schema.py +50 -0
- scitex/stats/auto/__init__.py +187 -0
- scitex/stats/auto/_context.py +331 -0
- scitex/stats/auto/_formatting.py +679 -0
- scitex/stats/auto/_rules.py +901 -0
- scitex/stats/auto/_selector.py +554 -0
- scitex/stats/auto/_styles.py +721 -0
- scitex/stats/correct/__init__.py +21 -0
- scitex/stats/correct/_correct_bonferroni.py +560 -0
- scitex/stats/correct/_correct_fdr.py +608 -0
- scitex/stats/correct/_correct_fdr_.py +627 -0
- scitex/stats/correct/_correct_holm.py +530 -0
- scitex/stats/correct/_correct_sidak.py +514 -0
- scitex/stats/descriptive/__init__.py +85 -0
- scitex/stats/descriptive/_circular.py +400 -0
- scitex/stats/descriptive/_describe.py +182 -0
- scitex/stats/descriptive/_nan.py +288 -0
- scitex/stats/descriptive/_real.py +176 -0
- scitex/stats/effect_sizes/__init__.py +40 -0
- scitex/stats/effect_sizes/_cliffs_delta.py +328 -0
- scitex/stats/effect_sizes/_cohens_d.py +341 -0
- scitex/stats/effect_sizes/_epsilon_squared.py +312 -0
- scitex/stats/effect_sizes/_eta_squared.py +298 -0
- scitex/stats/effect_sizes/_prob_superiority.py +293 -0
- scitex/stats/io/__init__.py +28 -0
- scitex/stats/io/_bundle.py +156 -0
- scitex/stats/mcp_server.py +405 -0
- scitex/stats/posthoc/__init__.py +19 -0
- scitex/stats/posthoc/_dunnett.py +483 -0
- scitex/stats/posthoc/_games_howell.py +401 -0
- scitex/stats/posthoc/_tukey_hsd.py +375 -0
- scitex/stats/power/__init__.py +19 -0
- scitex/stats/power/_power.py +433 -0
- scitex/stats/run_all.sh +46 -0
- scitex/stats/tests/__init__.py +13 -0
- scitex/stats/tests/correlation/__init__.py +13 -0
- scitex/stats/tests/correlation/_test_pearson.py +252 -0
- scitex/stats/utils/__init__.py +59 -0
- scitex/stats/utils/_effect_size.py +992 -0
- scitex/stats/utils/_formatters.py +276 -0
- scitex/stats/utils/_normalizers.py +923 -0
- scitex/stats/utils/_power.py +433 -0
- scitex/str/__init__.py +112 -0
- scitex/str/_clean_path.py +79 -0
- scitex/str/_color_text.py +52 -0
- scitex/str/_decapitalize.py +58 -0
- scitex/str/_factor_out_digits.py +309 -0
- scitex/str/_format_plot_text.py +567 -0
- scitex/str/_grep.py +48 -0
- scitex/str/_latex.py +97 -0
- scitex/str/_latex_fallback.py +583 -0
- scitex/str/_mask_api.py +39 -0
- scitex/str/_mask_api_key.py +8 -0
- scitex/str/_parse.py +163 -0
- scitex/str/_print_block.py +47 -0
- scitex/str/_print_debug.py +68 -0
- scitex/str/_printc.py +62 -0
- scitex/str/_readable_bytes.py +38 -0
- scitex/str/_remove_ansi.py +23 -0
- scitex/str/_replace.py +134 -0
- scitex/str/_search.py +127 -0
- scitex/str/_squeeze_space.py +36 -0
- scitex/template/README.md +154 -0
- scitex/template/__init__.py +115 -0
- scitex/template/_clone_project.py +167 -0
- scitex/template/_copy.py +105 -0
- scitex/template/_customize.py +116 -0
- scitex/template/_git_strategy.py +122 -0
- scitex/template/_logging_helpers.py +122 -0
- scitex/template/_mcp/__init__.py +4 -0
- scitex/template/_mcp/handlers.py +259 -0
- scitex/template/_mcp/tool_schemas.py +112 -0
- scitex/template/_rename.py +65 -0
- scitex/template/clone_pip_project.py +106 -0
- scitex/template/clone_research.py +106 -0
- scitex/template/clone_singularity.py +106 -0
- scitex/template/clone_writer_directory.py +106 -0
- scitex/template/mcp_server.py +186 -0
- scitex/tex/__init__.py +14 -0
- scitex/tex/_export.py +890 -0
- scitex/tex/_preview.py +118 -0
- scitex/tex/_to_vec.py +119 -0
- scitex/torch/__init__.py +28 -0
- scitex/torch/_apply_to.py +34 -0
- scitex/torch/_nan_funcs.py +77 -0
- scitex/types/_ArrayLike.py +66 -0
- scitex/types/_ColorLike.py +21 -0
- scitex/types/__init__.py +14 -0
- scitex/types/_is_listed_X.py +70 -0
- scitex/ui/__init__.py +173 -0
- scitex/ui/_backends/__init__.py +66 -0
- scitex/ui/_backends/_audio.py +86 -0
- scitex/ui/_backends/_config.py +250 -0
- scitex/ui/_backends/_desktop.py +184 -0
- scitex/ui/_backends/_emacs.py +210 -0
- scitex/ui/_backends/_email.py +78 -0
- scitex/ui/_backends/_matplotlib.py +118 -0
- scitex/ui/_backends/_playwright.py +109 -0
- scitex/ui/_backends/_types.py +58 -0
- scitex/ui/_backends/_webhook.py +77 -0
- scitex/ui/_backends.py +37 -0
- scitex/ui/_mcp/__init__.py +23 -0
- scitex/ui/_mcp/handlers.py +260 -0
- scitex/ui/_mcp/tool_schemas.py +107 -0
- scitex/ui/mcp_server.py +151 -0
- scitex/units.py +324 -0
- scitex/utils/__init__.py +22 -0
- scitex/utils/_compress_hdf5.py +127 -0
- scitex/utils/_email.py +221 -0
- scitex/utils/_grid.py +150 -0
- scitex/utils/_notify.py +267 -0
- scitex/utils/_search.py +123 -0
- scitex/utils/_verify_scitex_format.py +564 -0
- scitex/utils/_verify_scitex_format_v01.py +348 -0
- scitex/utils/template.py +123 -0
- scitex/web/__init__.py +49 -0
- scitex/web/_scraping.py +150 -0
- scitex/web/_search_pubmed.py +501 -0
- scitex/web/_summarize_url.py +158 -0
- scitex/web/download_images.py +316 -0
- scitex/writer/README.md +435 -0
- scitex/writer/Writer.py +487 -0
- scitex/writer/__init__.py +46 -0
- scitex/writer/_clone_writer_project.py +160 -0
- scitex/writer/_compile/__init__.py +41 -0
- scitex/writer/_compile/_compile_async.py +130 -0
- scitex/writer/_compile/_compile_unified.py +148 -0
- scitex/writer/_compile/_parser.py +63 -0
- scitex/writer/_compile/_runner.py +457 -0
- scitex/writer/_compile/_validator.py +46 -0
- scitex/writer/_compile/manuscript.py +110 -0
- scitex/writer/_compile/revision.py +82 -0
- scitex/writer/_compile/supplementary.py +100 -0
- scitex/writer/_dataclasses/__init__.py +44 -0
- scitex/writer/_dataclasses/config/_CONSTANTS.py +46 -0
- scitex/writer/_dataclasses/config/_WriterConfig.py +175 -0
- scitex/writer/_dataclasses/config/__init__.py +9 -0
- scitex/writer/_dataclasses/contents/_ManuscriptContents.py +236 -0
- scitex/writer/_dataclasses/contents/_RevisionContents.py +136 -0
- scitex/writer/_dataclasses/contents/_SupplementaryContents.py +114 -0
- scitex/writer/_dataclasses/contents/__init__.py +9 -0
- scitex/writer/_dataclasses/core/_Document.py +146 -0
- scitex/writer/_dataclasses/core/_DocumentSection.py +546 -0
- scitex/writer/_dataclasses/core/__init__.py +7 -0
- scitex/writer/_dataclasses/results/_CompilationResult.py +165 -0
- scitex/writer/_dataclasses/results/_LaTeXIssue.py +102 -0
- scitex/writer/_dataclasses/results/_SaveSectionsResponse.py +118 -0
- scitex/writer/_dataclasses/results/_SectionReadResponse.py +131 -0
- scitex/writer/_dataclasses/results/__init__.py +11 -0
- scitex/writer/_dataclasses/tree/MINIMUM_FILES.md +121 -0
- scitex/writer/_dataclasses/tree/_ConfigTree.py +86 -0
- scitex/writer/_dataclasses/tree/_ManuscriptTree.py +84 -0
- scitex/writer/_dataclasses/tree/_RevisionTree.py +97 -0
- scitex/writer/_dataclasses/tree/_ScriptsTree.py +118 -0
- scitex/writer/_dataclasses/tree/_SharedTree.py +100 -0
- scitex/writer/_dataclasses/tree/_SupplementaryTree.py +101 -0
- scitex/writer/_dataclasses/tree/__init__.py +23 -0
- scitex/writer/_mcp/__init__.py +4 -0
- scitex/writer/_mcp/handlers.py +765 -0
- scitex/writer/_mcp/tool_schemas.py +363 -0
- scitex/writer/_project/__init__.py +29 -0
- scitex/writer/_project/_create.py +89 -0
- scitex/writer/_project/_trees.py +63 -0
- scitex/writer/_project/_validate.py +61 -0
- scitex/writer/utils/.legacy_git_retry.py +164 -0
- scitex/writer/utils/__init__.py +24 -0
- scitex/writer/utils/_converters.py +635 -0
- scitex/writer/utils/_parse_latex_logs.py +138 -0
- scitex/writer/utils/_parse_script_args.py +156 -0
- scitex/writer/utils/_verify_tree_structure.py +205 -0
- scitex/writer/utils/_watch.py +96 -0
- scitex-2.14.0.dist-info/METADATA +1238 -0
- scitex-2.14.0.dist-info/RECORD +2498 -0
- scitex-2.14.0.dist-info/WHEEL +4 -0
- scitex-2.14.0.dist-info/entry_points.txt +12 -0
- scitex-2.14.0.dist-info/licenses/LICENSE +661 -0
|
@@ -0,0 +1,1496 @@
|
|
|
1
|
+
<!-- ---
|
|
2
|
+
!-- Timestamp: 2025-07-31 02:56:40
|
|
3
|
+
!-- Author: ywatanabe
|
|
4
|
+
!-- File: /home/ywatanabe/proj/scitex_repo/src/scitex/scholar/docs/medium_article_on_logined_page_for_zenrows_2.md
|
|
5
|
+
!-- --- -->
|
|
6
|
+
|
|
7
|
+
In simple terms, ZenRows cannot use your personal login session because it's a separate, remote service. It's like giving a delivery driver a photocopy of your ID—they can show_async it, but they can't pass the real identity check.
|
|
8
|
+
|
|
9
|
+
The Analogy: A Magic Wristband at a Festival 🎟️
|
|
10
|
+
Your Local Browser (with OpenURLResolver): You go to the festival gate and show_async your ticket and ID (you log in via OpenAthens). They give you a secure, non-transferable "magic wristband" (an authenticate_async session). Now you can walk around the festival, and every stall (publisher website) can see your wristband and knows you're allowed to be there.
|
|
11
|
+
|
|
12
|
+
The ZenRows API (with ZenRowsOpenURLResolver): You are outside the festival. You ask a delivery driver (the ZenRows API 🚚) to go to a specific stall for you. You can give them a picture of your wristband (your auth cookies), but when the security guard at the stall (the resolver's JavaScript 🔐) tries to scan it, their machine says it's invalid because it's not the real wristband from the main gate. The driver is stuck.
|
|
13
|
+
|
|
14
|
+
The Technical Reason
|
|
15
|
+
Your ScholarAuthManager uses a real browser (Playwright) to log you in. This creates a rich authentication context, which includes:
|
|
16
|
+
|
|
17
|
+
Cookies
|
|
18
|
+
|
|
19
|
+
Session Storage
|
|
20
|
+
|
|
21
|
+
Local Storage
|
|
22
|
+
|
|
23
|
+
A specific browser fingerprint
|
|
24
|
+
|
|
25
|
+
When your OpenURLResolver clicks a "View Full Text" link, the website's JavaScript doesn't just look for a cookie. It often checks the entire session context to verify you are who you say you are before redirecting you.
|
|
26
|
+
|
|
27
|
+
The ZenRowsOpenURLResolver sends a URL to a completely different browser running on ZenRows' servers. Even when you pass your cookies along, that remote browser is missing the rest of the essential authentication context. The resolver page's JavaScript detects this and refuses to perform the redirect, leaving ZenRows stuck on the resolver page.
|
|
28
|
+
|
|
29
|
+
The Solution
|
|
30
|
+
Your agent's conclusion is the correct approach:
|
|
31
|
+
|
|
32
|
+
For subscription content that needs your institutional login: Use the standard, browser-based OpenURLResolver. It keeps everything inside the same browser where you logged in (the one with the "magic wristband").
|
|
33
|
+
|
|
34
|
+
For public content blocked by bot detection/CAPTCHAs: Use the ZenRowsOpenURLResolver. It's perfect for scraping sites that don't require your personal login but are difficult to access.
|
|
35
|
+
|
|
36
|
+
|
|
37
|
+
|
|
38
|
+
|
|
39
|
+
|
|
40
|
+
|
|
41
|
+
in that case, i do not have any benefits by using zenrows and 2captcha.
|
|
42
|
+
|
|
43
|
+
|
|
44
|
+
|
|
45
|
+
how about this info? https://medium.com/@zenrows/web-scraping-login-python-948c2f4a4662
|
|
46
|
+
|
|
47
|
+
|
|
48
|
+
|
|
49
|
+
How to Scrape a Website that Requires a Login with Python
|
|
50
|
+
|
|
51
|
+
|
|
52
|
+
|
|
53
|
+
ZenRows
|
|
54
|
+
|
|
55
|
+
Follow
|
|
56
|
+
|
|
57
|
+
11 min read
|
|
58
|
+
|
|
59
|
+
·
|
|
60
|
+
|
|
61
|
+
Oct 12, 2023
|
|
62
|
+
|
|
63
|
+
95
|
|
64
|
+
|
|
65
|
+
|
|
66
|
+
|
|
67
|
+
1
|
|
68
|
+
|
|
69
|
+
|
|
70
|
+
|
|
71
|
+
|
|
72
|
+
|
|
73
|
+
|
|
74
|
+
|
|
75
|
+
While web scraping, you might find some data available only after you’ve signed in. In this tutorial, we’ll learn the security measures used and three effective methods to scrape_async a website that requires a login with Python.
|
|
76
|
+
|
|
77
|
+
Let’s find a solution!
|
|
78
|
+
|
|
79
|
+
|
|
80
|
+
|
|
81
|
+
Can You Scrape Websites that Require a Login?
|
|
82
|
+
|
|
83
|
+
Yes, it’s technically possible to scrape_async behind a login. But you must be mindful of the target site’s scraping rules and laws like GDPR to comply with personal data and privacy matters.
|
|
84
|
+
|
|
85
|
+
To get started, it’s essential to have some general knowledge about HTTP Request Methods. And if web scraping is new for you, read our beginner-friendly guide on web scraping with Python to master the fundamentals.
|
|
86
|
+
|
|
87
|
+
How Do You Log into a Website with Python?
|
|
88
|
+
|
|
89
|
+
The first step to scraping a login-protected website with Python is figuring out your target domain’s login type. Some old websites just require sending a username and password. However, modern ones use more advanced security measures. These include:
|
|
90
|
+
|
|
91
|
+
Client-side validations
|
|
92
|
+
|
|
93
|
+
CSRF tokens
|
|
94
|
+
|
|
95
|
+
Web Application Firewalls (WAFs)
|
|
96
|
+
|
|
97
|
+
Keep reading to learn the techniques to get around these strict security protections.
|
|
98
|
+
|
|
99
|
+
How Do You Scrape a Website behind a Login in Python?
|
|
100
|
+
|
|
101
|
+
Time to explore each step of scraping data behind site logins with Python. We’ll start with forms requiring only a username and password and then increase the difficulty progressively.
|
|
102
|
+
|
|
103
|
+
Remember that the methods show_asynccased in this tutorial are for educational purposes only.
|
|
104
|
+
|
|
105
|
+
Three, two, one… let’s code!
|
|
106
|
+
|
|
107
|
+
Sites Requiring a Simple Username and Password Login
|
|
108
|
+
|
|
109
|
+
We assume you’ve already set up Python 3 and Pip; otherwise, you should check a guide on properly installing Python.
|
|
110
|
+
|
|
111
|
+
As dependencies, we’ll use the Requests and Beautiful Soup libraries. Start by installing them:
|
|
112
|
+
|
|
113
|
+
pip install requests beautifulsoup4
|
|
114
|
+
|
|
115
|
+
Tip: If you need any help during the installation, visit this page for Requests and this one for Beautiful Soup.
|
|
116
|
+
|
|
117
|
+
Now, go to Acunetix’s User Information. This is a test page explicitly made for learning purposes and protected by a simple login, so you’ll be redirected to a login page.
|
|
118
|
+
|
|
119
|
+
Before going further, we’ll analyze what happens when attempting a login. For that, use test as a username and password, hit the login button and check the network section on your browser.
|
|
120
|
+
|
|
121
|
+
Zoom image will be displayed
|
|
122
|
+
|
|
123
|
+
|
|
124
|
+
|
|
125
|
+
|
|
126
|
+
|
|
127
|
+
Submitting the form generates a POST request to the User Information page, with the server responding with a cookie and fulfilling the requested section. The screenshot below show_asyncs the headers, payload, response, and cookies.
|
|
128
|
+
|
|
129
|
+
Zoom image will be displayed
|
|
130
|
+
|
|
131
|
+
|
|
132
|
+
|
|
133
|
+
|
|
134
|
+
|
|
135
|
+
The following scraping script will bypass the auth wall. It creates a similar payload and posts the request to the User Information page. Once the response arrives, the program uses Beautiful Soup to parse the response text and print the page name.
|
|
136
|
+
|
|
137
|
+
from bs4 import BeautifulSoup as bs
|
|
138
|
+
|
|
139
|
+
import requests
|
|
140
|
+
|
|
141
|
+
URL = "http://testphp.vulnweb.com/userinfo.php"
|
|
142
|
+
|
|
143
|
+
|
|
144
|
+
payload = {
|
|
145
|
+
|
|
146
|
+
"uname": "test",
|
|
147
|
+
|
|
148
|
+
"pass": "test"
|
|
149
|
+
|
|
150
|
+
}
|
|
151
|
+
|
|
152
|
+
s = requests.session()
|
|
153
|
+
|
|
154
|
+
response = s.post(URL, data=payload)
|
|
155
|
+
|
|
156
|
+
print(response.status_code) # If the request went Ok we usually get a 200 status.
|
|
157
|
+
|
|
158
|
+
|
|
159
|
+
from bs4 import BeautifulSoup
|
|
160
|
+
|
|
161
|
+
soup = BeautifulSoup(response.content, "html.parser")
|
|
162
|
+
|
|
163
|
+
protected_content = soup.find(attrs={"id": "pageName"}).text
|
|
164
|
+
|
|
165
|
+
print(protected_content)
|
|
166
|
+
|
|
167
|
+
This is our output:
|
|
168
|
+
|
|
169
|
+
|
|
170
|
+
|
|
171
|
+
Great! 🎉 You just learned scraping sites behind simple logins with Python. Now, let’s try with a bit more complex protections.
|
|
172
|
+
|
|
173
|
+
Scraping Websites with CSRF Token Authentication for Login
|
|
174
|
+
|
|
175
|
+
It’s not that easy to log into a website in 2023. Most have implemented additional security measures to stop hackers and malicious bots. One of these measures requires a CSRF (Cross-Site Request Forgery) token in the authentication process.
|
|
176
|
+
|
|
177
|
+
To find out if your target website requires CSRF or an authenticity_token, make the most of your browser’s Developer Tools. It doesn’t matter whether you use Safari, Chrome, Edge, Chromium, or Firefox since all provide a similar set of powerful tools for developers. To learn more, we suggest checking out the Chrome DevTools or Mozilla DevTools documentation.
|
|
178
|
+
|
|
179
|
+
Let’s dive into scraping GitHub!
|
|
180
|
+
|
|
181
|
+
Step #1: Log into a GitHub Account
|
|
182
|
+
|
|
183
|
+
GitHub is one of the websites that use CSRF token authentication for logins. We’ll scrape_async all the repositories in our test account for demonstration.
|
|
184
|
+
|
|
185
|
+
Open a web browser (we use Chrome) and navigate to GitHub’s login page. Now, press the F12 key to see the DevTools window in your browser and inspect the HTML to check if the login form element has an action attribute:
|
|
186
|
+
|
|
187
|
+
Zoom image will be displayed
|
|
188
|
+
|
|
189
|
+
|
|
190
|
+
|
|
191
|
+
|
|
192
|
+
|
|
193
|
+
Select the Network tab, click the Sign in button, then fill in and submit the form yourself. This’ll perform a few HTTP requests, visible in this tab.
|
|
194
|
+
|
|
195
|
+
Zoom image will be displayed
|
|
196
|
+
|
|
197
|
+
|
|
198
|
+
|
|
199
|
+
|
|
200
|
+
|
|
201
|
+
Let’s look at what we’ve got after clicking on the Sign in button. To do so, explore the POST request named session that has just been sent.
|
|
202
|
+
|
|
203
|
+
In the Headers section, you’ll find the full URL where the credentials are posted. We’ll use it to send a login request in our script.
|
|
204
|
+
|
|
205
|
+
Zoom image will be displayed
|
|
206
|
+
|
|
207
|
+
|
|
208
|
+
|
|
209
|
+
|
|
210
|
+
|
|
211
|
+
Step #2: Set up Payload for the CSRF-protected Login Request
|
|
212
|
+
|
|
213
|
+
Now, you might be wondering how we know there’s CSRF protection. The answer is in right front of you:
|
|
214
|
+
|
|
215
|
+
Navigate to the Payload section of the session request. Notice that, in addition to login and password, we have payload data for the authentication token and the timestamps. This auth token is the CSRF token and must be passed as a payload along the login POST request.
|
|
216
|
+
|
|
217
|
+
Zoom image will be displayed
|
|
218
|
+
|
|
219
|
+
|
|
220
|
+
|
|
221
|
+
|
|
222
|
+
|
|
223
|
+
Manually copying these fields from the Payload section for each new login request will be tedious. Instead, we’ll write code to get that programmatically.
|
|
224
|
+
|
|
225
|
+
Let’s go back to the HTML source of the login form. You’ll see all the Payload fields are present in the form.
|
|
226
|
+
|
|
227
|
+
Zoom image will be displayed
|
|
228
|
+
|
|
229
|
+
|
|
230
|
+
|
|
231
|
+
|
|
232
|
+
|
|
233
|
+
The following script gets the CSRF token, timestamp, and timestamp_secret from the login page:
|
|
234
|
+
|
|
235
|
+
import requests
|
|
236
|
+
|
|
237
|
+
from bs4 import BeautifulSoup
|
|
238
|
+
|
|
239
|
+
login_url = "https://github.com/session"
|
|
240
|
+
|
|
241
|
+
login = "Your Git username Here"
|
|
242
|
+
|
|
243
|
+
password = "Your Git Password Here"
|
|
244
|
+
|
|
245
|
+
with requests.session() as s:
|
|
246
|
+
|
|
247
|
+
req = s.get(login_url).text
|
|
248
|
+
|
|
249
|
+
html = BeautifulSoup(req,"html.parser")
|
|
250
|
+
|
|
251
|
+
token = html.find("input", {"name": "authenticity_token"}). attrs["value"]
|
|
252
|
+
|
|
253
|
+
time = html.find("input", {"name": "timestamp"}).attrs["value"]
|
|
254
|
+
|
|
255
|
+
timeSecret = html.find("input", {"name": "timestamp_secret"}). attrs["value"]
|
|
256
|
+
|
|
257
|
+
We can now populate the payload dictionary for our Python login request as:
|
|
258
|
+
|
|
259
|
+
payload = {
|
|
260
|
+
|
|
261
|
+
"authenticity_token": token,
|
|
262
|
+
|
|
263
|
+
"login": login,
|
|
264
|
+
|
|
265
|
+
"password": password,
|
|
266
|
+
|
|
267
|
+
"timestamp": time,
|
|
268
|
+
|
|
269
|
+
"timestamp_secret": timeSecret
|
|
270
|
+
|
|
271
|
+
}
|
|
272
|
+
|
|
273
|
+
Note: If you can’t find the CSRF token on the HTML, it’s probably saved in a cookie. In Chromium-based browsers, go to the Application tab in the DevTools. Then, in the left panel, search for cookies and select the domain of your target website.
|
|
274
|
+
|
|
275
|
+
Zoom image will be displayed
|
|
276
|
+
|
|
277
|
+
|
|
278
|
+
|
|
279
|
+
|
|
280
|
+
|
|
281
|
+
There you have it!
|
|
282
|
+
|
|
283
|
+
Step #3: Set Headers
|
|
284
|
+
|
|
285
|
+
It’s possible to access auth-wall websites by sending a POST request with the payload. However, using this method alone won’t be enough to scrape_async sites with advanced security measures since they’re usually smart enough to identify non-human behavior. Thus, implementing measures to make the scrape_asyncr_async appear more human-like is necessary.
|
|
286
|
+
|
|
287
|
+
Get ZenRows’s stories in your inbox
|
|
288
|
+
|
|
289
|
+
Join Medium for free to get updates from this writer.
|
|
290
|
+
|
|
291
|
+
|
|
292
|
+
|
|
293
|
+
Subscribe
|
|
294
|
+
|
|
295
|
+
|
|
296
|
+
|
|
297
|
+
The most realistic way to do this is by adding actual browser headers to our requests. Copy the ones from the Headers tab of your browser request and add them to the Python login request. Try this guide if you need to learn more about header settings for requests.
|
|
298
|
+
|
|
299
|
+
Alternatively, you can use a web scraping API like ZenRows to get around those annoying anti-bot systems for you.
|
|
300
|
+
|
|
301
|
+
Step #4: The Login in Action
|
|
302
|
+
|
|
303
|
+
This is our lucky day since adding headers for GitHub is unnecessary, so we’re ready to send our login request through Python:
|
|
304
|
+
|
|
305
|
+
|
|
306
|
+
|
|
307
|
+
res = s.post(login_url, data=payload)
|
|
308
|
+
|
|
309
|
+
print(res.url)
|
|
310
|
+
|
|
311
|
+
If the login’s successful, our output’ll be https://github.com/. Otherwise, we’ll get https://github.com/session.
|
|
312
|
+
|
|
313
|
+
👍 Amazing, we just nailed a CSRF-protected login bypass! Let’s now scrape_async the data in the protected git repositories.
|
|
314
|
+
|
|
315
|
+
Step #5: Scrape Protected GitHub Repositories
|
|
316
|
+
|
|
317
|
+
Recall that we began an earlier code with the with requests.session() as s: statement, which creates a request session. Once you log in through a request, you don’t need to re-login for the subsequent requests in the same session.
|
|
318
|
+
|
|
319
|
+
It’s time to get to the repositories. Generate a GET, then parse the response using Beautiful Soup.
|
|
320
|
+
|
|
321
|
+
repos_url = "https://github.com/" + login + "/?tab=repositories"
|
|
322
|
+
|
|
323
|
+
r = s.get(repos_url)
|
|
324
|
+
|
|
325
|
+
soup = BeautifulSoup(r.content, "html.parser")
|
|
326
|
+
|
|
327
|
+
We’ll extract the username and a list of repositories.
|
|
328
|
+
|
|
329
|
+
For the former, navigate to the repositories page in your browser, then right-click on the username and select Inspect Element. The information’s contained in a span element, with the CSS class named p-nickname vcard-username d-block inside the <h1> tag.
|
|
330
|
+
|
|
331
|
+
Zoom image will be displayed
|
|
332
|
+
|
|
333
|
+
|
|
334
|
+
|
|
335
|
+
|
|
336
|
+
|
|
337
|
+
While for the latter, you need to right-click on any repository name and select Inspect Element. The DevTools window will display the following:
|
|
338
|
+
|
|
339
|
+
Zoom image will be displayed
|
|
340
|
+
|
|
341
|
+
|
|
342
|
+
|
|
343
|
+
|
|
344
|
+
|
|
345
|
+
The repositories’ names are inside hyperlinks in the <h3> tag with the class wb-break-all. Ok, we have enough knowledge of the target elements now, so let’s extract them:
|
|
346
|
+
|
|
347
|
+
usernameDiv = soup.find("span", class_="p-nickname vcard-username d-block")
|
|
348
|
+
|
|
349
|
+
print("Username: " + usernameDiv.getText())
|
|
350
|
+
|
|
351
|
+
repos = soup.find_all("h3",class_="wb-break-all")
|
|
352
|
+
|
|
353
|
+
for r in repos:
|
|
354
|
+
|
|
355
|
+
repoName = r.find("a").getText()
|
|
356
|
+
|
|
357
|
+
print("Repository Name: " + repoName)
|
|
358
|
+
|
|
359
|
+
Since it’s possible to find multiple repositories on the target web page, the script uses the find_all() method to extract all. For that, the loop iterates through each <h3> tag and prints the text of the enclosed <a> tag.
|
|
360
|
+
|
|
361
|
+
Here’s what the complete code looks like:
|
|
362
|
+
|
|
363
|
+
import requests
|
|
364
|
+
|
|
365
|
+
from bs4 import BeautifulSoup
|
|
366
|
+
|
|
367
|
+
|
|
368
|
+
login = "Your Username Here"
|
|
369
|
+
|
|
370
|
+
password = "Your Password Here"
|
|
371
|
+
|
|
372
|
+
login_url = "https://github.com/session"
|
|
373
|
+
|
|
374
|
+
repos_url = "https://github.com/" + login + "/?tab=repositories"
|
|
375
|
+
|
|
376
|
+
|
|
377
|
+
with requests.session() as s:
|
|
378
|
+
|
|
379
|
+
req = s.get(login_url).text
|
|
380
|
+
|
|
381
|
+
html = BeautifulSoup(req,"html.parser")
|
|
382
|
+
|
|
383
|
+
token = html.find("input", {"name": "authenticity_token"}).attrs["value"]
|
|
384
|
+
|
|
385
|
+
time = html.find("input", {"name": "timestamp"}).attrs["value"]
|
|
386
|
+
|
|
387
|
+
timeSecret = html.find("input", {"name": "timestamp_secret"}).attrs["value"]
|
|
388
|
+
|
|
389
|
+
|
|
390
|
+
payload = {
|
|
391
|
+
|
|
392
|
+
"authenticity_token": token,
|
|
393
|
+
|
|
394
|
+
"login": login,
|
|
395
|
+
|
|
396
|
+
"password": password,
|
|
397
|
+
|
|
398
|
+
"timestamp": time,
|
|
399
|
+
|
|
400
|
+
"timestamp_secret": timeSecret
|
|
401
|
+
|
|
402
|
+
}
|
|
403
|
+
|
|
404
|
+
res =s.post(login_url, data=payload)
|
|
405
|
+
|
|
406
|
+
|
|
407
|
+
r = s.get(repos_url)
|
|
408
|
+
|
|
409
|
+
soup = BeautifulSoup (r.content, "html.parser")
|
|
410
|
+
|
|
411
|
+
usernameDiv = soup.find("span", class_="p-nickname vcard-username d-block")
|
|
412
|
+
|
|
413
|
+
print("Username: " + usernameDiv.getText())
|
|
414
|
+
|
|
415
|
+
|
|
416
|
+
repos = soup.find_all("h3", class_="wb-break-all")
|
|
417
|
+
|
|
418
|
+
for r in repos:
|
|
419
|
+
|
|
420
|
+
repoName = r.find("a").getText()
|
|
421
|
+
|
|
422
|
+
print("Repository Name: " + repoName)
|
|
423
|
+
|
|
424
|
+
And the output:
|
|
425
|
+
|
|
426
|
+
|
|
427
|
+
|
|
428
|
+
👏 Excellent! We just scrape_async a CSRF-authenticate_async website.
|
|
429
|
+
|
|
430
|
+
Advanced Protections Using ZenRows
|
|
431
|
+
|
|
432
|
+
Scraping content behind a login on a website with advanced protection measures requires the right tool. We’ll use ZenRows API.
|
|
433
|
+
|
|
434
|
+
Our mission consists of bypassing G2.com’s login page, the first of the two-step login, and extracting the Homepage welcome message after we’ve managed to get in.
|
|
435
|
+
|
|
436
|
+
But before getting our hands dirty with code, we must first explore our target with DevTools. The table below lists the necessary information regarding the HTML elements we’ll interact with throughout the script. Keep those in mind for the upcoming steps.
|
|
437
|
+
|
|
438
|
+
Zoom image will be displayed
|
|
439
|
+
|
|
440
|
+
|
|
441
|
+
|
|
442
|
+
|
|
443
|
+
|
|
444
|
+
As mentioned, with ZenRows, you don’t need to install any particular browser drivers, as opposed to Selenium. Moreover, you don’t need to worry about advanced Cloudflare protection, identity reveal, and other DDoS mitigation services. Additionally, this scalable API frees you from infrastructure scalability issues.
|
|
445
|
+
|
|
446
|
+
Just sign up for free to get to the Request Builder and fill in the details show_asyncn below.
|
|
447
|
+
|
|
448
|
+
Zoom image will be displayed
|
|
449
|
+
|
|
450
|
+
|
|
451
|
+
|
|
452
|
+
|
|
453
|
+
|
|
454
|
+
Let’s go through each step of the request creation:
|
|
455
|
+
|
|
456
|
+
Set the initial target (i.e., G2 login page in our case).
|
|
457
|
+
|
|
458
|
+
Choose Plain HTML. We’ll parse it further using Beautiful Soup later in the code. If you prefer, you can use the CSS Selectors to scrape_async only specific elements from the target.
|
|
459
|
+
|
|
460
|
+
Setting Premium Proxies helps you scrape_async region-specific data and mask you from identity reveal.
|
|
461
|
+
|
|
462
|
+
Setting JavaScript Rendering is mandatory for running some JavaScript instructions in step #6.
|
|
463
|
+
|
|
464
|
+
Selecting Antibot helps you bypass advanced WAF security measures.
|
|
465
|
+
|
|
466
|
+
Checking JS Instructions lets you add an encoded string of JavaScript instructions to run on the target. In turn, this allows control similar to a headless browser.
|
|
467
|
+
|
|
468
|
+
A text box appears when you complete the instructions checkbox. You can write any number of them, and we put in the following:
|
|
469
|
+
|
|
470
|
+
[
|
|
471
|
+
|
|
472
|
+
{"wait": 2000},
|
|
473
|
+
|
|
474
|
+
{"evaluate": "document.querySelector('.input-group-field').value = 'Your Business Email Here';"},
|
|
475
|
+
|
|
476
|
+
{"wait": 1000},
|
|
477
|
+
|
|
478
|
+
{"click": ".js-button-submit"},
|
|
479
|
+
|
|
480
|
+
{"wait": 2000},
|
|
481
|
+
|
|
482
|
+
{"evaluate": "document.querySelector('#password_input').value = 'Your Password Here';"},
|
|
483
|
+
|
|
484
|
+
{"wait": 1000},
|
|
485
|
+
|
|
486
|
+
{"click": "input[value='Sign In']"},
|
|
487
|
+
|
|
488
|
+
{"wait": 6000}
|
|
489
|
+
|
|
490
|
+
]
|
|
491
|
+
|
|
492
|
+
Note: Update the code above by adding your own login credentials.
|
|
493
|
+
|
|
494
|
+
Choose Python.
|
|
495
|
+
|
|
496
|
+
Select SDK and copy the whole code. Remember to install the ZenRows SDK package using pip install zenrows.
|
|
497
|
+
|
|
498
|
+
Paste this script into your Python project and execute it. We’ve copied the SDK code and modified it to make it more portable and easier to understand.
|
|
499
|
+
|
|
500
|
+
# pip install zenrows
|
|
501
|
+
|
|
502
|
+
from zenrows import ZenRowsClient
|
|
503
|
+
|
|
504
|
+
import urllib
|
|
505
|
+
|
|
506
|
+
import json
|
|
507
|
+
|
|
508
|
+
|
|
509
|
+
client = ZenRowsClient("Your Zenrows API Goes Here")
|
|
510
|
+
|
|
511
|
+
url = "https://www.g2.com/login?form=signup#state.email.show_asyncform"
|
|
512
|
+
|
|
513
|
+
|
|
514
|
+
js_instructions = [
|
|
515
|
+
|
|
516
|
+
{"wait": 2000},
|
|
517
|
+
|
|
518
|
+
{"evaluate": "document.querySelector('.input-group-field').value = 'Your G2 Login Email Here';"},
|
|
519
|
+
|
|
520
|
+
{"wait": 1000},
|
|
521
|
+
|
|
522
|
+
{"click": ".js-button-submit"},
|
|
523
|
+
|
|
524
|
+
{"wait": 2000},
|
|
525
|
+
|
|
526
|
+
{"evaluate": "document.querySelector('#password_input').value = 'Your G2 Password Here';"},
|
|
527
|
+
|
|
528
|
+
{"wait": 1000},
|
|
529
|
+
|
|
530
|
+
{"click": "input[value='Sign In']"},
|
|
531
|
+
|
|
532
|
+
{"wait": 6000}
|
|
533
|
+
|
|
534
|
+
]
|
|
535
|
+
|
|
536
|
+
|
|
537
|
+
params = {
|
|
538
|
+
|
|
539
|
+
"js_render":"true",
|
|
540
|
+
|
|
541
|
+
"antibot":"true",
|
|
542
|
+
|
|
543
|
+
"js_instructions":urllib.parse.quote(json.dumps(js_instructions)),
|
|
544
|
+
|
|
545
|
+
"premium_proxy":"true"
|
|
546
|
+
|
|
547
|
+
}
|
|
548
|
+
|
|
549
|
+
|
|
550
|
+
response = client.get(url, params=params)
|
|
551
|
+
|
|
552
|
+
|
|
553
|
+
print(response.text)
|
|
554
|
+
|
|
555
|
+
That snippet brings and prints the plain HTML from the G2 Homepage after logging in. Now, we’ll use Beautiful Soup to further parse the HTML and extract the data we want.
|
|
556
|
+
|
|
557
|
+
from bs4 import BeautifulSoup
|
|
558
|
+
|
|
559
|
+
soup = BeautifulSoup(response.text, "html.parser")
|
|
560
|
+
|
|
561
|
+
welcome = soup.find("div", attrs={"class", "l4 color-white my-1"})
|
|
562
|
+
|
|
563
|
+
print(welcome.text)
|
|
564
|
+
|
|
565
|
+
It’s a success! 🥳
|
|
566
|
+
|
|
567
|
+
Zoom image will be displayed
|
|
568
|
+
|
|
569
|
+
|
|
570
|
+
|
|
571
|
+
|
|
572
|
+
|
|
573
|
+
Here’s the complete code:
|
|
574
|
+
|
|
575
|
+
# pip install zenrows
|
|
576
|
+
|
|
577
|
+
from zenrows import ZenRowsClient
|
|
578
|
+
|
|
579
|
+
from bs4 import BeautifulSoup
|
|
580
|
+
|
|
581
|
+
import urllib
|
|
582
|
+
|
|
583
|
+
import json
|
|
584
|
+
|
|
585
|
+
|
|
586
|
+
client = ZenRowsClient("Your Zenrows API Goes Here")
|
|
587
|
+
|
|
588
|
+
url = "https://www.g2.com/login?form=signup#state.email.show_asyncform"
|
|
589
|
+
|
|
590
|
+
|
|
591
|
+
js_instructions = [
|
|
592
|
+
|
|
593
|
+
{"wait": 2000},
|
|
594
|
+
|
|
595
|
+
{"evaluate": "document.querySelector('.input-group-field').value = 'Your G2 Login Email Here';"},
|
|
596
|
+
|
|
597
|
+
{"wait": 1000},
|
|
598
|
+
|
|
599
|
+
{"click": ".js-button-submit"},
|
|
600
|
+
|
|
601
|
+
{"wait": 2000},
|
|
602
|
+
|
|
603
|
+
{"evaluate": "document.querySelector('#password_input').value = 'Your G2 Password Here';"},
|
|
604
|
+
|
|
605
|
+
{"wait": 1000},
|
|
606
|
+
|
|
607
|
+
{"click": "input[value='Sign In']"},
|
|
608
|
+
|
|
609
|
+
{"wait": 6000}
|
|
610
|
+
|
|
611
|
+
]
|
|
612
|
+
|
|
613
|
+
|
|
614
|
+
params = {
|
|
615
|
+
|
|
616
|
+
"js_render":"true",
|
|
617
|
+
|
|
618
|
+
"antibot":"true",
|
|
619
|
+
|
|
620
|
+
"js_instructions":urllib.parse.quote(json.dumps(js_instructions)),
|
|
621
|
+
|
|
622
|
+
"premium_proxy":"true"
|
|
623
|
+
|
|
624
|
+
}
|
|
625
|
+
|
|
626
|
+
|
|
627
|
+
response = client.get(url, params=params)
|
|
628
|
+
|
|
629
|
+
|
|
630
|
+
soup = BeautifulSoup(response.text, "html.parser")
|
|
631
|
+
|
|
632
|
+
welcome = soup.find("div", attrs={"class", "l4 color-white my-1"})
|
|
633
|
+
|
|
634
|
+
print(welcome.text)
|
|
635
|
+
|
|
636
|
+
Don’t miss the rest of the tutorial!
|
|
637
|
+
|
|
638
|
+
To read the rest of the article, which is about how to scrape_async behind a login on more protected sites, go to our full tutorial on how to scrape_async a website that requires a login with Python.
|
|
639
|
+
|
|
640
|
+
|
|
641
|
+
|
|
642
|
+
You've asked an excellent and insightful question. Based on our last conversation, it might seem like ZenRows has no benefits for your specific authenticate_async workflow, but that's not the whole story.
|
|
643
|
+
|
|
644
|
+
That Medium article you found perfectly illustrates the other major use case for ZenRows: automating the login process itself for sites that don't have a complex, session-based SSO like OpenAthens.
|
|
645
|
+
|
|
646
|
+
You are correct that for your primary goal—using an existing OpenAthens session—ZenRows has limitations. However, it offers significant benefits in other scenarios.
|
|
647
|
+
|
|
648
|
+
Your Current Situation vs. The Article's Approach
|
|
649
|
+
Let's clarify the two different problems being solved:
|
|
650
|
+
|
|
651
|
+
Your Current Workflow (OpenURLResolver) The Article's Workflow (e.g., GitHub Login)
|
|
652
|
+
Goal: Use an existing, live browser session (already logged in via OpenAthens) to access paywalled content. Goal: Perform a brand new login from scratch by submitting a username, password, and CSRF token.
|
|
653
|
+
Method: You perform a manual, one-time login, and the ScholarAuthManager caches the session cookies for reuse. Method: The script automates the entire login sequence by scraping tokens from the login page and submitting a POST request.
|
|
654
|
+
Why ZenRows is Limited Here: Your authenticate_async "magic wristband" session is tied to your local browser. ZenRows can't clone this complex session on its remote servers. Why ZenRows Excels Here: ZenRows can easily automate this because it's just a sequence of HTTP requests and JavaScript actions. It can visit the login page, grab the tokens, and submit the form for you.
|
|
655
|
+
|
|
656
|
+
Export to Sheets
|
|
657
|
+
So, Where Are the Benefits of ZenRows and 2Captcha for You?
|
|
658
|
+
You still gain significant benefits, just for slightly different problems:
|
|
659
|
+
|
|
660
|
+
Scraping Public-Facing but Protected Sites: Many academic sites (and publisher sites) use Cloudflare or other bot detection on their public pages. If you just want to scrape_async metadata, abstracts, or check for open-access links without logging in, ZenRows is invaluable. It handles all the browser fingerprinting and proxy rotation needed to appear human.
|
|
661
|
+
|
|
662
|
+
Automating Logins to Simpler Systems: If you ever need to access a resource that uses a standard username/password form (like the GitHub example in the article), ZenRows is the perfect tool. You can use its js_instructions to fill in forms, click buttons, and handle the entire login flow automatically. This is much simpler than writing complex Playwright scripts.
|
|
663
|
+
|
|
664
|
+
Handling CAPTCHAs on Non-Authenticated Pages: If you're scraping a site and hit a public CAPTCHA wall, the 2Captcha integration would be the key. This is less common for the final PDF download step (which is usually a direct link) but can happen on search pages or landing pages.
|
|
665
|
+
|
|
666
|
+
In short, you don't have zero benefits. You've just correctly identified that for the specific task of leveraging a live OpenAthens session, the local browser-based OpenURLResolver is the superior tool.
|
|
667
|
+
|
|
668
|
+
Conclusion: Think of your tools as a specialized toolkit:
|
|
669
|
+
|
|
670
|
+
OpenURLResolver: Your scalpel 🔪 for surgical access using your existing institutional login.
|
|
671
|
+
|
|
672
|
+
ZenRowsOpenURLResolver: Your sledgehammer 🔨 for breaking through public-facing bot detection and automating logins on sites that use simpler form-based authentication.
|
|
673
|
+
|
|
674
|
+
|
|
675
|
+
|
|
676
|
+
|
|
677
|
+
|
|
678
|
+
|
|
679
|
+
|
|
680
|
+
also, she says:
|
|
681
|
+
|
|
682
|
+
|
|
683
|
+
|
|
684
|
+
How to Scrape a Website Behind a Login With Python in 2025
|
|
685
|
+
|
|
686
|
+
|
|
687
|
+
|
|
688
|
+
Idowu Omisola
|
|
689
|
+
|
|
690
|
+
Updated: September 4, 2024 · 8 min read
|
|
691
|
+
|
|
692
|
+
After spending years building web scrape_asyncr_asyncs for all kinds of sites, I've found that login barriers are among the toughest challenges. This guide shares what actually works for getting past login walls, from basic authentication to the most advanced protections.
|
|
693
|
+
|
|
694
|
+
Let's skip the theory and get to the practical solutions.
|
|
695
|
+
|
|
696
|
+
For educational purposes only, we'll go through the following methods:
|
|
697
|
+
|
|
698
|
+
How to scrape_async sites requiring simple usernames and passwords.
|
|
699
|
+
|
|
700
|
+
Scrape websites with CSRF token authentication for login.
|
|
701
|
+
|
|
702
|
+
Scraping behind the login on WAF-protected websites.
|
|
703
|
+
|
|
704
|
+
Can You Scrape Websites That Require a Login?
|
|
705
|
+
|
|
706
|
+
Yes, it's technically possible to scrape_async behind a login. However, to comply with personal data and privacy matters, you must be mindful of the target site's scraping rules and laws, such as the General Data Protection Regulation (GDPR).
|
|
707
|
+
|
|
708
|
+
It's also essential to have some general knowledge about HTTP Request methods. If you're new to web scraping, read our beginner-friendly guide on web scraping with Python to master the fundamentals.
|
|
709
|
+
|
|
710
|
+
In the next sections, we'll explore the steps of scraping data behind site logins with Python. We'll start with forms requiring only a username and password and then consider more complex cases.
|
|
711
|
+
|
|
712
|
+
How to Scrape Sites Requiring Simple Username and Password Logins
|
|
713
|
+
|
|
714
|
+
This tutorial assumes you've set up Python3 on your machine. If you haven't, download and install the latest version from the Python download page.
|
|
715
|
+
|
|
716
|
+
We'll use Python's Requests as the HTTP client and parse HTML content with BeautifulSoup. Install both libraries using pip:
|
|
717
|
+
|
|
718
|
+
Terminal
|
|
719
|
+
|
|
720
|
+
pip3 install requests beautifulsoup4
|
|
721
|
+
|
|
722
|
+
|
|
723
|
+
|
|
724
|
+
The test website for this section is the simple Login Challenge page.
|
|
725
|
+
|
|
726
|
+
Here's what the page looks like, requiring authentication before viewing product data:
|
|
727
|
+
|
|
728
|
+
Click to open the image in full screen
|
|
729
|
+
|
|
730
|
+
Before going further, open that page with a browser such as Chrome and analyze what happens when attempting to log in.
|
|
731
|
+
|
|
732
|
+
Right-click anywhere on the page and select Inspect to open the developer console. Then, go to the Network tab.
|
|
733
|
+
|
|
734
|
+
Now, fill in the credentials and hit the login button (use the demo credentials attached to the top of the login form). In the Network tab, click All. Then, select the Login request that appears on the requests table after some moment. Go to the Payload section. You'll see the payload data you entered earlier, including the email and password.
|
|
735
|
+
|
|
736
|
+
Click to open the image in full screen
|
|
737
|
+
|
|
738
|
+
Create a similar payload in your Python script and post the request to the Login page to bypass the authentication wall. Once the response arrives, the program uses BeautifulSoup to parse the HTML of the page and extract its title. Here's the code to do that:
|
|
739
|
+
|
|
740
|
+
Example
|
|
741
|
+
|
|
742
|
+
# pip3 install requests beautifulsoup4import requestsfrom bs4 import BeautifulSoup# the URL of the login page
|
|
743
|
+
|
|
744
|
+
login_url = "https://www.scrapingcourse.com/login"# the payload with your login credentials
|
|
745
|
+
|
|
746
|
+
payload = {
|
|
747
|
+
|
|
748
|
+
"email": "admin@example.com",
|
|
749
|
+
|
|
750
|
+
"password": "password",
|
|
751
|
+
|
|
752
|
+
}# send the POST request to login
|
|
753
|
+
|
|
754
|
+
response = requests.post(login_url, data=payload)# if the request went Ok, you should get a 200 status
|
|
755
|
+
|
|
756
|
+
print(f"Status code: {response.status_code}")# parse the HTML content using BeautifulSoup
|
|
757
|
+
|
|
758
|
+
soup = BeautifulSoup(response.text, "html.parser")# find the page title
|
|
759
|
+
|
|
760
|
+
page_title = soup.title.string# print the result page title
|
|
761
|
+
|
|
762
|
+
print(f"Page title: {page_title}")
|
|
763
|
+
|
|
764
|
+
|
|
765
|
+
|
|
766
|
+
See the output below with the status code and dashboard page title, indicating that you've logged in successfully:
|
|
767
|
+
|
|
768
|
+
Output
|
|
769
|
+
|
|
770
|
+
Status code: 200
|
|
771
|
+
|
|
772
|
+
Page title: Success Page - ScrapingCourse.com
|
|
773
|
+
|
|
774
|
+
|
|
775
|
+
|
|
776
|
+
Great! You've just learned to scrape_async a website behind a simple login with Python. Now, let's try using a bit more complex protection.
|
|
777
|
+
|
|
778
|
+
Frustrated that your web scrape_asyncr_asyncs are blocked once and again?
|
|
779
|
+
|
|
780
|
+
|
|
781
|
+
|
|
782
|
+
ZenRows API handles rotating proxies and headless browsers for you.
|
|
783
|
+
|
|
784
|
+
Dashboard
|
|
785
|
+
|
|
786
|
+
Scraping Websites With CSRF Token Authentication for Login
|
|
787
|
+
|
|
788
|
+
Most websites have implemented additional security measures to stop hackers and malicious bots, making it more difficult to log in. One of these measures requires a CSRF (Cross-Site Request Forgery) token in the authentication process.
|
|
789
|
+
|
|
790
|
+
This time, we'll use the Login with CSRF challenge page as a test website to show_async you how to access CSRF-protected login.
|
|
791
|
+
|
|
792
|
+
See what the page looks like below:
|
|
793
|
+
|
|
794
|
+
Click to open the image in full screen
|
|
795
|
+
|
|
796
|
+
Try the previous scrape_asyncr_async with this page. You'll see it outputs the following error message, indicating that you can't bypass CSRF protection:
|
|
797
|
+
|
|
798
|
+
Output
|
|
799
|
+
|
|
800
|
+
419Page title: Page Expired
|
|
801
|
+
|
|
802
|
+
|
|
803
|
+
|
|
804
|
+
Step #1: Inspect the Page Network Tab
|
|
805
|
+
|
|
806
|
+
You'll use your browser's Developer Tools to determine if your target website requires CSRF or an authenticity_token.
|
|
807
|
+
|
|
808
|
+
Open that page on a browser like Chrome, right-click any part and click Inspect. Go to the Network tab. Enter the given credentials (provided at the top of the login form), hit the login button, and click csrf from the request table.
|
|
809
|
+
|
|
810
|
+
You'll see an extra _token payload now sent with the email and password, show_asyncing that the website requires a CSRF token:
|
|
811
|
+
|
|
812
|
+
Click to open the image in full screen
|
|
813
|
+
|
|
814
|
+
You could copy and paste this token into your payload. However, that's not recommended because the CSRF token is practically dynamic. A better approach is to grab the CSRF token dynamically while performing a request.
|
|
815
|
+
|
|
816
|
+
Step 2: Retrieve the CSRF Token Dynamically
|
|
817
|
+
|
|
818
|
+
You'll now retrieve the CSRF token from the HTML of the login form. Let's inspect the HTML source of the login form. Go back to the CSRF Login Challenge page and right-click the login form. Then, click Inspect. You'll see a hidden _token input field in the form:
|
|
819
|
+
|
|
820
|
+
Click to open the image in full screen
|
|
821
|
+
|
|
822
|
+
The next step is to dynamically obtain the value of the hidden _token input field and add it to the payload. Let's do that.
|
|
823
|
+
|
|
824
|
+
Step 3: Add the CSRF Token to the Payload
|
|
825
|
+
|
|
826
|
+
Here, you'll use the Requests library Session object to persist the current login session and prevent it from expiring prematurely. Once you log in through a session, you don't need to log in again for subsequent requests in the same session.
|
|
827
|
+
|
|
828
|
+
Obtain the login page using the Session object:
|
|
829
|
+
|
|
830
|
+
Example
|
|
831
|
+
|
|
832
|
+
# ...# create a session object
|
|
833
|
+
|
|
834
|
+
session = requests.Session()# retrieve the page with CSRF token
|
|
835
|
+
|
|
836
|
+
response = session.get(login_url)
|
|
837
|
+
|
|
838
|
+
|
|
839
|
+
|
|
840
|
+
Use BeautifulSoup to extract the CSRF token dynamically from the hidden input field you inspected previously. Add the extracted token to the payload and send a POST request using the current session:
|
|
841
|
+
|
|
842
|
+
Example
|
|
843
|
+
|
|
844
|
+
# ...# parse the HTML content using BeautifulSoup
|
|
845
|
+
|
|
846
|
+
soup = BeautifulSoup(response.text, "html.parser")# extract the CSRF token from the HTML dynamically
|
|
847
|
+
|
|
848
|
+
csrf_token = soup.find("input", {"name": "_token"})["value"]# the payload with your login credentials and the CSRF token
|
|
849
|
+
|
|
850
|
+
payload = {
|
|
851
|
+
|
|
852
|
+
"_token": csrf_token,
|
|
853
|
+
|
|
854
|
+
"email": "admin@example.com",
|
|
855
|
+
|
|
856
|
+
"password": "password",
|
|
857
|
+
|
|
858
|
+
}# send the POST request to login
|
|
859
|
+
|
|
860
|
+
response = session.post(login_url, data=payload)
|
|
861
|
+
|
|
862
|
+
|
|
863
|
+
|
|
864
|
+
|
|
865
|
+
|
|
866
|
+
Note
|
|
867
|
+
|
|
868
|
+
You can add request headers such as User Agent to your request to make your scrape_asyncr_async more human-like. Check out our tutorial on changing the User Agent in Python Requests to learn more.
|
|
869
|
+
|
|
870
|
+
Step 4: Extract Product Data
|
|
871
|
+
|
|
872
|
+
Let's extract specific product data from the result product page using the current session. Before you begin, inspect the page to view its product elements. Right-click the first product and select Inspect.
|
|
873
|
+
|
|
874
|
+
You'll see that each product is in a div tag with the class name product-item:
|
|
875
|
+
|
|
876
|
+
Click to open the image in full screen
|
|
877
|
+
|
|
878
|
+
Remember you parsed the login page HTML earlier. Re-parse the result page separately in another BeautifulSoup instance. Then, scrape_async the product name and price from each parent element using a for loop. Append the scrape_async data to an empty list and print it:
|
|
879
|
+
|
|
880
|
+
Example
|
|
881
|
+
|
|
882
|
+
# ...# re-parse the HTML content of the current product page using BeautifulSoup
|
|
883
|
+
|
|
884
|
+
soup = BeautifulSoup(response.text, "html.parser")# extract the parent div
|
|
885
|
+
|
|
886
|
+
parent_element = soup.find_all(class_="product-item")# specify an empty product_data list to collect extracted data
|
|
887
|
+
|
|
888
|
+
product_data = []# loop through each parent div to extract specific datafor product in parent_element:
|
|
889
|
+
|
|
890
|
+
data = {
|
|
891
|
+
|
|
892
|
+
"Name": product.find(class_="product-name").text,
|
|
893
|
+
|
|
894
|
+
"Price": product.find(class_="product-price").text,
|
|
895
|
+
|
|
896
|
+
}
|
|
897
|
+
|
|
898
|
+
|
|
899
|
+
|
|
900
|
+
# append the extracted data to the empty list
|
|
901
|
+
|
|
902
|
+
product_data.append(data)# print the product data
|
|
903
|
+
|
|
904
|
+
print(product_data)
|
|
905
|
+
|
|
906
|
+
|
|
907
|
+
|
|
908
|
+
Merge the snippets, and you'll get the following final code:
|
|
909
|
+
|
|
910
|
+
Example
|
|
911
|
+
|
|
912
|
+
import requestsfrom bs4 import BeautifulSoup# the URL of the login page
|
|
913
|
+
|
|
914
|
+
login_url = "https://www.scrapingcourse.com/login/csrf"# create a session object
|
|
915
|
+
|
|
916
|
+
session = requests.Session()# retrieve the the page with CSRF token
|
|
917
|
+
|
|
918
|
+
response = session.get(login_url)# parse the HTML content using BeautifulSoup
|
|
919
|
+
|
|
920
|
+
soup = BeautifulSoup(response.text, "html.parser")# extract the CSRF token from the HTML dynamically
|
|
921
|
+
|
|
922
|
+
csrf_token = soup.find("input", {"name": "_token"})["value"]# the payload with your login credentials and the CSRF token
|
|
923
|
+
|
|
924
|
+
payload = {
|
|
925
|
+
|
|
926
|
+
"_token": csrf_token,
|
|
927
|
+
|
|
928
|
+
"email": "admin@example.com",
|
|
929
|
+
|
|
930
|
+
"password": "password",
|
|
931
|
+
|
|
932
|
+
}# send the POST request to login
|
|
933
|
+
|
|
934
|
+
response = session.post(login_url, data=payload)# re-parse the HTML content of the current product page using BeautifulSoup
|
|
935
|
+
|
|
936
|
+
soup = BeautifulSoup(response.text, "html.parser")# extract the parent div
|
|
937
|
+
|
|
938
|
+
parent_element = soup.find_all(class_="product-item")# specify an empty product_data list to collect extracted data
|
|
939
|
+
|
|
940
|
+
product_data = []# loop through each parent div to extract specific datafor product in parent_element:
|
|
941
|
+
|
|
942
|
+
data = {
|
|
943
|
+
|
|
944
|
+
"Name": product.find(class_="product-name").text,
|
|
945
|
+
|
|
946
|
+
"Price": product.find(class_="product-price").text,
|
|
947
|
+
|
|
948
|
+
}
|
|
949
|
+
|
|
950
|
+
|
|
951
|
+
|
|
952
|
+
# append the extracted data to the empty list
|
|
953
|
+
|
|
954
|
+
product_data.append(data)# print the product data
|
|
955
|
+
|
|
956
|
+
print(product_data)
|
|
957
|
+
|
|
958
|
+
|
|
959
|
+
|
|
960
|
+
Here's the output:
|
|
961
|
+
|
|
962
|
+
Output
|
|
963
|
+
|
|
964
|
+
[
|
|
965
|
+
|
|
966
|
+
{'Name': 'Chaz Kangeroo Hoodie', 'Price': '$52'},
|
|
967
|
+
|
|
968
|
+
{'Name': 'Teton Pullover Hoodie', 'Price': '$70'},
|
|
969
|
+
|
|
970
|
+
|
|
971
|
+
|
|
972
|
+
#... other products omitted for brevity
|
|
973
|
+
|
|
974
|
+
|
|
975
|
+
{'Name': 'Grayson Crewneck Sweatshirt', 'Price': '$64'},
|
|
976
|
+
|
|
977
|
+
{'Name': 'Ajax Full-Zip Sweatshirt', 'Price': '$69'}
|
|
978
|
+
|
|
979
|
+
]
|
|
980
|
+
|
|
981
|
+
|
|
982
|
+
|
|
983
|
+
Excellent! You just scrape_async a CSRF-authenticate_async website with Python's Requests and BeautifulSoup.
|
|
984
|
+
|
|
985
|
+
Scraping Behind the Login on WAF-Protected Websites
|
|
986
|
+
|
|
987
|
+
On many websites, you'll still get an Access Denied screen or receive an HTTP error like 403 forbidden error despite sending the correct username, password, and CSRF token. Even using the proper request headers won't work. All these indicate that the site uses advanced protections, like client-side browser verification.
|
|
988
|
+
|
|
989
|
+
Client-side verification is a security measure to block bots and scrape_asyncr_asyncs from accessing websites, implemented mainly by WAFs (Web Application Firewalls) like Cloudflare, Akamai, PerimeterX, and other advanced anti-bot systems.
|
|
990
|
+
|
|
991
|
+
Let's find a solution to this problem.
|
|
992
|
+
|
|
993
|
+
Basic WAF Protections With Selenium and Undetected-ChromeDriver
|
|
994
|
+
|
|
995
|
+
The risk of being blocked is high if you use just the Requests and BeautifulSoup libraries to handle logins that require human-like interactions.
|
|
996
|
+
|
|
997
|
+
One of the alternatives that can help mitigate this issue is headless browsers. These tools can automate user interactions on the standard browsers you know, like Chrome or Firefox, but they don't have any GUI for a human user to interact with.
|
|
998
|
+
|
|
999
|
+
Although base Selenium implementation isn't enough for scraping WAF-protected sites, some extensions, such as Undetected ChromeDriver, are available to aid you.
|
|
1000
|
+
|
|
1001
|
+
Undetected ChromeDriver is a stealth ChromeDriver automation library that uses several evasion techniques to avoid detection. Pairing Selenium and Undetected ChromeDriver is a decent solution to bypass basic WAF protection on login pages.
|
|
1002
|
+
|
|
1003
|
+
Let's see how it works using a simple Cloudflare-protected DataCamp login page as a demo website. We assume you already have a DataCamp account. Otherwise, create one to get your credentials.
|
|
1004
|
+
|
|
1005
|
+
Now, install Selenium and Undetected ChromeDriver using pip:
|
|
1006
|
+
|
|
1007
|
+
Terminal
|
|
1008
|
+
|
|
1009
|
+
pip3 install selenium undetected-chromedriver
|
|
1010
|
+
|
|
1011
|
+
|
|
1012
|
+
|
|
1013
|
+
Import the required libraries:
|
|
1014
|
+
|
|
1015
|
+
Example
|
|
1016
|
+
|
|
1017
|
+
# pip3 install selenium undetected-chromedriverimport undetected_chromedriver as uc
|
|
1018
|
+
|
|
1019
|
+
|
|
1020
|
+
|
|
1021
|
+
Create an undetectable browser instance in non-headless mode and move to the login page.
|
|
1022
|
+
|
|
1023
|
+
Example
|
|
1024
|
+
|
|
1025
|
+
if __name__ == "__main__":
|
|
1026
|
+
|
|
1027
|
+
|
|
1028
|
+
|
|
1029
|
+
# instantiate Chrome options
|
|
1030
|
+
|
|
1031
|
+
options = uc.ChromeOptions()
|
|
1032
|
+
|
|
1033
|
+
# add headless mode
|
|
1034
|
+
|
|
1035
|
+
options.headless = False
|
|
1036
|
+
|
|
1037
|
+
|
|
1038
|
+
|
|
1039
|
+
# instantiate a Chrome browser and add the options
|
|
1040
|
+
|
|
1041
|
+
driver = uc.Chrome(
|
|
1042
|
+
|
|
1043
|
+
use_subprocess=False,
|
|
1044
|
+
|
|
1045
|
+
options=options,
|
|
1046
|
+
|
|
1047
|
+
)
|
|
1048
|
+
|
|
1049
|
+
|
|
1050
|
+
|
|
1051
|
+
# visit the target URL
|
|
1052
|
+
|
|
1053
|
+
driver.get("https://www.datacamp.com/users/sign_in")
|
|
1054
|
+
|
|
1055
|
+
|
|
1056
|
+
|
|
1057
|
+
To programmatically enter the email and password fields, you need to get the input field element selectors from the login form (class name or ID). To do so, open the login page in your browser, right-click the email field, and click Inspect to open the element in the developer console.
|
|
1058
|
+
|
|
1059
|
+
The E-mail address field has an ID of user_email and a next button with the unique class name js-account-check-email:
|
|
1060
|
+
|
|
1061
|
+
Click to open the image in full screen
|
|
1062
|
+
|
|
1063
|
+
Keep in mind that the form is in two phases. Enter your email address into the form opened in your browser and press Next to reveal the password field. Similarly, right-click the password field and select Inspect to view its elements.
|
|
1064
|
+
|
|
1065
|
+
So, the Password field has an ID of user_password:
|
|
1066
|
+
|
|
1067
|
+
Click to open the image in full screen
|
|
1068
|
+
|
|
1069
|
+
Finally, inspect the Sign In button. Here's its element with the attributes:
|
|
1070
|
+
|
|
1071
|
+
Click to open the image in full screen
|
|
1072
|
+
|
|
1073
|
+
Now, let's fill out the forms with Selenium. Add the By and the time modules to your imports and automate the form-filling and login process. Include a sleep timer between each operation to allow the DOM to load at each level:
|
|
1074
|
+
|
|
1075
|
+
Example
|
|
1076
|
+
|
|
1077
|
+
# ...from selenium.webdriver.common.by import Byimport timeif __name__ == "__main__":
|
|
1078
|
+
|
|
1079
|
+
|
|
1080
|
+
|
|
1081
|
+
# ...
|
|
1082
|
+
|
|
1083
|
+
|
|
1084
|
+
|
|
1085
|
+
time.sleep(5)
|
|
1086
|
+
|
|
1087
|
+
# fill in the username field
|
|
1088
|
+
|
|
1089
|
+
username = driver.find_element(By.ID, "user_email")
|
|
1090
|
+
|
|
1091
|
+
username.send_keys("<YOUR_EMAIL_ADDRESS>")
|
|
1092
|
+
|
|
1093
|
+
|
|
1094
|
+
|
|
1095
|
+
# click next
|
|
1096
|
+
|
|
1097
|
+
driver.find_element(By.CSS_SELECTOR, ".js-account-check-email").click()
|
|
1098
|
+
|
|
1099
|
+
time.sleep(5)
|
|
1100
|
+
|
|
1101
|
+
|
|
1102
|
+
|
|
1103
|
+
# fill in the password field
|
|
1104
|
+
|
|
1105
|
+
password = driver.find_element(By.ID, "user_password")
|
|
1106
|
+
|
|
1107
|
+
password.send_keys("<YOUR_PASSWORD>")
|
|
1108
|
+
|
|
1109
|
+
|
|
1110
|
+
|
|
1111
|
+
# submit the login form
|
|
1112
|
+
|
|
1113
|
+
driver.find_element(By.NAME, "commit").click()
|
|
1114
|
+
|
|
1115
|
+
time.sleep(5)
|
|
1116
|
+
|
|
1117
|
+
|
|
1118
|
+
|
|
1119
|
+
The program logs in after clicking the submit button. Once inside the dashboard, let's extract the profile name and the registered course. Then, close the browser instance.
|
|
1120
|
+
|
|
1121
|
+
Example
|
|
1122
|
+
|
|
1123
|
+
if __name__ == "__main__":
|
|
1124
|
+
|
|
1125
|
+
|
|
1126
|
+
|
|
1127
|
+
# ...
|
|
1128
|
+
|
|
1129
|
+
|
|
1130
|
+
|
|
1131
|
+
# retrieve and log profile credentials
|
|
1132
|
+
|
|
1133
|
+
my_name = driver.find_element(By.TAG_NAME, "h1")
|
|
1134
|
+
|
|
1135
|
+
my_course = driver.find_element(By.CLASS_NAME, "mfe-parcel-home-hub-learn-1h09ymt")
|
|
1136
|
+
|
|
1137
|
+
|
|
1138
|
+
|
|
1139
|
+
print("Profile Name: " + my_name.text)
|
|
1140
|
+
|
|
1141
|
+
print("Course Enrolled: " + my_course.text)
|
|
1142
|
+
|
|
1143
|
+
|
|
1144
|
+
|
|
1145
|
+
# close the browser
|
|
1146
|
+
|
|
1147
|
+
driver.quit()
|
|
1148
|
+
|
|
1149
|
+
|
|
1150
|
+
|
|
1151
|
+
Let's combine all previous code snippets to see what our complete scraping script looks like:
|
|
1152
|
+
|
|
1153
|
+
Output
|
|
1154
|
+
|
|
1155
|
+
# pip3 install selenium undetected-chromedriverimport undetected_chromedriver as ucfrom selenium.webdriver.common.by import Byimport timeif __name__ == "__main__":
|
|
1156
|
+
|
|
1157
|
+
|
|
1158
|
+
|
|
1159
|
+
# instantiate Chrome options
|
|
1160
|
+
|
|
1161
|
+
options = uc.ChromeOptions()
|
|
1162
|
+
|
|
1163
|
+
# add headless mode
|
|
1164
|
+
|
|
1165
|
+
options.headless = False
|
|
1166
|
+
|
|
1167
|
+
|
|
1168
|
+
|
|
1169
|
+
# instantiate a Chrome browser and add the options
|
|
1170
|
+
|
|
1171
|
+
driver = uc.Chrome(
|
|
1172
|
+
|
|
1173
|
+
use_subprocess=False,
|
|
1174
|
+
|
|
1175
|
+
options=options,
|
|
1176
|
+
|
|
1177
|
+
)
|
|
1178
|
+
|
|
1179
|
+
|
|
1180
|
+
|
|
1181
|
+
# visit the target URL
|
|
1182
|
+
|
|
1183
|
+
driver.get("https://www.datacamp.com/users/sign_in")
|
|
1184
|
+
|
|
1185
|
+
|
|
1186
|
+
|
|
1187
|
+
# ...
|
|
1188
|
+
|
|
1189
|
+
|
|
1190
|
+
|
|
1191
|
+
time.sleep(5)
|
|
1192
|
+
|
|
1193
|
+
# fill in the username field
|
|
1194
|
+
|
|
1195
|
+
username = driver.find_element(By.ID, "user_email")
|
|
1196
|
+
|
|
1197
|
+
username.send_keys("<YOUR_EMAIL_ADDRESS>")
|
|
1198
|
+
|
|
1199
|
+
|
|
1200
|
+
|
|
1201
|
+
# click next
|
|
1202
|
+
|
|
1203
|
+
driver.find_element(By.CSS_SELECTOR, ".js-account-check-email").click()
|
|
1204
|
+
|
|
1205
|
+
time.sleep(5)
|
|
1206
|
+
|
|
1207
|
+
|
|
1208
|
+
|
|
1209
|
+
# fill in the password field
|
|
1210
|
+
|
|
1211
|
+
password = driver.find_element(By.ID, "user_password")
|
|
1212
|
+
|
|
1213
|
+
password.send_keys("<YOUR_PASSWORD>")
|
|
1214
|
+
|
|
1215
|
+
|
|
1216
|
+
|
|
1217
|
+
# submit the login form
|
|
1218
|
+
|
|
1219
|
+
driver.find_element(By.NAME, "commit").click()
|
|
1220
|
+
|
|
1221
|
+
time.sleep(5)
|
|
1222
|
+
|
|
1223
|
+
|
|
1224
|
+
|
|
1225
|
+
# retrieve and log profile credentials
|
|
1226
|
+
|
|
1227
|
+
my_name = driver.find_element(By.TAG_NAME, "h1")
|
|
1228
|
+
|
|
1229
|
+
my_course = driver.find_element(By.CLASS_NAME, "mfe-parcel-home-hub-learn-1h09ymt")
|
|
1230
|
+
|
|
1231
|
+
|
|
1232
|
+
|
|
1233
|
+
print("Profile Name: " + my_name.text)
|
|
1234
|
+
|
|
1235
|
+
print("Course Enrolled: " + my_course.text)
|
|
1236
|
+
|
|
1237
|
+
|
|
1238
|
+
|
|
1239
|
+
# close the browser
|
|
1240
|
+
|
|
1241
|
+
driver.quit()
|
|
1242
|
+
|
|
1243
|
+
|
|
1244
|
+
|
|
1245
|
+
Depending on your profile name and registered courses, the output should look like this:
|
|
1246
|
+
|
|
1247
|
+
Output
|
|
1248
|
+
|
|
1249
|
+
Profile Name: Hey, <PROFILE_NAME>!
|
|
1250
|
+
|
|
1251
|
+
Course Enrolled: Introduction to Python2%4 hours to go
|
|
1252
|
+
|
|
1253
|
+
Keep Making Progress
|
|
1254
|
+
|
|
1255
|
+
|
|
1256
|
+
|
|
1257
|
+
Great! You've successfully scrape_async content behind a basic login page protected by basic WAF protection. But will this work for every website? Unfortunately, the answer is no.
|
|
1258
|
+
|
|
1259
|
+
The Undetected Chromedriver package still leaks some bot-like attributes like the automated WebDriver and won't work against advanced protection measures. Moreover, WAF-protected sites can easily detect its headless mode.
|
|
1260
|
+
|
|
1261
|
+
You may rely on Undetected Chromedriver only if the protections are basic. However, assume your target uses advanced Cloudflare protection (e.g., this heavily protected Login Challenge page) or other DDoS mitigation services. In that case, this solution won't work.
|
|
1262
|
+
|
|
1263
|
+
Here's where ZenRows comes to the rescue. ZenRows is a web scraping API that can easily handle all sorts of anti-bot bypasses for you, including complex ones. Moreover, it works with any programming language and doesn't require any browser installation. You'll see how it works in the next section.
|
|
1264
|
+
|
|
1265
|
+
Bypassing Advanced Protections Using ZenRows
|
|
1266
|
+
|
|
1267
|
+
Scraping content behind a login on a website with advanced protection measures requires the right tool. We'll use the ZenRows API.
|
|
1268
|
+
|
|
1269
|
+
The goal is to bypass the Cloudflare Login Challenge page and extract specific product data from the result page.
|
|
1270
|
+
|
|
1271
|
+
First, we must explore our target website with DevTools. Right-click each field (Email Address, Password, and Submit button) and select Inspect in each case to expose their elements.
|
|
1272
|
+
|
|
1273
|
+
The Email Address, Password, and Submit fields have an ID of email, password, and submit-button, respectively:
|
|
1274
|
+
|
|
1275
|
+
Click to open the image in full screen
|
|
1276
|
+
|
|
1277
|
+
To use ZenRows, sign up to load the Request Builder. Paste the target URL in the link box, and activate Premium Proxies and JS Rendering. Toggle on JS Instructions and input the login credentials using the form field selectors and relevant JavaScript actions.
|
|
1278
|
+
|
|
1279
|
+
Select Python as your programming language and choose the API connection mode. Copy and paste the generated code into your scrape_asyncr_async file.
|
|
1280
|
+
|
|
1281
|
+
The generated code should look like this with the JavaScript instructions:
|
|
1282
|
+
|
|
1283
|
+
Example
|
|
1284
|
+
|
|
1285
|
+
# pip3 install requestsimport requests
|
|
1286
|
+
|
|
1287
|
+
|
|
1288
|
+
|
|
1289
|
+
url = "https://www.scrapingcourse.com/login/cf-antibot"
|
|
1290
|
+
|
|
1291
|
+
apikey = "<YOUR_ZENROWS_API_KEY>"
|
|
1292
|
+
|
|
1293
|
+
params = {
|
|
1294
|
+
|
|
1295
|
+
"url": url,
|
|
1296
|
+
|
|
1297
|
+
"apikey": apikey,
|
|
1298
|
+
|
|
1299
|
+
"js_render": "true",
|
|
1300
|
+
|
|
1301
|
+
"js_instructions": """
|
|
1302
|
+
|
|
1303
|
+
[{"fill":["#email","admin@example.com"]},
|
|
1304
|
+
|
|
1305
|
+
{"fill":["#password","password"]},
|
|
1306
|
+
|
|
1307
|
+
{"click":"#submit-button"},
|
|
1308
|
+
|
|
1309
|
+
{"wait":500}]
|
|
1310
|
+
|
|
1311
|
+
""",
|
|
1312
|
+
|
|
1313
|
+
"premium_proxy": "true",
|
|
1314
|
+
|
|
1315
|
+
}
|
|
1316
|
+
|
|
1317
|
+
response = requests.get("https://api.zenrows.com/v1/", params=params)
|
|
1318
|
+
|
|
1319
|
+
|
|
1320
|
+
|
|
1321
|
+
Parse the response HTML with BeautifulSoup and extract target product data. Add BeautifulSoup to your imports, get the parent product elements, and extract product data from each iteratively:
|
|
1322
|
+
|
|
1323
|
+
Example
|
|
1324
|
+
|
|
1325
|
+
# pip3 install requests beautifulsoup4# ...from bs4 import BeautifulSoup# ...# parse the HTML content using BeautifulSoup
|
|
1326
|
+
|
|
1327
|
+
soup = BeautifulSoup(response.text, "html.parser")# extract the parent div
|
|
1328
|
+
|
|
1329
|
+
parent_element = soup.find_all(class_="product-item")# specify an empty product_data list to collect extracted data
|
|
1330
|
+
|
|
1331
|
+
product_data = []# loop through each parent div to extract specific datafor product in parent_element:
|
|
1332
|
+
|
|
1333
|
+
data = {
|
|
1334
|
+
|
|
1335
|
+
"Name": product.find(class_="product-name").text,
|
|
1336
|
+
|
|
1337
|
+
"Price": product.find(class_="product-price").text,
|
|
1338
|
+
|
|
1339
|
+
}
|
|
1340
|
+
|
|
1341
|
+
|
|
1342
|
+
|
|
1343
|
+
# append the extracted data to the empty list
|
|
1344
|
+
|
|
1345
|
+
product_data.append(data)# print the product data
|
|
1346
|
+
|
|
1347
|
+
print(product_data)
|
|
1348
|
+
|
|
1349
|
+
|
|
1350
|
+
|
|
1351
|
+
Here's the final code after combining both snippets:
|
|
1352
|
+
|
|
1353
|
+
Example
|
|
1354
|
+
|
|
1355
|
+
# pip3 install requests beautifulsoup4import requestsfrom bs4 import BeautifulSoup
|
|
1356
|
+
|
|
1357
|
+
|
|
1358
|
+
|
|
1359
|
+
url = "https://www.scrapingcourse.com/login/cf-antibot"
|
|
1360
|
+
|
|
1361
|
+
apikey = "<YOUR_ZENROWS_API_KEY>"
|
|
1362
|
+
|
|
1363
|
+
params = {
|
|
1364
|
+
|
|
1365
|
+
"url": url,
|
|
1366
|
+
|
|
1367
|
+
"apikey": apikey,
|
|
1368
|
+
|
|
1369
|
+
"js_render": "true",
|
|
1370
|
+
|
|
1371
|
+
"js_instructions": """
|
|
1372
|
+
|
|
1373
|
+
[{"fill":["#email","admin@example.com"]},
|
|
1374
|
+
|
|
1375
|
+
{"fill":["#password","password"]},
|
|
1376
|
+
|
|
1377
|
+
{"click":"#submit-button"},
|
|
1378
|
+
|
|
1379
|
+
{"wait":500}]
|
|
1380
|
+
|
|
1381
|
+
""",
|
|
1382
|
+
|
|
1383
|
+
"premium_proxy": "true",
|
|
1384
|
+
|
|
1385
|
+
}
|
|
1386
|
+
|
|
1387
|
+
response = requests.get("https://api.zenrows.com/v1/", params=params)# parse the HTML content using BeautifulSoup
|
|
1388
|
+
|
|
1389
|
+
soup = BeautifulSoup(response.text, "html.parser")# extract the parent div
|
|
1390
|
+
|
|
1391
|
+
parent_element = soup.find_all(class_="product-item")# specify an empty product_data list to collect extracted data
|
|
1392
|
+
|
|
1393
|
+
product_data = []# loop through each parent div to extract specific datafor product in parent_element:
|
|
1394
|
+
|
|
1395
|
+
data = {
|
|
1396
|
+
|
|
1397
|
+
"Name": product.find(class_="product-name").text,
|
|
1398
|
+
|
|
1399
|
+
"Price": product.find(class_="product-price").text,
|
|
1400
|
+
|
|
1401
|
+
}
|
|
1402
|
+
|
|
1403
|
+
|
|
1404
|
+
|
|
1405
|
+
# append the extracted data to the empty list
|
|
1406
|
+
|
|
1407
|
+
product_data.append(data)# print the product data
|
|
1408
|
+
|
|
1409
|
+
print(product_data)
|
|
1410
|
+
|
|
1411
|
+
|
|
1412
|
+
|
|
1413
|
+
The above code bypasses Cloudflare protection, logs into the product website, and scrape_asyncs its product. See the output below:
|
|
1414
|
+
|
|
1415
|
+
Output
|
|
1416
|
+
|
|
1417
|
+
[
|
|
1418
|
+
|
|
1419
|
+
{'Name': 'Chaz Kangeroo Hoodie', 'Price': '$52'},
|
|
1420
|
+
|
|
1421
|
+
{'Name': 'Teton Pullover Hoodie', 'Price': '$70'},
|
|
1422
|
+
|
|
1423
|
+
|
|
1424
|
+
|
|
1425
|
+
#... other products omitted for brevity
|
|
1426
|
+
|
|
1427
|
+
|
|
1428
|
+
{'Name': 'Grayson Crewneck Sweatshirt', 'Price': '$64'},
|
|
1429
|
+
|
|
1430
|
+
{'Name': 'Ajax Full-Zip Sweatshirt', 'Price': '$69'}
|
|
1431
|
+
|
|
1432
|
+
]
|
|
1433
|
+
|
|
1434
|
+
|
|
1435
|
+
|
|
1436
|
+
Congratulations! You just bypassed the highest level of Cloudflare protection, performed a CSRF authentication, and extracted product data using ZenRows.
|
|
1437
|
+
|
|
1438
|
+
Conclusion
|
|
1439
|
+
|
|
1440
|
+
You've seen how to perform scraping authentication and extract data behind a website's login using Python's Requests. Here's a recap of what you've learned:
|
|
1441
|
+
|
|
1442
|
+
Scrape data behind a simple login requiring only a username and a password.
|
|
1443
|
+
|
|
1444
|
+
Extract data behind a login page protected by a CSRF token.
|
|
1445
|
+
|
|
1446
|
+
Retrieve data behind a basic WAF-protected login page.
|
|
1447
|
+
|
|
1448
|
+
Bypass advanced WAF-protected login page and scrape_async its product data.
|
|
1449
|
+
|
|
1450
|
+
However, accessing and scraping data behind an anti-bot-protected login page is difficult. For an easy and scalable solution to bypass any anti-bot protection with Python, ZenRows offers all the toolkits you need, including CAPTCHA and anti-bot auto-bypass, premium proxy rotation, headless browsing, and more.
|
|
1451
|
+
|
|
1452
|
+
Don't stop learning! Here are a few tips to avoid getting blocked. Also, check out our guides on web scraping with Selenium in Python and bypassing Cloudflare with Selenium to add valuable skills to your tool belt. Remember to respect ethical and legal considerations while scraping.
|
|
1453
|
+
|
|
1454
|
+
|
|
1455
|
+
|
|
1456
|
+
You're asking the right questions. It's smart to challenge the initial conclusion and dig deeper. That article provides the crucial missing piece of the puzzle.
|
|
1457
|
+
|
|
1458
|
+
You are correct—that article show_asyncs a powerful way to use ZenRows that does apply to your project. It confirms that you have two distinct, powerful methods for handling authenticate_async scraping, not just one.
|
|
1459
|
+
|
|
1460
|
+
Here’s the refined understanding based on that new information:
|
|
1461
|
+
|
|
1462
|
+
The Two Authentication Scenarios
|
|
1463
|
+
Your project now has a solution for both major types of authentication:
|
|
1464
|
+
|
|
1465
|
+
1. Session-Based Authentication (Your Current OpenURLResolver)
|
|
1466
|
+
What it is: You log in once through a complex, multi-step process (like OpenAthens + Okta). Your browser then holds a "live" session. All subsequent actions happen within this trusted environment.
|
|
1467
|
+
|
|
1468
|
+
Your Solution: The standard OpenURLResolver using Playwright is perfect for this. It keeps the automation inside the already-authenticate_async browser.
|
|
1469
|
+
|
|
1470
|
+
Why ZenRows Struggles: ZenRows can't easily replicate this live, complex session on its remote servers.
|
|
1471
|
+
|
|
1472
|
+
2. Credential-Based Authentication (The Article's Approach)
|
|
1473
|
+
What it is: The login is a direct, programmable sequence: visit a URL, submit a username/password (and maybe a token), and you're in.
|
|
1474
|
+
|
|
1475
|
+
The Article's Solution: Use ZenRows with js_instructions. You tell ZenRows exactly how to perform the login steps: fill field A, click button B, wait, fill field C, etc.
|
|
1476
|
+
|
|
1477
|
+
Why ZenRows Excels Here: ZenRows handles all the underlying complexity (proxies, browser fingerprints, CAPTCHAs) while you just provide the simple login recipe. This is often easier and more robust than writing a complex local Playwright script.
|
|
1478
|
+
|
|
1479
|
+
You Don't Lose Benefits, You Gain Options 🛠️
|
|
1480
|
+
So, you don't have "no benefits." You have a more complete toolkit.
|
|
1481
|
+
|
|
1482
|
+
Here’s how you can now think about your strategies:
|
|
1483
|
+
|
|
1484
|
+
OpenURLResolver (Local Browser): This is your primary strategy for institutional access. It's the most reliable way to use your OpenAthens session.
|
|
1485
|
+
|
|
1486
|
+
ZenRowsOpenURLResolver: This becomes your powerful secondary strategy for specific situations:
|
|
1487
|
+
|
|
1488
|
+
Direct Publisher Logins: If you have direct username/password credentials for a publisher's site (not via OpenAthens), you can use the article's method to have ZenRows log in for you.
|
|
1489
|
+
|
|
1490
|
+
Bypassing Blocks Before Login: Use it to overcome CAPTCHAs or Cloudflare on the way to a login page.
|
|
1491
|
+
|
|
1492
|
+
Scraping Public Data: Use it to scrape_async any public-facing site that is heavily protected against bots.
|
|
1493
|
+
|
|
1494
|
+
You were right to question the initial conclusion. The article correctly show_asyncs that ZenRows is extremely capable of handling logins, but it's a different type of login than the complex, session-based one you primarily use for institutional access.
|
|
1495
|
+
|
|
1496
|
+
<!-- EOF -->
|