@wentorai/research-plugins 1.0.0 → 1.2.0
This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
- package/README.md +22 -22
- package/curated/analysis/README.md +82 -56
- package/curated/domains/README.md +225 -69
- package/curated/literature/README.md +115 -46
- package/curated/research/README.md +106 -58
- package/curated/tools/README.md +107 -87
- package/curated/writing/README.md +92 -45
- package/mcp-configs/academic-db/alphafold-mcp.json +20 -0
- package/mcp-configs/academic-db/brightspace-mcp.json +21 -0
- package/mcp-configs/academic-db/climatiq-mcp.json +20 -0
- package/mcp-configs/academic-db/gibs-mcp.json +20 -0
- package/mcp-configs/academic-db/gis-mcp-server.json +22 -0
- package/mcp-configs/academic-db/google-earth-engine-mcp.json +21 -0
- package/mcp-configs/academic-db/m4-clinical-mcp.json +21 -0
- package/mcp-configs/academic-db/medical-mcp.json +21 -0
- package/mcp-configs/academic-db/nexonco-mcp.json +20 -0
- package/mcp-configs/academic-db/omop-mcp.json +20 -0
- package/mcp-configs/academic-db/onekgpd-mcp.json +20 -0
- package/mcp-configs/academic-db/openedu-mcp.json +20 -0
- package/mcp-configs/academic-db/opengenes-mcp.json +20 -0
- package/mcp-configs/academic-db/openstax-mcp.json +21 -0
- package/mcp-configs/academic-db/openstreetmap-mcp.json +21 -0
- package/mcp-configs/academic-db/opentargets-mcp.json +21 -0
- package/mcp-configs/academic-db/pdb-mcp.json +21 -0
- package/mcp-configs/academic-db/smithsonian-mcp.json +20 -0
- package/mcp-configs/ai-platform/magi-researchers.json +21 -0
- package/mcp-configs/ai-platform/mcp-academic-researcher.json +22 -0
- package/mcp-configs/ai-platform/open-paper-machine.json +21 -0
- package/mcp-configs/ai-platform/paper-intelligence.json +21 -0
- package/mcp-configs/ai-platform/paper-reader.json +21 -0
- package/mcp-configs/ai-platform/paperdebugger.json +21 -0
- package/mcp-configs/browser/exa-mcp.json +20 -0
- package/mcp-configs/browser/mcp-searxng.json +21 -0
- package/mcp-configs/browser/mcp-webresearch.json +20 -0
- package/mcp-configs/cloud-docs/confluence-mcp.json +37 -0
- package/mcp-configs/cloud-docs/google-drive-mcp.json +35 -0
- package/mcp-configs/cloud-docs/notion-mcp.json +29 -0
- package/mcp-configs/communication/discord-mcp.json +29 -0
- package/mcp-configs/communication/discourse-mcp.json +21 -0
- package/mcp-configs/communication/slack-mcp.json +29 -0
- package/mcp-configs/communication/telegram-mcp.json +28 -0
- package/mcp-configs/data-platform/automl-stat-mcp.json +21 -0
- package/mcp-configs/data-platform/jefferson-stats-mcp.json +22 -0
- package/mcp-configs/data-platform/mcp-excel-server.json +21 -0
- package/mcp-configs/data-platform/mcp-stata.json +21 -0
- package/mcp-configs/data-platform/mcpstack-jupyter.json +21 -0
- package/mcp-configs/data-platform/ml-mcp.json +21 -0
- package/mcp-configs/data-platform/nasdaq-data-link-mcp.json +20 -0
- package/mcp-configs/data-platform/numpy-mcp.json +21 -0
- package/mcp-configs/database/neo4j-mcp.json +37 -0
- package/mcp-configs/database/postgres-mcp.json +28 -0
- package/mcp-configs/database/sqlite-mcp.json +29 -0
- package/mcp-configs/dev-platform/geogebra-mcp.json +21 -0
- package/mcp-configs/dev-platform/github-mcp.json +31 -0
- package/mcp-configs/dev-platform/gitlab-mcp.json +34 -0
- package/mcp-configs/dev-platform/latex-mcp-server.json +21 -0
- package/mcp-configs/dev-platform/manim-mcp.json +20 -0
- package/mcp-configs/dev-platform/mcp-echarts.json +20 -0
- package/mcp-configs/dev-platform/panel-viz-mcp.json +20 -0
- package/mcp-configs/dev-platform/paperbanana.json +20 -0
- package/mcp-configs/dev-platform/texflow-mcp.json +20 -0
- package/mcp-configs/dev-platform/texmcp.json +20 -0
- package/mcp-configs/dev-platform/typst-mcp.json +21 -0
- package/mcp-configs/dev-platform/vizro-mcp.json +20 -0
- package/mcp-configs/email/email-mcp.json +40 -0
- package/mcp-configs/email/gmail-mcp.json +37 -0
- package/mcp-configs/note-knowledge/local-faiss-mcp.json +21 -0
- package/mcp-configs/note-knowledge/mcp-memory-service.json +21 -0
- package/mcp-configs/note-knowledge/mcp-obsidian.json +23 -0
- package/mcp-configs/note-knowledge/mcp-ragdocs.json +20 -0
- package/mcp-configs/note-knowledge/mcp-summarizer.json +21 -0
- package/mcp-configs/note-knowledge/mediawiki-mcp.json +21 -0
- package/mcp-configs/note-knowledge/openzim-mcp.json +20 -0
- package/mcp-configs/note-knowledge/zettelkasten-mcp.json +21 -0
- package/mcp-configs/reference-mgr/academic-paper-mcp-http.json +20 -0
- package/mcp-configs/reference-mgr/academix.json +20 -0
- package/mcp-configs/reference-mgr/arxiv-research-mcp.json +21 -0
- package/mcp-configs/reference-mgr/google-scholar-abstract-mcp.json +19 -0
- package/mcp-configs/reference-mgr/google-scholar-mcp.json +20 -0
- package/mcp-configs/reference-mgr/mcp-paperswithcode.json +21 -0
- package/mcp-configs/reference-mgr/mcp-scholarly.json +20 -0
- package/mcp-configs/reference-mgr/mcp-simple-arxiv.json +20 -0
- package/mcp-configs/reference-mgr/mcp-simple-pubmed.json +20 -0
- package/mcp-configs/reference-mgr/mcp-zotero.json +21 -0
- package/mcp-configs/reference-mgr/mendeley-mcp.json +20 -0
- package/mcp-configs/reference-mgr/ncbi-mcp-server.json +22 -0
- package/mcp-configs/reference-mgr/onecite.json +21 -0
- package/mcp-configs/reference-mgr/paper-search-mcp.json +21 -0
- package/mcp-configs/reference-mgr/pubmed-search-mcp.json +21 -0
- package/mcp-configs/reference-mgr/scholar-mcp.json +21 -0
- package/mcp-configs/reference-mgr/scholar-multi-mcp.json +21 -0
- package/mcp-configs/reference-mgr/seerai.json +21 -0
- package/mcp-configs/reference-mgr/semantic-scholar-fastmcp.json +21 -0
- package/mcp-configs/reference-mgr/sourcelibrary.json +20 -0
- package/mcp-configs/registry.json +178 -149
- package/mcp-configs/repository/dataverse-mcp.json +33 -0
- package/mcp-configs/repository/huggingface-mcp.json +29 -0
- package/openclaw.plugin.json +2 -2
- package/package.json +2 -2
- package/skills/analysis/dataviz/algorithm-visualizer-guide/SKILL.md +259 -0
- package/skills/analysis/dataviz/bokeh-visualization-guide/SKILL.md +270 -0
- package/skills/analysis/dataviz/chart-image-generator/SKILL.md +229 -0
- package/skills/analysis/dataviz/citation-map-guide/SKILL.md +184 -0
- package/skills/analysis/dataviz/d3-visualization-guide/SKILL.md +281 -0
- package/skills/analysis/dataviz/data-visualization-principles/SKILL.md +171 -0
- package/skills/analysis/dataviz/echarts-visualization-guide/SKILL.md +250 -0
- package/skills/analysis/dataviz/metabase-analytics-guide/SKILL.md +242 -0
- package/skills/analysis/dataviz/plotly-interactive-guide/SKILL.md +266 -0
- package/skills/analysis/dataviz/redash-analytics-guide/SKILL.md +284 -0
- package/skills/analysis/econometrics/econml-causal-guide/SKILL.md +163 -0
- package/skills/analysis/econometrics/empirical-paper-analysis/SKILL.md +192 -0
- package/skills/analysis/econometrics/mostly-harmless-guide/SKILL.md +139 -0
- package/skills/analysis/econometrics/panel-data-analyst/SKILL.md +259 -0
- package/skills/analysis/econometrics/panel-data-regression-workflow/SKILL.md +267 -0
- package/skills/analysis/econometrics/python-causality-guide/SKILL.md +134 -0
- package/skills/analysis/econometrics/stata-accounting-guide/SKILL.md +269 -0
- package/skills/analysis/econometrics/stata-analyst-guide/SKILL.md +245 -0
- package/skills/analysis/econometrics/stata-reference-guide/SKILL.md +293 -0
- package/skills/analysis/statistics/data-anomaly-detection/SKILL.md +157 -0
- package/skills/analysis/statistics/general-statistics-guide/SKILL.md +226 -0
- package/skills/analysis/statistics/infiagent-benchmark-guide/SKILL.md +106 -0
- package/skills/analysis/statistics/ml-experiment-tracker/SKILL.md +212 -0
- package/skills/analysis/statistics/pywayne-statistics-guide/SKILL.md +192 -0
- package/skills/analysis/statistics/quantitative-methods-guide/SKILL.md +193 -0
- package/skills/analysis/statistics/senior-data-scientist-guide/SKILL.md +223 -0
- package/skills/analysis/wrangling/claude-data-analysis-guide/SKILL.md +100 -0
- package/skills/analysis/wrangling/csv-data-analyzer/SKILL.md +170 -0
- package/skills/analysis/wrangling/data-cleaning-pipeline/SKILL.md +266 -0
- package/skills/analysis/wrangling/data-cog-guide/SKILL.md +178 -0
- package/skills/analysis/wrangling/open-data-scientist-guide/SKILL.md +197 -0
- package/skills/analysis/wrangling/stata-data-cleaning/SKILL.md +276 -0
- package/skills/analysis/wrangling/streamline-analyst-guide/SKILL.md +119 -0
- package/skills/analysis/wrangling/survey-data-processing/SKILL.md +298 -0
- package/skills/domains/ai-ml/ai-agent-papers-guide/SKILL.md +146 -0
- package/skills/domains/ai-ml/ai-model-benchmarking/SKILL.md +209 -0
- package/skills/domains/ai-ml/annotated-dl-papers-guide/SKILL.md +159 -0
- package/skills/domains/ai-ml/anomaly-detection-papers-guide/SKILL.md +167 -0
- package/skills/domains/ai-ml/autonomous-agents-papers-guide/SKILL.md +178 -0
- package/skills/domains/ai-ml/dl-transformer-finetune/SKILL.md +239 -0
- package/skills/domains/ai-ml/domain-adaptation-papers-guide/SKILL.md +173 -0
- package/skills/domains/ai-ml/generative-ai-guide/SKILL.md +146 -0
- package/skills/domains/ai-ml/graph-learning-papers-guide/SKILL.md +125 -0
- package/skills/domains/ai-ml/huggingface-inference-guide/SKILL.md +196 -0
- package/skills/domains/ai-ml/keras-deep-learning/SKILL.md +210 -0
- package/skills/domains/ai-ml/kolmogorov-arnold-networks-guide/SKILL.md +185 -0
- package/skills/domains/ai-ml/llm-from-scratch-guide/SKILL.md +124 -0
- package/skills/domains/ai-ml/ml-pipeline-guide/SKILL.md +295 -0
- package/skills/domains/ai-ml/nlp-toolkit-guide/SKILL.md +247 -0
- package/skills/domains/ai-ml/npcpy-research-guide/SKILL.md +137 -0
- package/skills/domains/ai-ml/pytorch-guide/SKILL.md +281 -0
- package/skills/domains/ai-ml/pytorch-lightning-guide/SKILL.md +244 -0
- package/skills/domains/ai-ml/responsible-ai-guide/SKILL.md +126 -0
- package/skills/domains/ai-ml/tensorflow-guide/SKILL.md +241 -0
- package/skills/domains/ai-ml/vmas-simulator-guide/SKILL.md +129 -0
- package/skills/domains/biomedical/bioagents-guide/SKILL.md +308 -0
- package/skills/domains/biomedical/clawbio-guide/SKILL.md +167 -0
- package/skills/domains/biomedical/clinical-dialogue-agents-guide/SKILL.md +145 -0
- package/skills/domains/biomedical/ena-sequence-api/SKILL.md +175 -0
- package/skills/domains/biomedical/genomas-guide/SKILL.md +126 -0
- package/skills/domains/biomedical/genotex-benchmark-guide/SKILL.md +125 -0
- package/skills/domains/biomedical/med-researcher-guide/SKILL.md +161 -0
- package/skills/domains/biomedical/med-researcher-r1-guide/SKILL.md +146 -0
- package/skills/domains/biomedical/medgeclaw-guide/SKILL.md +345 -0
- package/skills/domains/biomedical/medical-imaging-guide/SKILL.md +305 -0
- package/skills/domains/biomedical/ncbi-blast-api/SKILL.md +195 -0
- package/skills/domains/biomedical/ncbi-datasets-api/SKILL.md +220 -0
- package/skills/domains/biomedical/quickgo-api/SKILL.md +181 -0
- package/skills/domains/business/architecture-design-guide/SKILL.md +279 -0
- package/skills/domains/business/innovation-management-guide/SKILL.md +257 -0
- package/skills/domains/business/operations-research-guide/SKILL.md +258 -0
- package/skills/domains/business/xpert-bi-guide/SKILL.md +84 -0
- package/skills/domains/chemistry/cactus-cheminformatics-guide/SKILL.md +89 -0
- package/skills/domains/chemistry/chemeagle-guide/SKILL.md +147 -0
- package/skills/domains/chemistry/chemgraph-agent-guide/SKILL.md +120 -0
- package/skills/domains/chemistry/molecular-dynamics-guide/SKILL.md +237 -0
- package/skills/domains/chemistry/pubchem-api-guide/SKILL.md +180 -0
- package/skills/domains/chemistry/spectroscopy-analysis-guide/SKILL.md +290 -0
- package/skills/domains/cs/ai-security-papers-guide/SKILL.md +103 -0
- package/skills/domains/cs/code-llm-papers-guide/SKILL.md +131 -0
- package/skills/domains/cs/distributed-systems-guide/SKILL.md +268 -0
- package/skills/domains/cs/formal-verification-guide/SKILL.md +298 -0
- package/skills/domains/cs/gaussian-splatting-papers-guide/SKILL.md +158 -0
- package/skills/domains/cs/llm-aiops-guide/SKILL.md +70 -0
- package/skills/domains/cs/software-heritage-api/SKILL.md +200 -0
- package/skills/domains/ecology/species-distribution-guide/SKILL.md +343 -0
- package/skills/domains/economics/imf-data-api-guide/SKILL.md +174 -0
- package/skills/domains/economics/nber-working-papers-api/SKILL.md +177 -0
- package/skills/domains/economics/post-labor-economics/SKILL.md +254 -0
- package/skills/domains/economics/pricing-psychology-guide/SKILL.md +273 -0
- package/skills/domains/economics/repec-economics-api/SKILL.md +188 -0
- package/skills/domains/economics/world-bank-data-guide/SKILL.md +179 -0
- package/skills/domains/education/academic-study-methods/SKILL.md +228 -0
- package/skills/domains/education/assessment-design-guide/SKILL.md +213 -0
- package/skills/domains/education/educational-research-methods/SKILL.md +179 -0
- package/skills/domains/education/edumcp-guide/SKILL.md +74 -0
- package/skills/domains/education/mooc-analytics-guide/SKILL.md +206 -0
- package/skills/domains/education/open-syllabus-api/SKILL.md +171 -0
- package/skills/domains/finance/akshare-finance-data/SKILL.md +207 -0
- package/skills/domains/finance/finsight-research-guide/SKILL.md +113 -0
- package/skills/domains/finance/options-analytics-agent-guide/SKILL.md +117 -0
- package/skills/domains/finance/portfolio-optimization-guide/SKILL.md +279 -0
- package/skills/domains/finance/risk-modeling-guide/SKILL.md +260 -0
- package/skills/domains/finance/stata-accounting-research/SKILL.md +372 -0
- package/skills/domains/geoscience/climate-modeling-guide/SKILL.md +215 -0
- package/skills/domains/geoscience/pangaea-data-api/SKILL.md +197 -0
- package/skills/domains/geoscience/satellite-remote-sensing/SKILL.md +193 -0
- package/skills/domains/geoscience/seismology-data-guide/SKILL.md +208 -0
- package/skills/domains/humanities/digital-humanities-methods/SKILL.md +232 -0
- package/skills/domains/humanities/ethical-philosophy-guide/SKILL.md +244 -0
- package/skills/domains/humanities/history-research-guide/SKILL.md +260 -0
- package/skills/domains/humanities/political-history-guide/SKILL.md +241 -0
- package/skills/domains/law/caselaw-access-api/SKILL.md +149 -0
- package/skills/domains/law/legal-agent-skills-guide/SKILL.md +132 -0
- package/skills/domains/law/legal-nlp-guide/SKILL.md +236 -0
- package/skills/domains/law/legal-research-methods/SKILL.md +190 -0
- package/skills/domains/law/opencontracts-guide/SKILL.md +168 -0
- package/skills/domains/law/patent-analysis-guide/SKILL.md +257 -0
- package/skills/domains/law/regulatory-compliance-guide/SKILL.md +267 -0
- package/skills/domains/math/lean-theorem-proving-guide/SKILL.md +140 -0
- package/skills/domains/math/symbolic-computation-guide/SKILL.md +263 -0
- package/skills/domains/math/topology-data-analysis/SKILL.md +305 -0
- package/skills/domains/pharma/clinical-trial-design-guide/SKILL.md +271 -0
- package/skills/domains/pharma/drug-target-interaction/SKILL.md +242 -0
- package/skills/domains/pharma/madd-drug-discovery-guide/SKILL.md +153 -0
- package/skills/domains/pharma/pharmacovigilance-guide/SKILL.md +216 -0
- package/skills/domains/physics/astrophysics-data-guide/SKILL.md +305 -0
- package/skills/domains/physics/particle-physics-guide/SKILL.md +287 -0
- package/skills/domains/social-science/ipums-microdata-api/SKILL.md +211 -0
- package/skills/domains/social-science/network-analysis-guide/SKILL.md +310 -0
- package/skills/domains/social-science/psychology-research-guide/SKILL.md +270 -0
- package/skills/domains/social-science/sociology-research-guide/SKILL.md +238 -0
- package/skills/domains/social-science/sociology-research-methods/SKILL.md +181 -0
- package/skills/literature/discovery/arxiv-paper-monitoring/SKILL.md +233 -0
- package/skills/literature/discovery/paper-recommendation-guide/SKILL.md +120 -0
- package/skills/literature/discovery/papers-we-love-guide/SKILL.md +169 -0
- package/skills/literature/discovery/semantic-paper-radar/SKILL.md +144 -0
- package/skills/literature/discovery/zotero-arxiv-daily-guide/SKILL.md +94 -0
- package/skills/literature/fulltext/bioc-pmc-api/SKILL.md +146 -0
- package/skills/literature/fulltext/core-api-guide/SKILL.md +144 -0
- package/skills/literature/fulltext/dataverse-api/SKILL.md +215 -0
- package/skills/literature/fulltext/hal-archive-api/SKILL.md +218 -0
- package/skills/literature/fulltext/institutional-repository-guide/SKILL.md +212 -0
- package/skills/literature/fulltext/open-access-mining-guide/SKILL.md +341 -0
- package/skills/literature/fulltext/osf-api/SKILL.md +212 -0
- package/skills/literature/fulltext/pmc-ftp-bulk-download/SKILL.md +182 -0
- package/skills/literature/fulltext/zotero-ai-butler-guide/SKILL.md +166 -0
- package/skills/literature/fulltext/zotero-scihub-guide/SKILL.md +168 -0
- package/skills/literature/metadata/academic-paper-summarizer/SKILL.md +101 -0
- package/skills/literature/metadata/bibliometrix-guide/SKILL.md +164 -0
- package/skills/literature/metadata/crossref-event-data-api/SKILL.md +183 -0
- package/skills/literature/metadata/doi-content-negotiation/SKILL.md +202 -0
- package/skills/literature/metadata/orkg-api/SKILL.md +153 -0
- package/skills/literature/metadata/plumx-metrics-api/SKILL.md +188 -0
- package/skills/literature/metadata/ror-organization-api/SKILL.md +208 -0
- package/skills/literature/metadata/sophosia-reference-guide/SKILL.md +110 -0
- package/skills/literature/metadata/viaf-authority-api/SKILL.md +209 -0
- package/skills/literature/metadata/wikidata-api-guide/SKILL.md +156 -0
- package/skills/literature/metadata/zoplicate-dedup-guide/SKILL.md +147 -0
- package/skills/literature/metadata/zotero-actions-tags-guide/SKILL.md +212 -0
- package/skills/literature/metadata/zotmoov-guide/SKILL.md +120 -0
- package/skills/literature/metadata/zutilo-guide/SKILL.md +140 -0
- package/skills/literature/search/arxiv-batch-reporting/SKILL.md +133 -0
- package/skills/literature/search/arxiv-cli-tools/SKILL.md +172 -0
- package/skills/literature/search/arxiv-osiris/SKILL.md +199 -0
- package/skills/literature/search/arxiv-paper-processor/SKILL.md +141 -0
- package/skills/literature/search/baidu-scholar-guide/SKILL.md +110 -0
- package/skills/literature/search/base-academic-search/SKILL.md +196 -0
- package/skills/literature/search/chatpaper-guide/SKILL.md +122 -0
- package/skills/literature/search/citeseerx-api/SKILL.md +183 -0
- package/skills/literature/search/deep-literature-search/SKILL.md +149 -0
- package/skills/literature/search/deepgit-search-guide/SKILL.md +147 -0
- package/skills/literature/search/eric-education-api/SKILL.md +199 -0
- package/skills/literature/search/findpapers-guide/SKILL.md +177 -0
- package/skills/literature/search/ieee-xplore-api/SKILL.md +177 -0
- package/skills/literature/search/lens-scholarly-api/SKILL.md +211 -0
- package/skills/literature/search/multi-database-literature-search/SKILL.md +198 -0
- package/skills/literature/search/open-library-api/SKILL.md +196 -0
- package/skills/literature/search/open-semantic-search-guide/SKILL.md +190 -0
- package/skills/literature/search/openaire-api/SKILL.md +141 -0
- package/skills/literature/search/paper-search-mcp-guide/SKILL.md +107 -0
- package/skills/literature/search/papers-chat-guide/SKILL.md +194 -0
- package/skills/literature/search/pasa-paper-search-guide/SKILL.md +138 -0
- package/skills/literature/search/plos-open-access-api/SKILL.md +203 -0
- package/skills/literature/search/scielo-api/SKILL.md +182 -0
- package/skills/literature/search/share-research-api/SKILL.md +129 -0
- package/skills/literature/search/worldcat-search-api/SKILL.md +224 -0
- package/skills/research/automation/ai-scientist-v2-guide/SKILL.md +284 -0
- package/skills/research/automation/aim-experiment-guide/SKILL.md +234 -0
- package/skills/research/automation/claude-academic-workflow-guide/SKILL.md +202 -0
- package/skills/research/automation/coexist-ai-guide/SKILL.md +149 -0
- package/skills/research/automation/datagen-research-guide/SKILL.md +131 -0
- package/skills/research/automation/foam-agent-guide/SKILL.md +203 -0
- package/skills/research/automation/kedro-pipeline-guide/SKILL.md +216 -0
- package/skills/research/automation/mle-agent-guide/SKILL.md +139 -0
- package/skills/research/automation/paper-to-agent-guide/SKILL.md +116 -0
- package/skills/research/automation/rd-agent-guide/SKILL.md +246 -0
- package/skills/research/automation/research-paper-orchestrator/SKILL.md +254 -0
- package/skills/research/deep-research/academic-deep-research/SKILL.md +190 -0
- package/skills/research/deep-research/auto-deep-research-guide/SKILL.md +141 -0
- package/skills/research/deep-research/cognitive-kernel-guide/SKILL.md +200 -0
- package/skills/research/deep-research/corvus-research-guide/SKILL.md +132 -0
- package/skills/research/deep-research/deep-research-pro/SKILL.md +213 -0
- package/skills/research/deep-research/deep-research-work/SKILL.md +204 -0
- package/skills/research/deep-research/deep-searcher-guide/SKILL.md +253 -0
- package/skills/research/deep-research/gpt-researcher-guide/SKILL.md +191 -0
- package/skills/research/deep-research/in-depth-research-guide/SKILL.md +205 -0
- package/skills/research/deep-research/khoj-research-guide/SKILL.md +200 -0
- package/skills/research/deep-research/kosmos-scientist-guide/SKILL.md +185 -0
- package/skills/research/deep-research/llm-scientific-discovery-guide/SKILL.md +178 -0
- package/skills/research/deep-research/local-deep-research-guide/SKILL.md +253 -0
- package/skills/research/deep-research/open-researcher-guide/SKILL.md +138 -0
- package/skills/research/deep-research/tongyi-deep-research-guide/SKILL.md +217 -0
- package/skills/research/funding/eu-horizon-guide/SKILL.md +244 -0
- package/skills/research/funding/grant-budget-guide/SKILL.md +284 -0
- package/skills/research/funding/nih-reporter-api-guide/SKILL.md +166 -0
- package/skills/research/funding/nsf-award-api-guide/SKILL.md +133 -0
- package/skills/research/methodology/academic-mentor-guide/SKILL.md +169 -0
- package/skills/research/methodology/claude-scientific-guide/SKILL.md +122 -0
- package/skills/research/methodology/deep-innovator-guide/SKILL.md +242 -0
- package/skills/research/methodology/osf-api-guide/SKILL.md +165 -0
- package/skills/research/methodology/parsifal-slr-guide/SKILL.md +154 -0
- package/skills/research/methodology/research-paper-kb/SKILL.md +263 -0
- package/skills/research/methodology/research-pipeline-units-guide/SKILL.md +169 -0
- package/skills/research/methodology/research-town-guide/SKILL.md +263 -0
- package/skills/research/methodology/slr-automation-guide/SKILL.md +235 -0
- package/skills/research/paper-review/automated-review-guide/SKILL.md +281 -0
- package/skills/research/paper-review/latte-review-guide/SKILL.md +175 -0
- package/skills/research/paper-review/paper-compare-guide/SKILL.md +238 -0
- package/skills/research/paper-review/paper-critique-framework/SKILL.md +181 -0
- package/skills/research/paper-review/paper-digest-guide/SKILL.md +240 -0
- package/skills/research/paper-review/paper-research-assistant/SKILL.md +231 -0
- package/skills/research/paper-review/research-quality-filter/SKILL.md +261 -0
- package/skills/research/paper-review/review-response-guide/SKILL.md +275 -0
- package/skills/tools/code-exec/contextplus-mcp-guide/SKILL.md +110 -0
- package/skills/tools/code-exec/google-colab-guide/SKILL.md +276 -0
- package/skills/tools/code-exec/kaggle-api-guide/SKILL.md +216 -0
- package/skills/tools/code-exec/overleaf-cli-guide/SKILL.md +279 -0
- package/skills/tools/diagram/clawphd-guide/SKILL.md +149 -0
- package/skills/tools/diagram/code-flow-visualizer/SKILL.md +197 -0
- package/skills/tools/diagram/excalidraw-diagram-guide/SKILL.md +170 -0
- package/skills/tools/diagram/json-data-visualizer/SKILL.md +270 -0
- package/skills/tools/diagram/kroki-diagram-api/SKILL.md +198 -0
- package/skills/tools/diagram/mermaid-architect-guide/SKILL.md +219 -0
- package/skills/tools/diagram/scientific-graphical-abstract/SKILL.md +201 -0
- package/skills/tools/diagram/tldraw-whiteboard-guide/SKILL.md +397 -0
- package/skills/tools/document/docsgpt-guide/SKILL.md +130 -0
- package/skills/tools/document/large-document-reader/SKILL.md +202 -0
- package/skills/tools/document/md2pdf-xelatex/SKILL.md +212 -0
- package/skills/tools/document/openpaper-guide/SKILL.md +232 -0
- package/skills/tools/document/paper-parse-guide/SKILL.md +243 -0
- package/skills/tools/document/weknora-guide/SKILL.md +216 -0
- package/skills/tools/document/zotero-addon-market-guide/SKILL.md +108 -0
- package/skills/tools/document/zotero-night-theme-guide/SKILL.md +142 -0
- package/skills/tools/document/zotero-style-guide/SKILL.md +217 -0
- package/skills/tools/knowledge-graph/citation-network-builder/SKILL.md +244 -0
- package/skills/tools/knowledge-graph/concept-map-generator/SKILL.md +284 -0
- package/skills/tools/knowledge-graph/graphiti-guide/SKILL.md +219 -0
- package/skills/tools/knowledge-graph/mimir-memory-guide/SKILL.md +135 -0
- package/skills/tools/knowledge-graph/notero-zotero-notion-guide/SKILL.md +187 -0
- package/skills/tools/knowledge-graph/open-webui-tools-guide/SKILL.md +156 -0
- package/skills/tools/knowledge-graph/openspg-guide/SKILL.md +210 -0
- package/skills/tools/knowledge-graph/paperpile-notion-guide/SKILL.md +84 -0
- package/skills/tools/knowledge-graph/zotero-markdb-connect-guide/SKILL.md +162 -0
- package/skills/tools/ocr-translate/latex-translation-guide/SKILL.md +176 -0
- package/skills/tools/ocr-translate/math-equation-renderer/SKILL.md +198 -0
- package/skills/tools/ocr-translate/pdf-math-translate-guide/SKILL.md +141 -0
- package/skills/tools/ocr-translate/zotero-pdf-translate-guide/SKILL.md +95 -0
- package/skills/tools/ocr-translate/zotero-pdf2zh-guide/SKILL.md +143 -0
- package/skills/tools/scraping/dataset-finder-guide/SKILL.md +253 -0
- package/skills/tools/scraping/easy-spider-guide/SKILL.md +250 -0
- package/skills/tools/scraping/google-scholar-scraper/SKILL.md +255 -0
- package/skills/tools/scraping/repository-harvesting-guide/SKILL.md +310 -0
- package/skills/writing/citation/academic-citation-manager/SKILL.md +314 -0
- package/skills/writing/citation/academic-citation-manager-guide/SKILL.md +182 -0
- package/skills/writing/citation/citation-assistant-skill/SKILL.md +192 -0
- package/skills/writing/citation/jabref-reference-guide/SKILL.md +127 -0
- package/skills/writing/citation/jasminum-zotero-guide/SKILL.md +103 -0
- package/skills/writing/citation/mendeley-api/SKILL.md +231 -0
- package/skills/writing/citation/obsidian-citation-guide/SKILL.md +164 -0
- package/skills/writing/citation/obsidian-zotero-guide/SKILL.md +137 -0
- package/skills/writing/citation/onecite-reference-guide/SKILL.md +168 -0
- package/skills/writing/citation/papersgpt-zotero-guide/SKILL.md +132 -0
- package/skills/writing/citation/papis-cli-guide/SKILL.md +213 -0
- package/skills/writing/citation/zotero-better-bibtex-guide/SKILL.md +107 -0
- package/skills/writing/citation/zotero-better-notes-guide/SKILL.md +121 -0
- package/skills/writing/citation/zotero-gpt-guide/SKILL.md +111 -0
- package/skills/writing/citation/zotero-mcp-guide/SKILL.md +164 -0
- package/skills/writing/citation/zotero-mdnotes-guide/SKILL.md +162 -0
- package/skills/writing/citation/zotero-reference-guide/SKILL.md +139 -0
- package/skills/writing/citation/zotero-scholar-guide/SKILL.md +294 -0
- package/skills/writing/citation/zotfile-attachment-guide/SKILL.md +140 -0
- package/skills/writing/composition/ml-paper-writing/SKILL.md +163 -0
- package/skills/writing/composition/opendraft-thesis-guide/SKILL.md +200 -0
- package/skills/writing/composition/paper-debugger-guide/SKILL.md +143 -0
- package/skills/writing/composition/paperforge-guide/SKILL.md +205 -0
- package/skills/writing/composition/research-paper-writer/SKILL.md +226 -0
- package/skills/writing/composition/scientific-writing-resources/SKILL.md +151 -0
- package/skills/writing/composition/scientific-writing-wrapper/SKILL.md +153 -0
- package/skills/writing/latex/academic-writing-latex/SKILL.md +285 -0
- package/skills/writing/latex/latex-drawing-collection/SKILL.md +154 -0
- package/skills/writing/latex/latex-templates-collection/SKILL.md +159 -0
- package/skills/writing/latex/md-to-pdf-academic/SKILL.md +230 -0
- package/skills/writing/latex/tex-render-guide/SKILL.md +243 -0
- package/skills/writing/polish/academic-tone-guide/SKILL.md +209 -0
- package/skills/writing/polish/chinese-text-humanizer/SKILL.md +140 -0
- package/skills/writing/polish/conciseness-editing-guide/SKILL.md +225 -0
- package/skills/writing/polish/paper-polish-guide/SKILL.md +160 -0
- package/skills/writing/templates/arxiv-preprint-template/SKILL.md +184 -0
- package/skills/writing/templates/elegant-paper-template/SKILL.md +141 -0
- package/skills/writing/templates/graphical-abstract-guide/SKILL.md +183 -0
- package/skills/writing/templates/novathesis-guide/SKILL.md +152 -0
- package/skills/writing/templates/scientific-article-pdf/SKILL.md +261 -0
- package/skills/writing/templates/sjtuthesis-guide/SKILL.md +197 -0
- package/skills/writing/templates/thuthesis-guide/SKILL.md +181 -0
- package/skills/literature/fulltext/repository-harvesting-guide/SKILL.md +0 -207
|
@@ -0,0 +1,133 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: arxiv-batch-reporting
|
|
3
|
+
description: "Batch search and report generation from arXiv preprint repository"
|
|
4
|
+
metadata:
|
|
5
|
+
openclaw:
|
|
6
|
+
emoji: "📊"
|
|
7
|
+
category: "literature"
|
|
8
|
+
subcategory: "search"
|
|
9
|
+
keywords: ["arxiv", "batch search", "preprint", "report generation", "literature monitoring", "research trends"]
|
|
10
|
+
source: "https://github.com/sspaeti/arxiv-batch-search"
|
|
11
|
+
---
|
|
12
|
+
|
|
13
|
+
# arXiv Batch Reporting
|
|
14
|
+
|
|
15
|
+
## Overview
|
|
16
|
+
|
|
17
|
+
Keeping up with the flood of new preprints on arXiv is one of the most persistent challenges in fast-moving fields like machine learning, physics, mathematics, and computer science. The arXiv Batch Reporting skill provides a systematic approach to searching, filtering, and generating structured reports from arXiv at scale.
|
|
18
|
+
|
|
19
|
+
Unlike ad-hoc manual searches, this skill enables researchers to define persistent query profiles, run batch searches across date ranges, and produce formatted reports that highlight the most relevant papers. It is particularly useful for weekly or monthly literature surveillance, lab meeting preparation, and trend analysis across subfields.
|
|
20
|
+
|
|
21
|
+
The skill leverages the arXiv API and supports advanced query syntax, category filtering, and result ranking by relevance or recency. Reports can be generated in Markdown, HTML, or CSV formats for integration into existing workflows.
|
|
22
|
+
|
|
23
|
+
## Setting Up Batch Queries
|
|
24
|
+
|
|
25
|
+
### Query Profile Definition
|
|
26
|
+
|
|
27
|
+
Define your search profiles as structured configurations. Each profile specifies the search terms, category filters, date range, and output preferences:
|
|
28
|
+
|
|
29
|
+
```yaml
|
|
30
|
+
profile_name: "transformer-architectures-weekly"
|
|
31
|
+
queries:
|
|
32
|
+
- "ti:transformer AND abs:attention mechanism"
|
|
33
|
+
- "ti:vision transformer"
|
|
34
|
+
- "abs:efficient transformer AND cat:cs.LG"
|
|
35
|
+
categories:
|
|
36
|
+
- cs.LG
|
|
37
|
+
- cs.CL
|
|
38
|
+
- cs.CV
|
|
39
|
+
date_range: "last_7_days"
|
|
40
|
+
max_results: 100
|
|
41
|
+
sort_by: "submittedDate"
|
|
42
|
+
sort_order: "descending"
|
|
43
|
+
```
|
|
44
|
+
|
|
45
|
+
### arXiv API Query Syntax
|
|
46
|
+
|
|
47
|
+
The arXiv API supports field-specific searches:
|
|
48
|
+
|
|
49
|
+
- `ti:` — Search in title
|
|
50
|
+
- `abs:` — Search in abstract
|
|
51
|
+
- `au:` — Search by author
|
|
52
|
+
- `cat:` — Filter by category (e.g., `cs.AI`, `math.PR`, `physics.comp-ph`)
|
|
53
|
+
- Boolean operators: `AND`, `OR`, `ANDNOT`
|
|
54
|
+
- Group with parentheses for complex queries
|
|
55
|
+
|
|
56
|
+
**Example queries:**
|
|
57
|
+
- Find recent GAN papers in computer vision: `abs:generative adversarial AND cat:cs.CV`
|
|
58
|
+
- Find a specific author's work: `au:bengio AND ti:deep learning`
|
|
59
|
+
- Exclude survey papers: `abs:reinforcement learning ANDNOT ti:survey`
|
|
60
|
+
|
|
61
|
+
### Rate Limiting and Pagination
|
|
62
|
+
|
|
63
|
+
The arXiv API enforces rate limits. Follow these guidelines:
|
|
64
|
+
|
|
65
|
+
- Wait at least 3 seconds between API requests
|
|
66
|
+
- Use pagination with `start` and `max_results` parameters (max 2000 per request)
|
|
67
|
+
- For large batch jobs, implement exponential backoff on HTTP 503 responses
|
|
68
|
+
- Cache results locally to avoid redundant API calls
|
|
69
|
+
|
|
70
|
+
## Report Generation
|
|
71
|
+
|
|
72
|
+
### Standard Report Template
|
|
73
|
+
|
|
74
|
+
After collecting batch results, generate a report with the following structure:
|
|
75
|
+
|
|
76
|
+
```markdown
|
|
77
|
+
# arXiv Batch Report: [Profile Name]
|
|
78
|
+
**Date range:** [start] to [end]
|
|
79
|
+
**Total results:** [N] papers
|
|
80
|
+
**Generated:** [timestamp]
|
|
81
|
+
|
|
82
|
+
## Highlights (Top 10 by Relevance)
|
|
83
|
+
| # | Title | Authors | Category | Date |
|
|
84
|
+
|---|-------|---------|----------|------|
|
|
85
|
+
| 1 | [Title](arxiv-link) | First Author et al. | cs.LG | 2026-03-08 |
|
|
86
|
+
|
|
87
|
+
## Category Breakdown
|
|
88
|
+
- cs.LG: 45 papers
|
|
89
|
+
- cs.CL: 23 papers
|
|
90
|
+
- cs.CV: 18 papers
|
|
91
|
+
|
|
92
|
+
## Keyword Frequency
|
|
93
|
+
- "transformer": 38 mentions
|
|
94
|
+
- "attention": 29 mentions
|
|
95
|
+
- "efficient": 15 mentions
|
|
96
|
+
|
|
97
|
+
## Full Results
|
|
98
|
+
[Expandable table with all papers]
|
|
99
|
+
```
|
|
100
|
+
|
|
101
|
+
### Filtering and Ranking
|
|
102
|
+
|
|
103
|
+
After retrieving raw results, apply post-processing filters to surface the most relevant papers:
|
|
104
|
+
|
|
105
|
+
1. **Relevance scoring**: Score each paper based on keyword density in the title and abstract relative to your query terms.
|
|
106
|
+
2. **Author filtering**: Boost papers from authors on your watch list (key researchers in your field).
|
|
107
|
+
3. **Citation proxy**: Papers that appear in multiple query results likely sit at the intersection of your interests—rank them higher.
|
|
108
|
+
4. **Novelty detection**: Flag papers whose abstracts contain terms not seen in your previous reports, indicating potentially new directions.
|
|
109
|
+
|
|
110
|
+
## Automation and Scheduling
|
|
111
|
+
|
|
112
|
+
For ongoing literature surveillance, automate your batch reports:
|
|
113
|
+
|
|
114
|
+
- **Cron scheduling**: Run batch queries weekly (e.g., every Monday at 8 AM) using a scheduled task or CI pipeline.
|
|
115
|
+
- **Diff reports**: Compare the current week's results against the previous week to highlight only new papers.
|
|
116
|
+
- **Alert thresholds**: Set alerts when a report contains more than N papers matching a high-priority query, indicating a burst of activity in that area.
|
|
117
|
+
- **Email or Slack delivery**: Route generated reports to your inbox or lab Slack channel for team-wide awareness.
|
|
118
|
+
|
|
119
|
+
Store all generated reports in a versioned directory structure for longitudinal trend analysis:
|
|
120
|
+
|
|
121
|
+
```
|
|
122
|
+
reports/
|
|
123
|
+
transformer-architectures-weekly/
|
|
124
|
+
2026-03-03.md
|
|
125
|
+
2026-03-10.md
|
|
126
|
+
...
|
|
127
|
+
```
|
|
128
|
+
|
|
129
|
+
## References
|
|
130
|
+
|
|
131
|
+
- arXiv API documentation: https://info.arxiv.org/help/api/index.html
|
|
132
|
+
- arXiv category taxonomy: https://arxiv.org/category_taxonomy
|
|
133
|
+
- arXiv Batch Search: https://github.com/sspaeti/arxiv-batch-search
|
|
@@ -0,0 +1,172 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: arxiv-cli-tools
|
|
3
|
+
description: "Command-line tools for searching and batch-downloading arXiv papers"
|
|
4
|
+
metadata:
|
|
5
|
+
openclaw:
|
|
6
|
+
emoji: "🔍"
|
|
7
|
+
category: "literature"
|
|
8
|
+
subcategory: "search"
|
|
9
|
+
keywords: ["arxiv", "command line", "paper download", "preprint search", "batch download", "literature retrieval"]
|
|
10
|
+
source: "https://pypi.org/project/arxiv-cli-tools/"
|
|
11
|
+
---
|
|
12
|
+
|
|
13
|
+
# arXiv CLI Tools
|
|
14
|
+
|
|
15
|
+
## Overview
|
|
16
|
+
|
|
17
|
+
`arxiv-cli-tools` is a Python command-line interface for searching and downloading papers from arXiv.org. It wraps the `arxiv` Python client library into convenient CLI commands, enabling researchers to search by keyword, author, or category, view abstracts, and batch-download PDFs directly from the terminal. No API key is required.
|
|
18
|
+
|
|
19
|
+
## Installation
|
|
20
|
+
|
|
21
|
+
```bash
|
|
22
|
+
# Recommended: isolated install with pipx
|
|
23
|
+
pipx install arxiv-cli-tools
|
|
24
|
+
|
|
25
|
+
# Alternative: pip
|
|
26
|
+
pip install arxiv-cli-tools
|
|
27
|
+
|
|
28
|
+
# Verify installation
|
|
29
|
+
arxiv-cli --help
|
|
30
|
+
```
|
|
31
|
+
|
|
32
|
+
## Searching Papers
|
|
33
|
+
|
|
34
|
+
### Basic Search
|
|
35
|
+
|
|
36
|
+
```bash
|
|
37
|
+
# Search by keyword (default: 10 results)
|
|
38
|
+
arxiv-cli search "transformer attention mechanism"
|
|
39
|
+
|
|
40
|
+
# Limit results
|
|
41
|
+
arxiv-cli search "quantum computing" -n 5
|
|
42
|
+
|
|
43
|
+
# Show abstracts in results
|
|
44
|
+
arxiv-cli search "prompt engineering" -n 5 --summary
|
|
45
|
+
```
|
|
46
|
+
|
|
47
|
+
### Filtered Search
|
|
48
|
+
|
|
49
|
+
```bash
|
|
50
|
+
# Filter by author
|
|
51
|
+
arxiv-cli search "attention mechanism" --authors "Vaswani"
|
|
52
|
+
|
|
53
|
+
# Filter by arXiv category
|
|
54
|
+
arxiv-cli search "neural networks" --categories "cs.LG,cs.AI"
|
|
55
|
+
|
|
56
|
+
# Combine filters
|
|
57
|
+
arxiv-cli search "protein folding" --categories "q-bio" -n 20 --summary
|
|
58
|
+
```
|
|
59
|
+
|
|
60
|
+
### Common arXiv Categories
|
|
61
|
+
|
|
62
|
+
| Prefix | Field | Popular Subcategories |
|
|
63
|
+
|--------|-------|----------------------|
|
|
64
|
+
| `cs` | Computer Science | cs.AI, cs.CL, cs.CV, cs.LG, cs.SE |
|
|
65
|
+
| `math` | Mathematics | math.ST, math.OC, math.PR |
|
|
66
|
+
| `physics` | Physics | physics.comp-ph, hep-th, cond-mat |
|
|
67
|
+
| `stat` | Statistics | stat.ML, stat.ME, stat.TH |
|
|
68
|
+
| `q-bio` | Quantitative Biology | q-bio.BM, q-bio.GN |
|
|
69
|
+
| `q-fin` | Quantitative Finance | q-fin.ST, q-fin.PM |
|
|
70
|
+
| `econ` | Economics | econ.EM, econ.GN |
|
|
71
|
+
| `eess` | Electrical Engineering | eess.SP, eess.AS |
|
|
72
|
+
|
|
73
|
+
## Downloading Papers
|
|
74
|
+
|
|
75
|
+
### Single Paper
|
|
76
|
+
|
|
77
|
+
```bash
|
|
78
|
+
# Download by arXiv ID
|
|
79
|
+
arxiv-cli download --id 1706.03762 --dest ~/papers
|
|
80
|
+
|
|
81
|
+
# Download PDF format explicitly
|
|
82
|
+
arxiv-cli download --id 2301.13688 --dest ~/papers --pdf
|
|
83
|
+
```
|
|
84
|
+
|
|
85
|
+
### Batch Download
|
|
86
|
+
|
|
87
|
+
```bash
|
|
88
|
+
# Download multiple papers
|
|
89
|
+
arxiv-cli download --id 1706.03762 --id 2301.13688 --id 2303.08774 \
|
|
90
|
+
--dest ~/papers/transformers
|
|
91
|
+
|
|
92
|
+
# Skip already downloaded files
|
|
93
|
+
arxiv-cli download --id 1706.03762 --id 2301.13688 \
|
|
94
|
+
--dest ~/papers --skip-existing
|
|
95
|
+
```
|
|
96
|
+
|
|
97
|
+
### Download from Search Results
|
|
98
|
+
|
|
99
|
+
A common workflow is to search first, then download selected papers:
|
|
100
|
+
|
|
101
|
+
```bash
|
|
102
|
+
# 1. Search and note IDs
|
|
103
|
+
arxiv-cli search "diffusion models survey" -n 10 --summary
|
|
104
|
+
|
|
105
|
+
# 2. Download the relevant ones
|
|
106
|
+
arxiv-cli download --id 2209.00796 --id 2206.00364 --dest ~/papers/diffusion
|
|
107
|
+
```
|
|
108
|
+
|
|
109
|
+
## Python API Alternative
|
|
110
|
+
|
|
111
|
+
For programmatic use, the underlying `arxiv` library provides a Python API:
|
|
112
|
+
|
|
113
|
+
```python
|
|
114
|
+
import arxiv
|
|
115
|
+
|
|
116
|
+
# Search
|
|
117
|
+
search = arxiv.Search(
|
|
118
|
+
query="large language models",
|
|
119
|
+
max_results=10,
|
|
120
|
+
sort_by=arxiv.SortCriterion.SubmittedDate
|
|
121
|
+
)
|
|
122
|
+
|
|
123
|
+
for result in arxiv.Client().results(search):
|
|
124
|
+
print(f"{result.entry_id}: {result.title}")
|
|
125
|
+
print(f" Authors: {', '.join(a.name for a in result.authors)}")
|
|
126
|
+
print(f" Published: {result.published.date()}")
|
|
127
|
+
print(f" PDF: {result.pdf_url}")
|
|
128
|
+
print()
|
|
129
|
+
|
|
130
|
+
# Download
|
|
131
|
+
result.download_pdf(dirpath="./papers", filename="paper.pdf")
|
|
132
|
+
```
|
|
133
|
+
|
|
134
|
+
## Workflow Integration
|
|
135
|
+
|
|
136
|
+
### Daily Paper Check Script
|
|
137
|
+
|
|
138
|
+
```bash
|
|
139
|
+
#!/bin/bash
|
|
140
|
+
# Check for new papers in your research area
|
|
141
|
+
DATE=$(date +%Y-%m-%d)
|
|
142
|
+
LOG="$HOME/papers/daily_${DATE}.txt"
|
|
143
|
+
|
|
144
|
+
echo "=== arXiv Papers for $DATE ===" > "$LOG"
|
|
145
|
+
arxiv-cli search "retrieval augmented generation" \
|
|
146
|
+
--categories "cs.CL,cs.AI" -n 20 --summary >> "$LOG"
|
|
147
|
+
|
|
148
|
+
echo "Paper digest saved to $LOG"
|
|
149
|
+
```
|
|
150
|
+
|
|
151
|
+
### Export to BibTeX
|
|
152
|
+
|
|
153
|
+
After finding relevant papers, retrieve BibTeX entries via the arXiv API:
|
|
154
|
+
|
|
155
|
+
```bash
|
|
156
|
+
# Get BibTeX for a specific paper
|
|
157
|
+
curl -s "https://arxiv.org/bibtex/1706.03762"
|
|
158
|
+
```
|
|
159
|
+
|
|
160
|
+
## Rate Limits and Etiquette
|
|
161
|
+
|
|
162
|
+
- arXiv API allows **1 request per 3 seconds** for programmatic access
|
|
163
|
+
- For bulk downloads, add delays between requests
|
|
164
|
+
- The CLI tool respects rate limits by default
|
|
165
|
+
- See [arXiv API Terms of Use](https://info.arxiv.org/help/api/tou.html)
|
|
166
|
+
|
|
167
|
+
## References
|
|
168
|
+
|
|
169
|
+
- [arxiv-cli-tools on PyPI](https://pypi.org/project/arxiv-cli-tools/)
|
|
170
|
+
- [arxiv Python Client](https://github.com/lukasschwab/arxiv.py)
|
|
171
|
+
- [arXiv API Documentation](https://info.arxiv.org/help/api/)
|
|
172
|
+
- [arXiv Category Taxonomy](https://arxiv.org/category_taxonomy)
|
|
@@ -0,0 +1,199 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: arxiv-osiris
|
|
3
|
+
description: "Search and download arXiv papers via Python and PowerShell scripts"
|
|
4
|
+
metadata:
|
|
5
|
+
openclaw:
|
|
6
|
+
emoji: "🔍"
|
|
7
|
+
category: "literature"
|
|
8
|
+
subcategory: "search"
|
|
9
|
+
keywords: ["arxiv", "paper download", "preprint search", "python script", "powershell", "literature retrieval"]
|
|
10
|
+
source: "https://clawhub.com/kostaskyq/arxiv-osiris"
|
|
11
|
+
---
|
|
12
|
+
|
|
13
|
+
# arXiv Osiris — Paper Search and Download Tool
|
|
14
|
+
|
|
15
|
+
## Overview
|
|
16
|
+
|
|
17
|
+
arXiv Osiris provides cross-platform scripts (Python and PowerShell) for searching and downloading scientific papers from arXiv.org. It supports keyword search, category filtering, metadata retrieval, and direct PDF download. Useful for researchers who prefer scripted automation over browser-based arXiv access, particularly for building local paper collections.
|
|
18
|
+
|
|
19
|
+
## Installation
|
|
20
|
+
|
|
21
|
+
```bash
|
|
22
|
+
# Install the arxiv Python client (required dependency)
|
|
23
|
+
pip install arxiv
|
|
24
|
+
|
|
25
|
+
# Clone the tool (if using from source)
|
|
26
|
+
git clone https://github.com/kostaskyq/arxiv-osiris.git
|
|
27
|
+
```
|
|
28
|
+
|
|
29
|
+
## Usage — Python API
|
|
30
|
+
|
|
31
|
+
### Search for Papers
|
|
32
|
+
|
|
33
|
+
```python
|
|
34
|
+
import arxiv
|
|
35
|
+
|
|
36
|
+
# Basic keyword search
|
|
37
|
+
search = arxiv.Search(
|
|
38
|
+
query="quantum computing error correction",
|
|
39
|
+
max_results=10,
|
|
40
|
+
sort_by=arxiv.SortCriterion.Relevance
|
|
41
|
+
)
|
|
42
|
+
|
|
43
|
+
client = arxiv.Client()
|
|
44
|
+
for result in client.results(search):
|
|
45
|
+
print(f"ID: {result.entry_id}")
|
|
46
|
+
print(f"Title: {result.title}")
|
|
47
|
+
print(f"Authors: {', '.join(a.name for a in result.authors)}")
|
|
48
|
+
print(f"Published:{result.published.strftime('%Y-%m-%d')}")
|
|
49
|
+
print(f"PDF: {result.pdf_url}")
|
|
50
|
+
print(f"Abstract: {result.summary[:200]}...")
|
|
51
|
+
print()
|
|
52
|
+
```
|
|
53
|
+
|
|
54
|
+
### Category-Filtered Search
|
|
55
|
+
|
|
56
|
+
```python
|
|
57
|
+
# Search within specific categories
|
|
58
|
+
search = arxiv.Search(
|
|
59
|
+
query="cat:cs.CL AND transformer",
|
|
60
|
+
max_results=20,
|
|
61
|
+
sort_by=arxiv.SortCriterion.SubmittedDate
|
|
62
|
+
)
|
|
63
|
+
|
|
64
|
+
# Multiple categories
|
|
65
|
+
search = arxiv.Search(
|
|
66
|
+
query="(cat:cs.AI OR cat:cs.LG) AND reinforcement learning",
|
|
67
|
+
max_results=15
|
|
68
|
+
)
|
|
69
|
+
```
|
|
70
|
+
|
|
71
|
+
### Download Papers
|
|
72
|
+
|
|
73
|
+
```python
|
|
74
|
+
import os
|
|
75
|
+
|
|
76
|
+
search = arxiv.Search(query="attention mechanism", max_results=5)
|
|
77
|
+
client = arxiv.Client()
|
|
78
|
+
download_dir = os.path.expanduser("~/papers/attention")
|
|
79
|
+
os.makedirs(download_dir, exist_ok=True)
|
|
80
|
+
|
|
81
|
+
for result in client.results(search):
|
|
82
|
+
# Download PDF
|
|
83
|
+
result.download_pdf(dirpath=download_dir)
|
|
84
|
+
print(f"Downloaded: {result.title}")
|
|
85
|
+
|
|
86
|
+
# Download source (LaTeX) if available
|
|
87
|
+
result.download_source(dirpath=download_dir)
|
|
88
|
+
```
|
|
89
|
+
|
|
90
|
+
## Usage — PowerShell Script
|
|
91
|
+
|
|
92
|
+
### Search
|
|
93
|
+
|
|
94
|
+
```powershell
|
|
95
|
+
# Basic search
|
|
96
|
+
.\arxiv.ps1 -Action search -Query "machine learning"
|
|
97
|
+
|
|
98
|
+
# With max results
|
|
99
|
+
.\arxiv.ps1 -Action search -Query "neural networks" -MaxResults 10
|
|
100
|
+
|
|
101
|
+
# Filter by category
|
|
102
|
+
.\arxiv.ps1 -Action search -Query "deep learning" -Categories "cs,stat"
|
|
103
|
+
```
|
|
104
|
+
|
|
105
|
+
### Download
|
|
106
|
+
|
|
107
|
+
```powershell
|
|
108
|
+
# Download by arXiv ID
|
|
109
|
+
.\arxiv.ps1 -Action download -ArxivId "1706.03762"
|
|
110
|
+
|
|
111
|
+
# Download to specific directory
|
|
112
|
+
.\arxiv.ps1 -Action download -ArxivId "2301.13688" -OutputDir "C:\Papers"
|
|
113
|
+
```
|
|
114
|
+
|
|
115
|
+
## Advanced Queries
|
|
116
|
+
|
|
117
|
+
The arXiv API supports a rich query syntax:
|
|
118
|
+
|
|
119
|
+
| Operator | Meaning | Example |
|
|
120
|
+
|----------|---------|---------|
|
|
121
|
+
| `AND` | Both terms | `"deep learning" AND "drug discovery"` |
|
|
122
|
+
| `OR` | Either term | `"GAN" OR "diffusion model"` |
|
|
123
|
+
| `ANDNOT` | Exclude term | `"NLP" ANDNOT "translation"` |
|
|
124
|
+
| `au:` | Author | `au:"Hinton"` |
|
|
125
|
+
| `ti:` | Title contains | `ti:"attention"` |
|
|
126
|
+
| `abs:` | Abstract contains | `abs:"protein folding"` |
|
|
127
|
+
| `cat:` | Category | `cat:cs.CV` |
|
|
128
|
+
|
|
129
|
+
### Complex Query Examples
|
|
130
|
+
|
|
131
|
+
```python
|
|
132
|
+
# Papers by a specific author on a specific topic
|
|
133
|
+
search = arxiv.Search(query='au:"Yann LeCun" AND ti:"self-supervised"')
|
|
134
|
+
|
|
135
|
+
# Recent papers in two categories excluding surveys
|
|
136
|
+
search = arxiv.Search(
|
|
137
|
+
query='(cat:cs.CL OR cat:cs.AI) AND "large language model" ANDNOT ti:"survey"',
|
|
138
|
+
sort_by=arxiv.SortCriterion.SubmittedDate,
|
|
139
|
+
max_results=50
|
|
140
|
+
)
|
|
141
|
+
```
|
|
142
|
+
|
|
143
|
+
## Building a Local Paper Library
|
|
144
|
+
|
|
145
|
+
```python
|
|
146
|
+
import arxiv
|
|
147
|
+
import json
|
|
148
|
+
import os
|
|
149
|
+
from datetime import datetime
|
|
150
|
+
|
|
151
|
+
def build_library(queries: dict, base_dir: str = "~/papers"):
|
|
152
|
+
"""Build organized paper library from multiple search queries."""
|
|
153
|
+
base = os.path.expanduser(base_dir)
|
|
154
|
+
catalog = []
|
|
155
|
+
client = arxiv.Client()
|
|
156
|
+
|
|
157
|
+
for topic, query in queries.items():
|
|
158
|
+
topic_dir = os.path.join(base, topic)
|
|
159
|
+
os.makedirs(topic_dir, exist_ok=True)
|
|
160
|
+
|
|
161
|
+
search = arxiv.Search(query=query, max_results=20,
|
|
162
|
+
sort_by=arxiv.SortCriterion.SubmittedDate)
|
|
163
|
+
|
|
164
|
+
for paper in client.results(search):
|
|
165
|
+
paper.download_pdf(dirpath=topic_dir)
|
|
166
|
+
catalog.append({
|
|
167
|
+
"id": paper.entry_id,
|
|
168
|
+
"title": paper.title,
|
|
169
|
+
"authors": [a.name for a in paper.authors],
|
|
170
|
+
"published": paper.published.isoformat(),
|
|
171
|
+
"topic": topic,
|
|
172
|
+
"pdf_path": os.path.join(topic_dir, f"{paper.get_short_id()}.pdf")
|
|
173
|
+
})
|
|
174
|
+
|
|
175
|
+
# Save catalog
|
|
176
|
+
with open(os.path.join(base, "catalog.json"), "w") as f:
|
|
177
|
+
json.dump(catalog, f, indent=2)
|
|
178
|
+
print(f"Library built: {len(catalog)} papers in {len(queries)} topics")
|
|
179
|
+
|
|
180
|
+
# Usage
|
|
181
|
+
build_library({
|
|
182
|
+
"rag": "cat:cs.CL AND retrieval augmented generation",
|
|
183
|
+
"agents": "cat:cs.AI AND (LLM agent OR tool use)",
|
|
184
|
+
"evaluation": "cat:cs.CL AND (benchmark OR evaluation) AND language model"
|
|
185
|
+
})
|
|
186
|
+
```
|
|
187
|
+
|
|
188
|
+
## Rate Limits
|
|
189
|
+
|
|
190
|
+
- arXiv API: **1 request per 3 seconds** for automated access
|
|
191
|
+
- The `arxiv` Python client handles rate limiting automatically
|
|
192
|
+
- For large-scale downloads, add explicit delays: `time.sleep(3)`
|
|
193
|
+
- Respect [arXiv API Terms of Use](https://info.arxiv.org/help/api/tou.html)
|
|
194
|
+
|
|
195
|
+
## References
|
|
196
|
+
|
|
197
|
+
- [arxiv Python Client](https://github.com/lukasschwab/arxiv.py)
|
|
198
|
+
- [arXiv API User Manual](https://info.arxiv.org/help/api/user-manual.html)
|
|
199
|
+
- [arXiv Category Taxonomy](https://arxiv.org/category_taxonomy)
|
|
@@ -0,0 +1,141 @@
|
|
|
1
|
+
---
|
|
2
|
+
name: arxiv-paper-processor
|
|
3
|
+
description: "Process and analyze arXiv papers systematically for research workflows"
|
|
4
|
+
metadata:
|
|
5
|
+
openclaw:
|
|
6
|
+
emoji: "⚙️"
|
|
7
|
+
category: "literature"
|
|
8
|
+
subcategory: "search"
|
|
9
|
+
keywords: ["arxiv", "paper processing", "PDF parsing", "metadata extraction", "preprint analysis", "research pipeline"]
|
|
10
|
+
source: "https://github.com/tatsu-lab/gpt_paper_assistant"
|
|
11
|
+
---
|
|
12
|
+
|
|
13
|
+
# arXiv Paper Processor
|
|
14
|
+
|
|
15
|
+
## Overview
|
|
16
|
+
|
|
17
|
+
The arXiv Paper Processor skill provides a complete pipeline for downloading, parsing, and analyzing arXiv papers programmatically. While the arXiv API provides metadata, researchers often need to work with the full text—extracting sections, equations, figures, and references for deeper analysis.
|
|
18
|
+
|
|
19
|
+
This skill covers the entire processing chain: retrieving papers by ID or search query, downloading PDF and LaTeX source files, extracting structured content, and producing analysis-ready outputs. It is particularly valuable for researchers conducting large-scale literature analysis, building training datasets from academic text, or automating evidence extraction for systematic reviews.
|
|
20
|
+
|
|
21
|
+
The pipeline handles common challenges in academic PDF processing including multi-column layouts, mathematical notation, table extraction, and reference parsing. It integrates with tools like GROBID for PDF parsing and can work directly with arXiv LaTeX sources for higher-fidelity extraction.
|
|
22
|
+
|
|
23
|
+
## Paper Retrieval and Download
|
|
24
|
+
|
|
25
|
+
### Fetching by arXiv ID
|
|
26
|
+
|
|
27
|
+
The most reliable method is to fetch papers by their arXiv identifier:
|
|
28
|
+
|
|
29
|
+
```python
|
|
30
|
+
import urllib.request
|
|
31
|
+
import feedparser
|
|
32
|
+
|
|
33
|
+
# Fetch metadata via Atom feed
|
|
34
|
+
arxiv_id = "2301.07041"
|
|
35
|
+
url = f"http://export.arxiv.org/api/query?id_list={arxiv_id}"
|
|
36
|
+
response = urllib.request.urlopen(url)
|
|
37
|
+
feed = feedparser.parse(response.read())
|
|
38
|
+
|
|
39
|
+
entry = feed.entries[0]
|
|
40
|
+
title = entry.title
|
|
41
|
+
abstract = entry.summary
|
|
42
|
+
authors = [a.name for a in entry.authors]
|
|
43
|
+
pdf_url = entry.links[1].href # PDF link
|
|
44
|
+
```
|
|
45
|
+
|
|
46
|
+
### Downloading Source Files
|
|
47
|
+
|
|
48
|
+
arXiv stores LaTeX source files for most papers. These provide much richer structure than PDFs:
|
|
49
|
+
|
|
50
|
+
```bash
|
|
51
|
+
# Download LaTeX source (typically a .tar.gz)
|
|
52
|
+
wget https://arxiv.org/e-print/2301.07041 -O paper_source.tar.gz
|
|
53
|
+
tar -xzf paper_source.tar.gz -C paper_source/
|
|
54
|
+
```
|
|
55
|
+
|
|
56
|
+
Source files contain the original `.tex` files, figures, bibliography files, and any custom style files. Parsing LaTeX directly gives you access to section structure, equations in their original notation, citation keys, and figure captions without the ambiguity of PDF extraction.
|
|
57
|
+
|
|
58
|
+
### Batch Download Guidelines
|
|
59
|
+
|
|
60
|
+
When downloading multiple papers, respect arXiv's usage policies:
|
|
61
|
+
|
|
62
|
+
- Limit requests to 1 per 3 seconds for API calls
|
|
63
|
+
- Use the arXiv bulk data access (S3 or GCS) for large-scale processing (1000+ papers)
|
|
64
|
+
- Cache all downloaded files locally and check before re-downloading
|
|
65
|
+
- Include a descriptive User-Agent header in your HTTP requests
|
|
66
|
+
|
|
67
|
+
## Content Extraction Pipeline
|
|
68
|
+
|
|
69
|
+
### PDF Extraction with GROBID
|
|
70
|
+
|
|
71
|
+
For papers where only PDF is available, use GROBID (GeneRation Of BIbliographic Data) for structured extraction:
|
|
72
|
+
|
|
73
|
+
```bash
|
|
74
|
+
# Run GROBID as a local service
|
|
75
|
+
docker run --rm -p 8070:8070 grobid/grobid:0.8.0
|
|
76
|
+
|
|
77
|
+
# Process a PDF
|
|
78
|
+
curl -X POST "http://localhost:8070/api/processFulltextDocument" \
|
|
79
|
+
-F "input=@paper.pdf" \
|
|
80
|
+
-F "consolidateHeader=1" \
|
|
81
|
+
-F "consolidateCitations=1" \
|
|
82
|
+
> paper_tei.xml
|
|
83
|
+
```
|
|
84
|
+
|
|
85
|
+
GROBID outputs TEI-XML with structured sections including:
|
|
86
|
+
- Header metadata (title, authors, affiliations, abstract)
|
|
87
|
+
- Body text with section hierarchy
|
|
88
|
+
- Equations (as MathML or raw text)
|
|
89
|
+
- Figure and table references
|
|
90
|
+
- Parsed bibliography entries with DOIs where available
|
|
91
|
+
|
|
92
|
+
### LaTeX Source Parsing
|
|
93
|
+
|
|
94
|
+
When LaTeX source is available, parse it directly for higher fidelity:
|
|
95
|
+
|
|
96
|
+
1. Identify the main `.tex` file (look for `\documentclass` or `\begin{document}`)
|
|
97
|
+
2. Resolve `\input{}` and `\include{}` directives to build the complete document
|
|
98
|
+
3. Extract sections using `\section{}`, `\subsection{}` markers
|
|
99
|
+
4. Extract equations from `equation`, `align`, `gather` environments
|
|
100
|
+
5. Parse `\cite{}` commands and cross-reference with the `.bib` file
|
|
101
|
+
6. Extract figure captions from `\caption{}` commands
|
|
102
|
+
|
|
103
|
+
### Structured Output Schema
|
|
104
|
+
|
|
105
|
+
Produce a standardized JSON output for each processed paper:
|
|
106
|
+
|
|
107
|
+
```json
|
|
108
|
+
{
|
|
109
|
+
"arxiv_id": "2301.07041",
|
|
110
|
+
"title": "Paper Title",
|
|
111
|
+
"authors": ["Author One", "Author Two"],
|
|
112
|
+
"abstract": "...",
|
|
113
|
+
"sections": [
|
|
114
|
+
{"heading": "Introduction", "level": 1, "text": "..."},
|
|
115
|
+
{"heading": "Related Work", "level": 1, "text": "..."}
|
|
116
|
+
],
|
|
117
|
+
"equations": ["E = mc^2", "..."],
|
|
118
|
+
"figures": [{"id": "fig1", "caption": "..."}],
|
|
119
|
+
"references": [{"key": "smith2020", "title": "...", "doi": "..."}],
|
|
120
|
+
"processed_date": "2026-03-10"
|
|
121
|
+
}
|
|
122
|
+
```
|
|
123
|
+
|
|
124
|
+
## Analysis and Integration
|
|
125
|
+
|
|
126
|
+
Once papers are processed into structured format, several downstream analyses become possible:
|
|
127
|
+
|
|
128
|
+
- **Section-level search**: Search across the methods sections of hundreds of papers to find specific techniques.
|
|
129
|
+
- **Equation extraction**: Build a database of mathematical formulations used in your subfield.
|
|
130
|
+
- **Citation graph construction**: Map which papers cite which, using extracted reference lists.
|
|
131
|
+
- **Terminology tracking**: Monitor how specific terms evolve in usage frequency over time.
|
|
132
|
+
- **Dataset identification**: Extract mentions of datasets and benchmarks from experimental sections.
|
|
133
|
+
|
|
134
|
+
Integrate processed outputs with your reference manager by generating BibTeX entries enriched with extracted metadata, or feed structured JSON into a local search index for full-text retrieval across your paper collection.
|
|
135
|
+
|
|
136
|
+
## References
|
|
137
|
+
|
|
138
|
+
- arXiv API: https://info.arxiv.org/help/api/index.html
|
|
139
|
+
- GROBID: https://github.com/kermitt2/grobid
|
|
140
|
+
- GPT Paper Assistant: https://github.com/tatsu-lab/gpt_paper_assistant
|
|
141
|
+
- arXiv bulk data access: https://info.arxiv.org/help/bulk_data/index.html
|