@wentorai/research-plugins 1.0.0 → 1.2.0

Files changed (415)
  1. package/README.md +22 -22
  2. package/curated/analysis/README.md +82 -56
  3. package/curated/domains/README.md +225 -69
  4. package/curated/literature/README.md +115 -46
  5. package/curated/research/README.md +106 -58
  6. package/curated/tools/README.md +107 -87
  7. package/curated/writing/README.md +92 -45
  8. package/mcp-configs/academic-db/alphafold-mcp.json +20 -0
  9. package/mcp-configs/academic-db/brightspace-mcp.json +21 -0
  10. package/mcp-configs/academic-db/climatiq-mcp.json +20 -0
  11. package/mcp-configs/academic-db/gibs-mcp.json +20 -0
  12. package/mcp-configs/academic-db/gis-mcp-server.json +22 -0
  13. package/mcp-configs/academic-db/google-earth-engine-mcp.json +21 -0
  14. package/mcp-configs/academic-db/m4-clinical-mcp.json +21 -0
  15. package/mcp-configs/academic-db/medical-mcp.json +21 -0
  16. package/mcp-configs/academic-db/nexonco-mcp.json +20 -0
  17. package/mcp-configs/academic-db/omop-mcp.json +20 -0
  18. package/mcp-configs/academic-db/onekgpd-mcp.json +20 -0
  19. package/mcp-configs/academic-db/openedu-mcp.json +20 -0
  20. package/mcp-configs/academic-db/opengenes-mcp.json +20 -0
  21. package/mcp-configs/academic-db/openstax-mcp.json +21 -0
  22. package/mcp-configs/academic-db/openstreetmap-mcp.json +21 -0
  23. package/mcp-configs/academic-db/opentargets-mcp.json +21 -0
  24. package/mcp-configs/academic-db/pdb-mcp.json +21 -0
  25. package/mcp-configs/academic-db/smithsonian-mcp.json +20 -0
  26. package/mcp-configs/ai-platform/magi-researchers.json +21 -0
  27. package/mcp-configs/ai-platform/mcp-academic-researcher.json +22 -0
  28. package/mcp-configs/ai-platform/open-paper-machine.json +21 -0
  29. package/mcp-configs/ai-platform/paper-intelligence.json +21 -0
  30. package/mcp-configs/ai-platform/paper-reader.json +21 -0
  31. package/mcp-configs/ai-platform/paperdebugger.json +21 -0
  32. package/mcp-configs/browser/exa-mcp.json +20 -0
  33. package/mcp-configs/browser/mcp-searxng.json +21 -0
  34. package/mcp-configs/browser/mcp-webresearch.json +20 -0
  35. package/mcp-configs/cloud-docs/confluence-mcp.json +37 -0
  36. package/mcp-configs/cloud-docs/google-drive-mcp.json +35 -0
  37. package/mcp-configs/cloud-docs/notion-mcp.json +29 -0
  38. package/mcp-configs/communication/discord-mcp.json +29 -0
  39. package/mcp-configs/communication/discourse-mcp.json +21 -0
  40. package/mcp-configs/communication/slack-mcp.json +29 -0
  41. package/mcp-configs/communication/telegram-mcp.json +28 -0
  42. package/mcp-configs/data-platform/automl-stat-mcp.json +21 -0
  43. package/mcp-configs/data-platform/jefferson-stats-mcp.json +22 -0
  44. package/mcp-configs/data-platform/mcp-excel-server.json +21 -0
  45. package/mcp-configs/data-platform/mcp-stata.json +21 -0
  46. package/mcp-configs/data-platform/mcpstack-jupyter.json +21 -0
  47. package/mcp-configs/data-platform/ml-mcp.json +21 -0
  48. package/mcp-configs/data-platform/nasdaq-data-link-mcp.json +20 -0
  49. package/mcp-configs/data-platform/numpy-mcp.json +21 -0
  50. package/mcp-configs/database/neo4j-mcp.json +37 -0
  51. package/mcp-configs/database/postgres-mcp.json +28 -0
  52. package/mcp-configs/database/sqlite-mcp.json +29 -0
  53. package/mcp-configs/dev-platform/geogebra-mcp.json +21 -0
  54. package/mcp-configs/dev-platform/github-mcp.json +31 -0
  55. package/mcp-configs/dev-platform/gitlab-mcp.json +34 -0
  56. package/mcp-configs/dev-platform/latex-mcp-server.json +21 -0
  57. package/mcp-configs/dev-platform/manim-mcp.json +20 -0
  58. package/mcp-configs/dev-platform/mcp-echarts.json +20 -0
  59. package/mcp-configs/dev-platform/panel-viz-mcp.json +20 -0
  60. package/mcp-configs/dev-platform/paperbanana.json +20 -0
  61. package/mcp-configs/dev-platform/texflow-mcp.json +20 -0
  62. package/mcp-configs/dev-platform/texmcp.json +20 -0
  63. package/mcp-configs/dev-platform/typst-mcp.json +21 -0
  64. package/mcp-configs/dev-platform/vizro-mcp.json +20 -0
  65. package/mcp-configs/email/email-mcp.json +40 -0
  66. package/mcp-configs/email/gmail-mcp.json +37 -0
  67. package/mcp-configs/note-knowledge/local-faiss-mcp.json +21 -0
  68. package/mcp-configs/note-knowledge/mcp-memory-service.json +21 -0
  69. package/mcp-configs/note-knowledge/mcp-obsidian.json +23 -0
  70. package/mcp-configs/note-knowledge/mcp-ragdocs.json +20 -0
  71. package/mcp-configs/note-knowledge/mcp-summarizer.json +21 -0
  72. package/mcp-configs/note-knowledge/mediawiki-mcp.json +21 -0
  73. package/mcp-configs/note-knowledge/openzim-mcp.json +20 -0
  74. package/mcp-configs/note-knowledge/zettelkasten-mcp.json +21 -0
  75. package/mcp-configs/reference-mgr/academic-paper-mcp-http.json +20 -0
  76. package/mcp-configs/reference-mgr/academix.json +20 -0
  77. package/mcp-configs/reference-mgr/arxiv-research-mcp.json +21 -0
  78. package/mcp-configs/reference-mgr/google-scholar-abstract-mcp.json +19 -0
  79. package/mcp-configs/reference-mgr/google-scholar-mcp.json +20 -0
  80. package/mcp-configs/reference-mgr/mcp-paperswithcode.json +21 -0
  81. package/mcp-configs/reference-mgr/mcp-scholarly.json +20 -0
  82. package/mcp-configs/reference-mgr/mcp-simple-arxiv.json +20 -0
  83. package/mcp-configs/reference-mgr/mcp-simple-pubmed.json +20 -0
  84. package/mcp-configs/reference-mgr/mcp-zotero.json +21 -0
  85. package/mcp-configs/reference-mgr/mendeley-mcp.json +20 -0
  86. package/mcp-configs/reference-mgr/ncbi-mcp-server.json +22 -0
  87. package/mcp-configs/reference-mgr/onecite.json +21 -0
  88. package/mcp-configs/reference-mgr/paper-search-mcp.json +21 -0
  89. package/mcp-configs/reference-mgr/pubmed-search-mcp.json +21 -0
  90. package/mcp-configs/reference-mgr/scholar-mcp.json +21 -0
  91. package/mcp-configs/reference-mgr/scholar-multi-mcp.json +21 -0
  92. package/mcp-configs/reference-mgr/seerai.json +21 -0
  93. package/mcp-configs/reference-mgr/semantic-scholar-fastmcp.json +21 -0
  94. package/mcp-configs/reference-mgr/sourcelibrary.json +20 -0
  95. package/mcp-configs/registry.json +178 -149
  96. package/mcp-configs/repository/dataverse-mcp.json +33 -0
  97. package/mcp-configs/repository/huggingface-mcp.json +29 -0
  98. package/openclaw.plugin.json +2 -2
  99. package/package.json +2 -2
  100. package/skills/analysis/dataviz/algorithm-visualizer-guide/SKILL.md +259 -0
  101. package/skills/analysis/dataviz/bokeh-visualization-guide/SKILL.md +270 -0
  102. package/skills/analysis/dataviz/chart-image-generator/SKILL.md +229 -0
  103. package/skills/analysis/dataviz/citation-map-guide/SKILL.md +184 -0
  104. package/skills/analysis/dataviz/d3-visualization-guide/SKILL.md +281 -0
  105. package/skills/analysis/dataviz/data-visualization-principles/SKILL.md +171 -0
  106. package/skills/analysis/dataviz/echarts-visualization-guide/SKILL.md +250 -0
  107. package/skills/analysis/dataviz/metabase-analytics-guide/SKILL.md +242 -0
  108. package/skills/analysis/dataviz/plotly-interactive-guide/SKILL.md +266 -0
  109. package/skills/analysis/dataviz/redash-analytics-guide/SKILL.md +284 -0
  110. package/skills/analysis/econometrics/econml-causal-guide/SKILL.md +163 -0
  111. package/skills/analysis/econometrics/empirical-paper-analysis/SKILL.md +192 -0
  112. package/skills/analysis/econometrics/mostly-harmless-guide/SKILL.md +139 -0
  113. package/skills/analysis/econometrics/panel-data-analyst/SKILL.md +259 -0
  114. package/skills/analysis/econometrics/panel-data-regression-workflow/SKILL.md +267 -0
  115. package/skills/analysis/econometrics/python-causality-guide/SKILL.md +134 -0
  116. package/skills/analysis/econometrics/stata-accounting-guide/SKILL.md +269 -0
  117. package/skills/analysis/econometrics/stata-analyst-guide/SKILL.md +245 -0
  118. package/skills/analysis/econometrics/stata-reference-guide/SKILL.md +293 -0
  119. package/skills/analysis/statistics/data-anomaly-detection/SKILL.md +157 -0
  120. package/skills/analysis/statistics/general-statistics-guide/SKILL.md +226 -0
  121. package/skills/analysis/statistics/infiagent-benchmark-guide/SKILL.md +106 -0
  122. package/skills/analysis/statistics/ml-experiment-tracker/SKILL.md +212 -0
  123. package/skills/analysis/statistics/pywayne-statistics-guide/SKILL.md +192 -0
  124. package/skills/analysis/statistics/quantitative-methods-guide/SKILL.md +193 -0
  125. package/skills/analysis/statistics/senior-data-scientist-guide/SKILL.md +223 -0
  126. package/skills/analysis/wrangling/claude-data-analysis-guide/SKILL.md +100 -0
  127. package/skills/analysis/wrangling/csv-data-analyzer/SKILL.md +170 -0
  128. package/skills/analysis/wrangling/data-cleaning-pipeline/SKILL.md +266 -0
  129. package/skills/analysis/wrangling/data-cog-guide/SKILL.md +178 -0
  130. package/skills/analysis/wrangling/open-data-scientist-guide/SKILL.md +197 -0
  131. package/skills/analysis/wrangling/stata-data-cleaning/SKILL.md +276 -0
  132. package/skills/analysis/wrangling/streamline-analyst-guide/SKILL.md +119 -0
  133. package/skills/analysis/wrangling/survey-data-processing/SKILL.md +298 -0
  134. package/skills/domains/ai-ml/ai-agent-papers-guide/SKILL.md +146 -0
  135. package/skills/domains/ai-ml/ai-model-benchmarking/SKILL.md +209 -0
  136. package/skills/domains/ai-ml/annotated-dl-papers-guide/SKILL.md +159 -0
  137. package/skills/domains/ai-ml/anomaly-detection-papers-guide/SKILL.md +167 -0
  138. package/skills/domains/ai-ml/autonomous-agents-papers-guide/SKILL.md +178 -0
  139. package/skills/domains/ai-ml/dl-transformer-finetune/SKILL.md +239 -0
  140. package/skills/domains/ai-ml/domain-adaptation-papers-guide/SKILL.md +173 -0
  141. package/skills/domains/ai-ml/generative-ai-guide/SKILL.md +146 -0
  142. package/skills/domains/ai-ml/graph-learning-papers-guide/SKILL.md +125 -0
  143. package/skills/domains/ai-ml/huggingface-inference-guide/SKILL.md +196 -0
  144. package/skills/domains/ai-ml/keras-deep-learning/SKILL.md +210 -0
  145. package/skills/domains/ai-ml/kolmogorov-arnold-networks-guide/SKILL.md +185 -0
  146. package/skills/domains/ai-ml/llm-from-scratch-guide/SKILL.md +124 -0
  147. package/skills/domains/ai-ml/ml-pipeline-guide/SKILL.md +295 -0
  148. package/skills/domains/ai-ml/nlp-toolkit-guide/SKILL.md +247 -0
  149. package/skills/domains/ai-ml/npcpy-research-guide/SKILL.md +137 -0
  150. package/skills/domains/ai-ml/pytorch-guide/SKILL.md +281 -0
  151. package/skills/domains/ai-ml/pytorch-lightning-guide/SKILL.md +244 -0
  152. package/skills/domains/ai-ml/responsible-ai-guide/SKILL.md +126 -0
  153. package/skills/domains/ai-ml/tensorflow-guide/SKILL.md +241 -0
  154. package/skills/domains/ai-ml/vmas-simulator-guide/SKILL.md +129 -0
  155. package/skills/domains/biomedical/bioagents-guide/SKILL.md +308 -0
  156. package/skills/domains/biomedical/clawbio-guide/SKILL.md +167 -0
  157. package/skills/domains/biomedical/clinical-dialogue-agents-guide/SKILL.md +145 -0
  158. package/skills/domains/biomedical/ena-sequence-api/SKILL.md +175 -0
  159. package/skills/domains/biomedical/genomas-guide/SKILL.md +126 -0
  160. package/skills/domains/biomedical/genotex-benchmark-guide/SKILL.md +125 -0
  161. package/skills/domains/biomedical/med-researcher-guide/SKILL.md +161 -0
  162. package/skills/domains/biomedical/med-researcher-r1-guide/SKILL.md +146 -0
  163. package/skills/domains/biomedical/medgeclaw-guide/SKILL.md +345 -0
  164. package/skills/domains/biomedical/medical-imaging-guide/SKILL.md +305 -0
  165. package/skills/domains/biomedical/ncbi-blast-api/SKILL.md +195 -0
  166. package/skills/domains/biomedical/ncbi-datasets-api/SKILL.md +220 -0
  167. package/skills/domains/biomedical/quickgo-api/SKILL.md +181 -0
  168. package/skills/domains/business/architecture-design-guide/SKILL.md +279 -0
  169. package/skills/domains/business/innovation-management-guide/SKILL.md +257 -0
  170. package/skills/domains/business/operations-research-guide/SKILL.md +258 -0
  171. package/skills/domains/business/xpert-bi-guide/SKILL.md +84 -0
  172. package/skills/domains/chemistry/cactus-cheminformatics-guide/SKILL.md +89 -0
  173. package/skills/domains/chemistry/chemeagle-guide/SKILL.md +147 -0
  174. package/skills/domains/chemistry/chemgraph-agent-guide/SKILL.md +120 -0
  175. package/skills/domains/chemistry/molecular-dynamics-guide/SKILL.md +237 -0
  176. package/skills/domains/chemistry/pubchem-api-guide/SKILL.md +180 -0
  177. package/skills/domains/chemistry/spectroscopy-analysis-guide/SKILL.md +290 -0
  178. package/skills/domains/cs/ai-security-papers-guide/SKILL.md +103 -0
  179. package/skills/domains/cs/code-llm-papers-guide/SKILL.md +131 -0
  180. package/skills/domains/cs/distributed-systems-guide/SKILL.md +268 -0
  181. package/skills/domains/cs/formal-verification-guide/SKILL.md +298 -0
  182. package/skills/domains/cs/gaussian-splatting-papers-guide/SKILL.md +158 -0
  183. package/skills/domains/cs/llm-aiops-guide/SKILL.md +70 -0
  184. package/skills/domains/cs/software-heritage-api/SKILL.md +200 -0
  185. package/skills/domains/ecology/species-distribution-guide/SKILL.md +343 -0
  186. package/skills/domains/economics/imf-data-api-guide/SKILL.md +174 -0
  187. package/skills/domains/economics/nber-working-papers-api/SKILL.md +177 -0
  188. package/skills/domains/economics/post-labor-economics/SKILL.md +254 -0
  189. package/skills/domains/economics/pricing-psychology-guide/SKILL.md +273 -0
  190. package/skills/domains/economics/repec-economics-api/SKILL.md +188 -0
  191. package/skills/domains/economics/world-bank-data-guide/SKILL.md +179 -0
  192. package/skills/domains/education/academic-study-methods/SKILL.md +228 -0
  193. package/skills/domains/education/assessment-design-guide/SKILL.md +213 -0
  194. package/skills/domains/education/educational-research-methods/SKILL.md +179 -0
  195. package/skills/domains/education/edumcp-guide/SKILL.md +74 -0
  196. package/skills/domains/education/mooc-analytics-guide/SKILL.md +206 -0
  197. package/skills/domains/education/open-syllabus-api/SKILL.md +171 -0
  198. package/skills/domains/finance/akshare-finance-data/SKILL.md +207 -0
  199. package/skills/domains/finance/finsight-research-guide/SKILL.md +113 -0
  200. package/skills/domains/finance/options-analytics-agent-guide/SKILL.md +117 -0
  201. package/skills/domains/finance/portfolio-optimization-guide/SKILL.md +279 -0
  202. package/skills/domains/finance/risk-modeling-guide/SKILL.md +260 -0
  203. package/skills/domains/finance/stata-accounting-research/SKILL.md +372 -0
  204. package/skills/domains/geoscience/climate-modeling-guide/SKILL.md +215 -0
  205. package/skills/domains/geoscience/pangaea-data-api/SKILL.md +197 -0
  206. package/skills/domains/geoscience/satellite-remote-sensing/SKILL.md +193 -0
  207. package/skills/domains/geoscience/seismology-data-guide/SKILL.md +208 -0
  208. package/skills/domains/humanities/digital-humanities-methods/SKILL.md +232 -0
  209. package/skills/domains/humanities/ethical-philosophy-guide/SKILL.md +244 -0
  210. package/skills/domains/humanities/history-research-guide/SKILL.md +260 -0
  211. package/skills/domains/humanities/political-history-guide/SKILL.md +241 -0
  212. package/skills/domains/law/caselaw-access-api/SKILL.md +149 -0
  213. package/skills/domains/law/legal-agent-skills-guide/SKILL.md +132 -0
  214. package/skills/domains/law/legal-nlp-guide/SKILL.md +236 -0
  215. package/skills/domains/law/legal-research-methods/SKILL.md +190 -0
  216. package/skills/domains/law/opencontracts-guide/SKILL.md +168 -0
  217. package/skills/domains/law/patent-analysis-guide/SKILL.md +257 -0
  218. package/skills/domains/law/regulatory-compliance-guide/SKILL.md +267 -0
  219. package/skills/domains/math/lean-theorem-proving-guide/SKILL.md +140 -0
  220. package/skills/domains/math/symbolic-computation-guide/SKILL.md +263 -0
  221. package/skills/domains/math/topology-data-analysis/SKILL.md +305 -0
  222. package/skills/domains/pharma/clinical-trial-design-guide/SKILL.md +271 -0
  223. package/skills/domains/pharma/drug-target-interaction/SKILL.md +242 -0
  224. package/skills/domains/pharma/madd-drug-discovery-guide/SKILL.md +153 -0
  225. package/skills/domains/pharma/pharmacovigilance-guide/SKILL.md +216 -0
  226. package/skills/domains/physics/astrophysics-data-guide/SKILL.md +305 -0
  227. package/skills/domains/physics/particle-physics-guide/SKILL.md +287 -0
  228. package/skills/domains/social-science/ipums-microdata-api/SKILL.md +211 -0
  229. package/skills/domains/social-science/network-analysis-guide/SKILL.md +310 -0
  230. package/skills/domains/social-science/psychology-research-guide/SKILL.md +270 -0
  231. package/skills/domains/social-science/sociology-research-guide/SKILL.md +238 -0
  232. package/skills/domains/social-science/sociology-research-methods/SKILL.md +181 -0
  233. package/skills/literature/discovery/arxiv-paper-monitoring/SKILL.md +233 -0
  234. package/skills/literature/discovery/paper-recommendation-guide/SKILL.md +120 -0
  235. package/skills/literature/discovery/papers-we-love-guide/SKILL.md +169 -0
  236. package/skills/literature/discovery/semantic-paper-radar/SKILL.md +144 -0
  237. package/skills/literature/discovery/zotero-arxiv-daily-guide/SKILL.md +94 -0
  238. package/skills/literature/fulltext/bioc-pmc-api/SKILL.md +146 -0
  239. package/skills/literature/fulltext/core-api-guide/SKILL.md +144 -0
  240. package/skills/literature/fulltext/dataverse-api/SKILL.md +215 -0
  241. package/skills/literature/fulltext/hal-archive-api/SKILL.md +218 -0
  242. package/skills/literature/fulltext/institutional-repository-guide/SKILL.md +212 -0
  243. package/skills/literature/fulltext/open-access-mining-guide/SKILL.md +341 -0
  244. package/skills/literature/fulltext/osf-api/SKILL.md +212 -0
  245. package/skills/literature/fulltext/pmc-ftp-bulk-download/SKILL.md +182 -0
  246. package/skills/literature/fulltext/zotero-ai-butler-guide/SKILL.md +166 -0
  247. package/skills/literature/fulltext/zotero-scihub-guide/SKILL.md +168 -0
  248. package/skills/literature/metadata/academic-paper-summarizer/SKILL.md +101 -0
  249. package/skills/literature/metadata/bibliometrix-guide/SKILL.md +164 -0
  250. package/skills/literature/metadata/crossref-event-data-api/SKILL.md +183 -0
  251. package/skills/literature/metadata/doi-content-negotiation/SKILL.md +202 -0
  252. package/skills/literature/metadata/orkg-api/SKILL.md +153 -0
  253. package/skills/literature/metadata/plumx-metrics-api/SKILL.md +188 -0
  254. package/skills/literature/metadata/ror-organization-api/SKILL.md +208 -0
  255. package/skills/literature/metadata/sophosia-reference-guide/SKILL.md +110 -0
  256. package/skills/literature/metadata/viaf-authority-api/SKILL.md +209 -0
  257. package/skills/literature/metadata/wikidata-api-guide/SKILL.md +156 -0
  258. package/skills/literature/metadata/zoplicate-dedup-guide/SKILL.md +147 -0
  259. package/skills/literature/metadata/zotero-actions-tags-guide/SKILL.md +212 -0
  260. package/skills/literature/metadata/zotmoov-guide/SKILL.md +120 -0
  261. package/skills/literature/metadata/zutilo-guide/SKILL.md +140 -0
  262. package/skills/literature/search/arxiv-batch-reporting/SKILL.md +133 -0
  263. package/skills/literature/search/arxiv-cli-tools/SKILL.md +172 -0
  264. package/skills/literature/search/arxiv-osiris/SKILL.md +199 -0
  265. package/skills/literature/search/arxiv-paper-processor/SKILL.md +141 -0
  266. package/skills/literature/search/baidu-scholar-guide/SKILL.md +110 -0
  267. package/skills/literature/search/base-academic-search/SKILL.md +196 -0
  268. package/skills/literature/search/chatpaper-guide/SKILL.md +122 -0
  269. package/skills/literature/search/citeseerx-api/SKILL.md +183 -0
  270. package/skills/literature/search/deep-literature-search/SKILL.md +149 -0
  271. package/skills/literature/search/deepgit-search-guide/SKILL.md +147 -0
  272. package/skills/literature/search/eric-education-api/SKILL.md +199 -0
  273. package/skills/literature/search/findpapers-guide/SKILL.md +177 -0
  274. package/skills/literature/search/ieee-xplore-api/SKILL.md +177 -0
  275. package/skills/literature/search/lens-scholarly-api/SKILL.md +211 -0
  276. package/skills/literature/search/multi-database-literature-search/SKILL.md +198 -0
  277. package/skills/literature/search/open-library-api/SKILL.md +196 -0
  278. package/skills/literature/search/open-semantic-search-guide/SKILL.md +190 -0
  279. package/skills/literature/search/openaire-api/SKILL.md +141 -0
  280. package/skills/literature/search/paper-search-mcp-guide/SKILL.md +107 -0
  281. package/skills/literature/search/papers-chat-guide/SKILL.md +194 -0
  282. package/skills/literature/search/pasa-paper-search-guide/SKILL.md +138 -0
  283. package/skills/literature/search/plos-open-access-api/SKILL.md +203 -0
  284. package/skills/literature/search/scielo-api/SKILL.md +182 -0
  285. package/skills/literature/search/share-research-api/SKILL.md +129 -0
  286. package/skills/literature/search/worldcat-search-api/SKILL.md +224 -0
  287. package/skills/research/automation/ai-scientist-v2-guide/SKILL.md +284 -0
  288. package/skills/research/automation/aim-experiment-guide/SKILL.md +234 -0
  289. package/skills/research/automation/claude-academic-workflow-guide/SKILL.md +202 -0
  290. package/skills/research/automation/coexist-ai-guide/SKILL.md +149 -0
  291. package/skills/research/automation/datagen-research-guide/SKILL.md +131 -0
  292. package/skills/research/automation/foam-agent-guide/SKILL.md +203 -0
  293. package/skills/research/automation/kedro-pipeline-guide/SKILL.md +216 -0
  294. package/skills/research/automation/mle-agent-guide/SKILL.md +139 -0
  295. package/skills/research/automation/paper-to-agent-guide/SKILL.md +116 -0
  296. package/skills/research/automation/rd-agent-guide/SKILL.md +246 -0
  297. package/skills/research/automation/research-paper-orchestrator/SKILL.md +254 -0
  298. package/skills/research/deep-research/academic-deep-research/SKILL.md +190 -0
  299. package/skills/research/deep-research/auto-deep-research-guide/SKILL.md +141 -0
  300. package/skills/research/deep-research/cognitive-kernel-guide/SKILL.md +200 -0
  301. package/skills/research/deep-research/corvus-research-guide/SKILL.md +132 -0
  302. package/skills/research/deep-research/deep-research-pro/SKILL.md +213 -0
  303. package/skills/research/deep-research/deep-research-work/SKILL.md +204 -0
  304. package/skills/research/deep-research/deep-searcher-guide/SKILL.md +253 -0
  305. package/skills/research/deep-research/gpt-researcher-guide/SKILL.md +191 -0
  306. package/skills/research/deep-research/in-depth-research-guide/SKILL.md +205 -0
  307. package/skills/research/deep-research/khoj-research-guide/SKILL.md +200 -0
  308. package/skills/research/deep-research/kosmos-scientist-guide/SKILL.md +185 -0
  309. package/skills/research/deep-research/llm-scientific-discovery-guide/SKILL.md +178 -0
  310. package/skills/research/deep-research/local-deep-research-guide/SKILL.md +253 -0
  311. package/skills/research/deep-research/open-researcher-guide/SKILL.md +138 -0
  312. package/skills/research/deep-research/tongyi-deep-research-guide/SKILL.md +217 -0
  313. package/skills/research/funding/eu-horizon-guide/SKILL.md +244 -0
  314. package/skills/research/funding/grant-budget-guide/SKILL.md +284 -0
  315. package/skills/research/funding/nih-reporter-api-guide/SKILL.md +166 -0
  316. package/skills/research/funding/nsf-award-api-guide/SKILL.md +133 -0
  317. package/skills/research/methodology/academic-mentor-guide/SKILL.md +169 -0
  318. package/skills/research/methodology/claude-scientific-guide/SKILL.md +122 -0
  319. package/skills/research/methodology/deep-innovator-guide/SKILL.md +242 -0
  320. package/skills/research/methodology/osf-api-guide/SKILL.md +165 -0
  321. package/skills/research/methodology/parsifal-slr-guide/SKILL.md +154 -0
  322. package/skills/research/methodology/research-paper-kb/SKILL.md +263 -0
  323. package/skills/research/methodology/research-pipeline-units-guide/SKILL.md +169 -0
  324. package/skills/research/methodology/research-town-guide/SKILL.md +263 -0
  325. package/skills/research/methodology/slr-automation-guide/SKILL.md +235 -0
  326. package/skills/research/paper-review/automated-review-guide/SKILL.md +281 -0
  327. package/skills/research/paper-review/latte-review-guide/SKILL.md +175 -0
  328. package/skills/research/paper-review/paper-compare-guide/SKILL.md +238 -0
  329. package/skills/research/paper-review/paper-critique-framework/SKILL.md +181 -0
  330. package/skills/research/paper-review/paper-digest-guide/SKILL.md +240 -0
  331. package/skills/research/paper-review/paper-research-assistant/SKILL.md +231 -0
  332. package/skills/research/paper-review/research-quality-filter/SKILL.md +261 -0
  333. package/skills/research/paper-review/review-response-guide/SKILL.md +275 -0
  334. package/skills/tools/code-exec/contextplus-mcp-guide/SKILL.md +110 -0
  335. package/skills/tools/code-exec/google-colab-guide/SKILL.md +276 -0
  336. package/skills/tools/code-exec/kaggle-api-guide/SKILL.md +216 -0
  337. package/skills/tools/code-exec/overleaf-cli-guide/SKILL.md +279 -0
  338. package/skills/tools/diagram/clawphd-guide/SKILL.md +149 -0
  339. package/skills/tools/diagram/code-flow-visualizer/SKILL.md +197 -0
  340. package/skills/tools/diagram/excalidraw-diagram-guide/SKILL.md +170 -0
  341. package/skills/tools/diagram/json-data-visualizer/SKILL.md +270 -0
  342. package/skills/tools/diagram/kroki-diagram-api/SKILL.md +198 -0
  343. package/skills/tools/diagram/mermaid-architect-guide/SKILL.md +219 -0
  344. package/skills/tools/diagram/scientific-graphical-abstract/SKILL.md +201 -0
  345. package/skills/tools/diagram/tldraw-whiteboard-guide/SKILL.md +397 -0
  346. package/skills/tools/document/docsgpt-guide/SKILL.md +130 -0
  347. package/skills/tools/document/large-document-reader/SKILL.md +202 -0
  348. package/skills/tools/document/md2pdf-xelatex/SKILL.md +212 -0
  349. package/skills/tools/document/openpaper-guide/SKILL.md +232 -0
  350. package/skills/tools/document/paper-parse-guide/SKILL.md +243 -0
  351. package/skills/tools/document/weknora-guide/SKILL.md +216 -0
  352. package/skills/tools/document/zotero-addon-market-guide/SKILL.md +108 -0
  353. package/skills/tools/document/zotero-night-theme-guide/SKILL.md +142 -0
  354. package/skills/tools/document/zotero-style-guide/SKILL.md +217 -0
  355. package/skills/tools/knowledge-graph/citation-network-builder/SKILL.md +244 -0
  356. package/skills/tools/knowledge-graph/concept-map-generator/SKILL.md +284 -0
  357. package/skills/tools/knowledge-graph/graphiti-guide/SKILL.md +219 -0
  358. package/skills/tools/knowledge-graph/mimir-memory-guide/SKILL.md +135 -0
  359. package/skills/tools/knowledge-graph/notero-zotero-notion-guide/SKILL.md +187 -0
  360. package/skills/tools/knowledge-graph/open-webui-tools-guide/SKILL.md +156 -0
  361. package/skills/tools/knowledge-graph/openspg-guide/SKILL.md +210 -0
  362. package/skills/tools/knowledge-graph/paperpile-notion-guide/SKILL.md +84 -0
  363. package/skills/tools/knowledge-graph/zotero-markdb-connect-guide/SKILL.md +162 -0
  364. package/skills/tools/ocr-translate/latex-translation-guide/SKILL.md +176 -0
  365. package/skills/tools/ocr-translate/math-equation-renderer/SKILL.md +198 -0
  366. package/skills/tools/ocr-translate/pdf-math-translate-guide/SKILL.md +141 -0
  367. package/skills/tools/ocr-translate/zotero-pdf-translate-guide/SKILL.md +95 -0
  368. package/skills/tools/ocr-translate/zotero-pdf2zh-guide/SKILL.md +143 -0
  369. package/skills/tools/scraping/dataset-finder-guide/SKILL.md +253 -0
  370. package/skills/tools/scraping/easy-spider-guide/SKILL.md +250 -0
  371. package/skills/tools/scraping/google-scholar-scraper/SKILL.md +255 -0
  372. package/skills/tools/scraping/repository-harvesting-guide/SKILL.md +310 -0
  373. package/skills/writing/citation/academic-citation-manager/SKILL.md +314 -0
  374. package/skills/writing/citation/academic-citation-manager-guide/SKILL.md +182 -0
  375. package/skills/writing/citation/citation-assistant-skill/SKILL.md +192 -0
  376. package/skills/writing/citation/jabref-reference-guide/SKILL.md +127 -0
  377. package/skills/writing/citation/jasminum-zotero-guide/SKILL.md +103 -0
  378. package/skills/writing/citation/mendeley-api/SKILL.md +231 -0
  379. package/skills/writing/citation/obsidian-citation-guide/SKILL.md +164 -0
  380. package/skills/writing/citation/obsidian-zotero-guide/SKILL.md +137 -0
  381. package/skills/writing/citation/onecite-reference-guide/SKILL.md +168 -0
  382. package/skills/writing/citation/papersgpt-zotero-guide/SKILL.md +132 -0
  383. package/skills/writing/citation/papis-cli-guide/SKILL.md +213 -0
  384. package/skills/writing/citation/zotero-better-bibtex-guide/SKILL.md +107 -0
  385. package/skills/writing/citation/zotero-better-notes-guide/SKILL.md +121 -0
  386. package/skills/writing/citation/zotero-gpt-guide/SKILL.md +111 -0
  387. package/skills/writing/citation/zotero-mcp-guide/SKILL.md +164 -0
  388. package/skills/writing/citation/zotero-mdnotes-guide/SKILL.md +162 -0
  389. package/skills/writing/citation/zotero-reference-guide/SKILL.md +139 -0
  390. package/skills/writing/citation/zotero-scholar-guide/SKILL.md +294 -0
  391. package/skills/writing/citation/zotfile-attachment-guide/SKILL.md +140 -0
  392. package/skills/writing/composition/ml-paper-writing/SKILL.md +163 -0
  393. package/skills/writing/composition/opendraft-thesis-guide/SKILL.md +200 -0
  394. package/skills/writing/composition/paper-debugger-guide/SKILL.md +143 -0
  395. package/skills/writing/composition/paperforge-guide/SKILL.md +205 -0
  396. package/skills/writing/composition/research-paper-writer/SKILL.md +226 -0
  397. package/skills/writing/composition/scientific-writing-resources/SKILL.md +151 -0
  398. package/skills/writing/composition/scientific-writing-wrapper/SKILL.md +153 -0
  399. package/skills/writing/latex/academic-writing-latex/SKILL.md +285 -0
  400. package/skills/writing/latex/latex-drawing-collection/SKILL.md +154 -0
  401. package/skills/writing/latex/latex-templates-collection/SKILL.md +159 -0
  402. package/skills/writing/latex/md-to-pdf-academic/SKILL.md +230 -0
  403. package/skills/writing/latex/tex-render-guide/SKILL.md +243 -0
  404. package/skills/writing/polish/academic-tone-guide/SKILL.md +209 -0
  405. package/skills/writing/polish/chinese-text-humanizer/SKILL.md +140 -0
  406. package/skills/writing/polish/conciseness-editing-guide/SKILL.md +225 -0
  407. package/skills/writing/polish/paper-polish-guide/SKILL.md +160 -0
  408. package/skills/writing/templates/arxiv-preprint-template/SKILL.md +184 -0
  409. package/skills/writing/templates/elegant-paper-template/SKILL.md +141 -0
  410. package/skills/writing/templates/graphical-abstract-guide/SKILL.md +183 -0
  411. package/skills/writing/templates/novathesis-guide/SKILL.md +152 -0
  412. package/skills/writing/templates/scientific-article-pdf/SKILL.md +261 -0
  413. package/skills/writing/templates/sjtuthesis-guide/SKILL.md +197 -0
  414. package/skills/writing/templates/thuthesis-guide/SKILL.md +181 -0
  415. package/skills/literature/fulltext/repository-harvesting-guide/SKILL.md +0 -207
@@ -0,0 +1,133 @@
1
+ ---
2
+ name: arxiv-batch-reporting
3
+ description: "Batch search and report generation from arXiv preprint repository"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "📊"
7
+ category: "literature"
8
+ subcategory: "search"
9
+ keywords: ["arxiv", "batch search", "preprint", "report generation", "literature monitoring", "research trends"]
10
+ source: "https://github.com/sspaeti/arxiv-batch-search"
11
+ ---
12
+
13
+ # arXiv Batch Reporting
14
+
15
+ ## Overview
16
+
17
+ Keeping up with the flood of new preprints on arXiv is one of the most persistent challenges in fast-moving fields like machine learning, physics, mathematics, and computer science. The arXiv Batch Reporting skill provides a systematic approach to searching, filtering, and generating structured reports from arXiv at scale.
18
+
19
+ Unlike ad-hoc manual searches, this skill enables researchers to define persistent query profiles, run batch searches across date ranges, and produce formatted reports that highlight the most relevant papers. It is particularly useful for weekly or monthly literature surveillance, lab meeting preparation, and trend analysis across subfields.
20
+
21
+ The skill leverages the arXiv API and supports advanced query syntax, category filtering, and result ranking by relevance or recency. Reports can be generated in Markdown, HTML, or CSV formats for integration into existing workflows.
22
+
23
+ ## Setting Up Batch Queries
24
+
25
+ ### Query Profile Definition
26
+
27
+ Define your search profiles as structured configurations. Each profile specifies the search terms, category filters, date range, and output preferences:
28
+
29
+ ```yaml
30
+ profile_name: "transformer-architectures-weekly"
31
+ queries:
32
+ - "ti:transformer AND abs:attention mechanism"
33
+ - "ti:vision transformer"
34
+ - "abs:efficient transformer AND cat:cs.LG"
35
+ categories:
36
+ - cs.LG
37
+ - cs.CL
38
+ - cs.CV
39
+ date_range: "last_7_days"
40
+ max_results: 100
41
+ sort_by: "submittedDate"
42
+ sort_order: "descending"
43
+ ```
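A profile like this still has to be turned into concrete `search_query` strings. The Python sketch below shows one way to do that. The `expand_profile` helper and the dictionary layout are hypothetical and are not taken from the referenced tool. The `submittedDate:[... TO ...]` range filter is written as described in the arXiv API user manual (GMT timestamps) and should be checked against the current documentation. Only the `last_N_days` form of `date_range` is handled.

```python
from datetime import datetime, timedelta, timezone
from typing import Dict, List, Optional


def expand_profile(profile: Dict, now: Optional[datetime] = None) -> List[str]:
    """Combine each profile query with the category and date filters."""
    now = now or datetime.now(timezone.utc)
    clauses = []

    # OR the categories together so a paper in any listed category matches.
    categories = profile.get("categories", [])
    if categories:
        clauses.append("(" + " OR ".join(f"cat:{c}" for c in categories) + ")")

    # Assumption: date_range looks like "last_7_days"; other forms are ignored.
    date_range = profile.get("date_range", "")
    if date_range.startswith("last_") and date_range.endswith("_days"):
        days = int(date_range[len("last_"):-len("_days")])
        start = now - timedelta(days=days)
        fmt = "%Y%m%d%H%M"
        clauses.append(
            f"submittedDate:[{start.strftime(fmt)} TO {now.strftime(fmt)}]"
        )

    return [" AND ".join([f"({q})"] + clauses) for q in profile["queries"]]


profile = {
    "queries": ["ti:transformer AND abs:attention mechanism", "ti:vision transformer"],
    "categories": ["cs.LG", "cs.CL"],
    "date_range": "last_7_days",
}
for query in expand_profile(profile):
    print(query)
```

Wrapping each query in parentheses keeps its own `AND`/`OR` operators from interacting with the appended filters.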
44
+
45
+ ### arXiv API Query Syntax
46
+
47
+ The arXiv API supports field-specific searches:
48
+
49
+ - `ti:` — Search in title
50
+ - `abs:` — Search in abstract
51
+ - `au:` — Search by author
52
+ - `cat:` — Filter by category (e.g., `cs.AI`, `math.PR`, `physics.comp-ph`)
53
+ - Boolean operators: `AND`, `OR`, `ANDNOT`
54
+ - Group with parentheses for complex queries
55
+
56
+ **Example queries:**
57
+ - Find GAN papers in computer vision (sort by `submittedDate` to see recent ones first): `abs:"generative adversarial" AND cat:cs.CV`
58
+ - Find a specific author's work: `au:bengio AND ti:"deep learning"`
59
+ - Exclude survey papers: `abs:"reinforcement learning" ANDNOT ti:survey`
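Queries containing spaces, quotes, or colons must be URL-encoded before they are sent. The sketch below builds a request URL with the standard library only. `build_query_url` is a hypothetical helper name. The endpoint and the `search_query`, `start`, `max_results`, `sortBy`, and `sortOrder` parameters are the ones documented for the arXiv API.

```python
from urllib.parse import urlencode

API_URL = "http://export.arxiv.org/api/query"


def build_query_url(search_query, start=0, max_results=100,
                    sort_by="submittedDate", sort_order="descending"):
    """Return a fully encoded arXiv API query URL."""
    params = {
        "search_query": search_query,
        "start": start,
        "max_results": max_results,
        "sortBy": sort_by,
        "sortOrder": sort_order,
    }
    # urlencode percent-encodes quotes and colons and turns spaces into "+".
    return f"{API_URL}?{urlencode(params)}"


print(build_query_url('abs:"reinforcement learning" ANDNOT ti:survey', max_results=25))
```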
60
+
61
+ ### Rate Limiting and Pagination
62
+
63
+ The arXiv API enforces rate limits. Follow these guidelines:
64
+
65
+ - Wait at least 3 seconds between API requests
66
+ - Use pagination with `start` and `max_results` parameters (max 2000 per request)
67
+ - For large batch jobs, implement exponential backoff on HTTP 503 responses
68
+ - Cache results locally to avoid redundant API calls
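The pagination and backoff guidelines can be sketched as two small helpers. Both names are hypothetical. `fetch_with_backoff` takes the actual request as a callable so the retry logic stays independent of the HTTP library. As a simplification, it retries on any `OSError` (which includes urllib's `HTTPError` and `URLError`), which is broader than HTTP 503 alone.

```python
import time
from typing import Callable, Iterator, Tuple


def page_windows(total: int, page_size: int = 2000) -> Iterator[Tuple[int, int]]:
    """Yield (start, max_results) pairs that together cover `total` results."""
    for start in range(0, total, page_size):
        yield start, min(page_size, total - start)


def fetch_with_backoff(fetch: Callable[[], str], retries: int = 4,
                       base_delay: float = 3.0,
                       sleep: Callable[[float], None] = time.sleep) -> str:
    """Call `fetch`, doubling the wait after each failed attempt."""
    for attempt in range(retries + 1):
        try:
            return fetch()
        except OSError:
            if attempt == retries:
                raise  # out of retries: surface the last error
            sleep(base_delay * 2 ** attempt)  # 3 s, 6 s, 12 s, ...
    raise RuntimeError("unreachable")


print(list(page_windows(4500)))
```

Passing `sleep` as a parameter makes the function easy to exercise in tests without actually waiting.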
69
+
70
+ ## Report Generation
71
+
72
+ ### Standard Report Template
73
+
74
+ After collecting batch results, generate a report with the following structure:
75
+
76
+ ```markdown
77
+ # arXiv Batch Report: [Profile Name]
78
+ **Date range:** [start] to [end]
79
+ **Total results:** [N] papers
80
+ **Generated:** [timestamp]
81
+
82
+ ## Highlights (Top 10 by Relevance)
83
+ | # | Title | Authors | Category | Date |
84
+ |---|-------|---------|----------|------|
85
+ | 1 | [Title](arxiv-link) | First Author et al. | cs.LG | 2026-03-08 |
86
+
87
+ ## Category Breakdown
88
+ - cs.LG: 45 papers
89
+ - cs.CL: 23 papers
90
+ - cs.CV: 18 papers
91
+
92
+ ## Keyword Frequency
93
+ - "transformer": 38 mentions
94
+ - "attention": 29 mentions
95
+ - "efficient": 15 mentions
96
+
97
+ ## Full Results
98
+ [Expandable table with all papers]
99
+ ```
100
+
101
+ ### Filtering and Ranking
102
+
103
+ After retrieving raw results, apply post-processing filters to surface the most relevant papers:
104
+
105
+ 1. **Relevance scoring**: Score each paper based on keyword density in the title and abstract relative to your query terms.
106
+ 2. **Author filtering**: Boost papers from authors on your watch list (key researchers in your field).
107
+ 3. **Query overlap (citation proxy)**: Papers returned by more than one of your queries likely sit at the intersection of your interests, so rank them higher.
108
+ 4. **Novelty detection**: Flag papers whose abstracts contain terms not seen in your previous reports, indicating potentially new directions.
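The first three heuristics above can be sketched as a single scoring function. This is a minimal illustration, not the referenced tool's method. The weights, the `query_hits` field, and the paper dictionary layout are assumptions chosen for the example, and novelty detection is left out.

```python
import re
from typing import Dict, Iterable, List


def keyword_density(text: str, terms: Iterable[str]) -> float:
    """Fraction of words in `text` that are query terms (case-insensitive)."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if not words:
        return 0.0
    wanted = {t.lower() for t in terms}
    return sum(w in wanted for w in words) / len(words)


def rank_papers(papers: List[Dict], terms: Iterable[str],
                watch_list: Iterable[str] = (), title_weight: float = 2.0,
                author_boost: float = 0.1, overlap_boost: float = 0.05) -> List[Dict]:
    """Return papers sorted by a combined relevance score, best first."""
    terms = list(terms)
    watched = {a.lower() for a in watch_list}

    def score(paper: Dict) -> float:
        s = title_weight * keyword_density(paper["title"], terms)
        s += keyword_density(paper["abstract"], terms)
        if watched & {a.lower() for a in paper.get("authors", [])}:
            s += author_boost  # author on the watch list
        # query_hits: number of profile queries that returned this paper
        s += overlap_boost * max(0, paper.get("query_hits", 1) - 1)
        return s

    return sorted(papers, key=score, reverse=True)
```

Title matches are weighted more heavily than abstract matches here because titles are short, so a single hit is a stronger signal.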
109
+
110
+ ## Automation and Scheduling
111
+
112
+ For ongoing literature surveillance, automate your batch reports:
113
+
114
+ - **Cron scheduling**: Run batch queries weekly (e.g., every Monday at 8 AM) using a scheduled task or CI pipeline.
115
+ - **Diff reports**: Compare the current week's results against the previous week to highlight only new papers.
116
+ - **Alert thresholds**: Set alerts when a report contains more than N papers matching a high-priority query, indicating a burst of activity in that area.
117
+ - **Email or Slack delivery**: Route generated reports to your inbox or lab Slack channel for team-wide awareness.
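Diff reports and alert thresholds reduce to set operations on paper IDs. In the sketch below, the helper names and the `{"id": ...}` record layout are hypothetical. Version suffixes such as `v2` are stripped so that a revised paper is not reported as new.

```python
import re
from typing import Dict, List


def base_id(arxiv_id: str) -> str:
    """Strip a trailing version suffix: '1706.03762v5' -> '1706.03762'."""
    return re.sub(r"v\d+$", "", arxiv_id)


def new_since(previous: List[Dict], current: List[Dict]) -> List[Dict]:
    """Return papers in `current` whose base ID is absent from `previous`."""
    seen = {base_id(p["id"]) for p in previous}
    return [p for p in current if base_id(p["id"]) not in seen]


def burst_alert(papers: List[Dict], threshold: int) -> bool:
    """True when a report holds more than `threshold` papers."""
    return len(papers) > threshold
```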
118
+
119
+ Store all generated reports in a versioned directory structure for longitudinal trend analysis:
120
+
121
+ ```
122
+ reports/
123
+ transformer-architectures-weekly/
124
+ 2026-03-03.md
125
+ 2026-03-10.md
126
+ ...
127
+ ```
128
+
129
+ ## References
130
+
131
+ - arXiv API documentation: https://info.arxiv.org/help/api/index.html
132
+ - arXiv category taxonomy: https://arxiv.org/category_taxonomy
133
+ - arXiv Batch Search: https://github.com/sspaeti/arxiv-batch-search
@@ -0,0 +1,172 @@
1
+ ---
2
+ name: arxiv-cli-tools
3
+ description: "Command-line tools for searching and batch-downloading arXiv papers"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "🔍"
7
+ category: "literature"
8
+ subcategory: "search"
9
+ keywords: ["arxiv", "command line", "paper download", "preprint search", "batch download", "literature retrieval"]
10
+ source: "https://pypi.org/project/arxiv-cli-tools/"
11
+ ---
12
+
13
+ # arXiv CLI Tools
14
+
15
+ ## Overview
16
+
17
+ `arxiv-cli-tools` is a Python command-line interface for searching and downloading papers from arXiv.org. It wraps the `arxiv` Python client library into convenient CLI commands, enabling researchers to search by keyword, author, or category, view abstracts, and batch-download PDFs directly from the terminal. No API key is required.
18
+
19
+ ## Installation
20
+
21
+ ```bash
22
+ # Recommended: isolated install with pipx
23
+ pipx install arxiv-cli-tools
24
+
25
+ # Alternative: pip
26
+ pip install arxiv-cli-tools
27
+
28
+ # Verify installation
29
+ arxiv-cli --help
30
+ ```
31
+
32
+ ## Searching Papers
33
+
34
+ ### Basic Search
35
+
36
+ ```bash
37
+ # Search by keyword (default: 10 results)
38
+ arxiv-cli search "transformer attention mechanism"
39
+
40
+ # Limit results
41
+ arxiv-cli search "quantum computing" -n 5
42
+
43
+ # Show abstracts in results
44
+ arxiv-cli search "prompt engineering" -n 5 --summary
45
+ ```
46
+
47
+ ### Filtered Search
48
+
49
+ ```bash
50
+ # Filter by author
51
+ arxiv-cli search "attention mechanism" --authors "Vaswani"
52
+
53
+ # Filter by arXiv category
54
+ arxiv-cli search "neural networks" --categories "cs.LG,cs.AI"
55
+
56
+ # Combine filters
57
+ arxiv-cli search "protein folding" --categories "q-bio" -n 20 --summary
58
+ ```
59
+
60
+ ### Common arXiv Categories
61
+
62
+ | Prefix | Field | Popular Subcategories |
63
+ |--------|-------|----------------------|
64
+ | `cs` | Computer Science | cs.AI, cs.CL, cs.CV, cs.LG, cs.SE |
65
+ | `math` | Mathematics | math.ST, math.OC, math.PR |
66
+ | `physics` | Physics | physics.comp-ph (separate physics archives: hep-th, cond-mat) |
67
+ | `stat` | Statistics | stat.ML, stat.ME, stat.TH |
68
+ | `q-bio` | Quantitative Biology | q-bio.BM, q-bio.GN |
69
+ | `q-fin` | Quantitative Finance | q-fin.ST, q-fin.PM |
70
+ | `econ` | Economics | econ.EM, econ.GN |
71
+ | `eess` | Electrical Engineering and Systems Science | eess.SP, eess.AS |
72
+
73
+ ## Downloading Papers
74
+
75
+ ### Single Paper
76
+
77
+ ```bash
78
+ # Download by arXiv ID
79
+ arxiv-cli download --id 1706.03762 --dest ~/papers
80
+
81
+ # Download PDF format explicitly
82
+ arxiv-cli download --id 2301.13688 --dest ~/papers --pdf
83
+ ```
84
+
85
+ ### Batch Download
86
+
87
+ ```bash
88
+ # Download multiple papers
89
+ arxiv-cli download --id 1706.03762 --id 2301.13688 --id 2303.08774 \
90
+ --dest ~/papers/transformers
91
+
92
+ # Skip already downloaded files
93
+ arxiv-cli download --id 1706.03762 --id 2301.13688 \
94
+ --dest ~/papers --skip-existing
95
+ ```
96
+
97
+ ### Download from Search Results
98
+
99
+ A common workflow is to search first, then download selected papers:
100
+
101
+ ```bash
102
+ # 1. Search and note IDs
103
+ arxiv-cli search "diffusion models survey" -n 10 --summary
104
+
105
+ # 2. Download the relevant ones
106
+ arxiv-cli download --id 2209.00796 --id 2206.00364 --dest ~/papers/diffusion
107
+ ```
108
+
109
+ ## Python API Alternative
110
+
111
+ For programmatic use, the underlying `arxiv` library provides a Python API:
112
+
113
+ ```python
114
+ import arxiv
115
+
116
+ # Search
117
+ search = arxiv.Search(
118
+ query="large language models",
119
+ max_results=10,
120
+ sort_by=arxiv.SortCriterion.SubmittedDate
121
+ )
122
+
123
+ for result in arxiv.Client().results(search):
124
+     print(f"{result.entry_id}: {result.title}")
125
+     print(f" Authors: {', '.join(a.name for a in result.authors)}")
126
+     print(f" Published: {result.published.date()}")
127
+     print(f" PDF: {result.pdf_url}")
128
+     print()
129
+
130
+ # Download the last result from the loop above (the ./papers directory must already exist)
131
+ result.download_pdf(dirpath="./papers", filename="paper.pdf")
132
+ ```
133
+
134
+ ## Workflow Integration
135
+
136
+ ### Daily Paper Check Script
137
+
138
+ ```bash
139
+ #!/bin/bash
140
+ # Check for new papers in your research area
141
+ DATE=$(date +%Y-%m-%d)
142
+ LOG="$HOME/papers/daily_${DATE}.txt"
143
+
144
+ echo "=== arXiv Papers for $DATE ===" > "$LOG"
145
+ arxiv-cli search "retrieval augmented generation" \
146
+ --categories "cs.CL,cs.AI" -n 20 --summary >> "$LOG"
147
+
148
+ echo "Paper digest saved to $LOG"
149
+ ```
150
+
151
+ ### Export to BibTeX
152
+
153
+ After finding relevant papers, retrieve BibTeX entries from the arXiv website's BibTeX export endpoint (this is separate from the query API):
154
+
155
+ ```bash
156
+ # Get BibTeX for a specific paper
157
+ curl -s "https://arxiv.org/bibtex/1706.03762"
158
+ ```
159
+
160
+ ## Rate Limits and Etiquette
161
+
162
+ - arXiv API allows **1 request per 3 seconds** for programmatic access
163
+ - For bulk downloads, add delays between requests
164
+ - The CLI tool respects rate limits by default
165
+ - See [arXiv API Terms of Use](https://info.arxiv.org/help/api/tou.html)
166
+
167
+ ## References
168
+
169
+ - [arxiv-cli-tools on PyPI](https://pypi.org/project/arxiv-cli-tools/)
170
+ - [arxiv Python Client](https://github.com/lukasschwab/arxiv.py)
171
+ - [arXiv API Documentation](https://info.arxiv.org/help/api/)
172
+ - [arXiv Category Taxonomy](https://arxiv.org/category_taxonomy)
@@ -0,0 +1,199 @@
1
+ ---
2
+ name: arxiv-osiris
3
+ description: "Search and download arXiv papers via Python and PowerShell scripts"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "🔍"
7
+ category: "literature"
8
+ subcategory: "search"
9
+ keywords: ["arxiv", "paper download", "preprint search", "python script", "powershell", "literature retrieval"]
10
+ source: "https://clawhub.com/kostaskyq/arxiv-osiris"
11
+ ---
12
+
13
+ # arXiv Osiris — Paper Search and Download Tool
14
+
15
+ ## Overview
16
+
17
+ arXiv Osiris provides cross-platform scripts (Python and PowerShell) for searching and downloading scientific papers from arXiv.org. It supports keyword search, category filtering, metadata retrieval, and direct PDF download. It is useful for researchers who prefer scripted automation over browser-based arXiv access, particularly for building local paper collections.
18
+
19
+ ## Installation
20
+
21
+ ```bash
22
+ # Install the arxiv Python client (required dependency)
23
+ pip install arxiv
24
+
25
+ # Clone the tool (if using from source)
26
+ git clone https://github.com/kostaskyq/arxiv-osiris.git
27
+ ```
28
+
29
+ ## Usage — Python API
30
+
31
+ ### Search for Papers
32
+
33
+ ```python
34
+ import arxiv
35
+
36
+ # Basic keyword search
37
+ search = arxiv.Search(
38
+ query="quantum computing error correction",
39
+ max_results=10,
40
+ sort_by=arxiv.SortCriterion.Relevance
41
+ )
42
+
43
+ client = arxiv.Client()
44
+ for result in client.results(search):
45
+     print(f"ID: {result.entry_id}")
46
+     print(f"Title: {result.title}")
47
+     print(f"Authors: {', '.join(a.name for a in result.authors)}")
48
+     print(f"Published: {result.published.strftime('%Y-%m-%d')}")
49
+     print(f"PDF: {result.pdf_url}")
50
+     print(f"Abstract: {result.summary[:200]}...")
51
+     print()
52
+ ```
53
+
54
+ ### Category-Filtered Search
55
+
56
+ ```python
57
+ # Search within specific categories
58
+ search = arxiv.Search(
59
+ query="cat:cs.CL AND transformer",
60
+ max_results=20,
61
+ sort_by=arxiv.SortCriterion.SubmittedDate
62
+ )
63
+
64
+ # Multiple categories
65
+ search = arxiv.Search(
66
+ query="(cat:cs.AI OR cat:cs.LG) AND reinforcement learning",
67
+ max_results=15
68
+ )
69
+ ```
70
+
71
+ ### Download Papers
72
+
73
+ ```python
74
+ import os
75
+
76
+ search = arxiv.Search(query="attention mechanism", max_results=5)
77
+ client = arxiv.Client()
78
+ download_dir = os.path.expanduser("~/papers/attention")
79
+ os.makedirs(download_dir, exist_ok=True)
80
+
81
+ for result in client.results(search):
82
+     # Download PDF
83
+     result.download_pdf(dirpath=download_dir)
84
+     print(f"Downloaded: {result.title}")
85
+
86
+     # Download source (LaTeX) if available
87
+     result.download_source(dirpath=download_dir)
88
+ ```
89
+
90
+ ## Usage — PowerShell Script
91
+
92
+ ### Search
93
+
94
+ ```powershell
95
+ # Basic search
96
+ .\arxiv.ps1 -Action search -Query "machine learning"
97
+
98
+ # With max results
99
+ .\arxiv.ps1 -Action search -Query "neural networks" -MaxResults 10
100
+
101
+ # Filter by category
102
+ .\arxiv.ps1 -Action search -Query "deep learning" -Categories "cs,stat"
103
+ ```
104
+
105
+ ### Download
106
+
107
+ ```powershell
108
+ # Download by arXiv ID
109
+ .\arxiv.ps1 -Action download -ArxivId "1706.03762"
110
+
111
+ # Download to specific directory
112
+ .\arxiv.ps1 -Action download -ArxivId "2301.13688" -OutputDir "C:\Papers"
113
+ ```
114
+
115
+ ## Advanced Queries
116
+
117
+ The arXiv API supports a rich query syntax:
118
+
119
+ | Operator | Meaning | Example |
120
+ |----------|---------|---------|
121
+ | `AND` | Both terms | `"deep learning" AND "drug discovery"` |
122
+ | `OR` | Either term | `"GAN" OR "diffusion model"` |
123
+ | `ANDNOT` | Exclude term | `"NLP" ANDNOT "translation"` |
124
+ | `au:` | Author | `au:"Hinton"` |
125
+ | `ti:` | Title contains | `ti:"attention"` |
126
+ | `abs:` | Abstract contains | `abs:"protein folding"` |
127
+ | `cat:` | Category | `cat:cs.CV` |
128
+
129
+ ### Complex Query Examples
130
+
131
+ ```python
132
+ # Papers by a specific author on a specific topic
133
+ search = arxiv.Search(query='au:"Yann LeCun" AND ti:"self-supervised"')
134
+
135
+ # Recent papers in two categories excluding surveys
136
+ search = arxiv.Search(
137
+ query='(cat:cs.CL OR cat:cs.AI) AND "large language model" ANDNOT ti:"survey"',
138
+ sort_by=arxiv.SortCriterion.SubmittedDate,
139
+ max_results=50
140
+ )
141
+ ```
142
+
143
+ ## Building a Local Paper Library
144
+
145
+ ```python
146
+ import arxiv
147
+ import json
148
+ import os
149
+ from datetime import datetime
150
+
151
+ def build_library(queries: dict, base_dir: str = "~/papers"):
152
+     """Build organized paper library from multiple search queries."""
153
+     base = os.path.expanduser(base_dir)
154
+     catalog = []
155
+     client = arxiv.Client()
156
+
157
+     for topic, query in queries.items():
158
+         topic_dir = os.path.join(base, topic)
159
+         os.makedirs(topic_dir, exist_ok=True)
160
+
161
+         search = arxiv.Search(query=query, max_results=20,
162
+                               sort_by=arxiv.SortCriterion.SubmittedDate)
163
+
164
+         for paper in client.results(search):
165
+             pdf_path = paper.download_pdf(dirpath=topic_dir)  # returns the path of the saved file
166
+             catalog.append({
167
+                 "id": paper.entry_id,
168
+                 "title": paper.title,
169
+                 "authors": [a.name for a in paper.authors],
170
+                 "published": paper.published.isoformat(),
171
+                 "topic": topic,
172
+                 "pdf_path": pdf_path  # record the real path; the default filename also includes the title
173
+             })
174
+
175
+     # Save catalog
176
+     with open(os.path.join(base, "catalog.json"), "w") as f:
177
+         json.dump(catalog, f, indent=2)
178
+     print(f"Library built: {len(catalog)} papers in {len(queries)} topics")
179
+
180
+ # Usage
181
+ build_library({
182
+ "rag": "cat:cs.CL AND retrieval augmented generation",
183
+ "agents": "cat:cs.AI AND (LLM agent OR tool use)",
184
+ "evaluation": "cat:cs.CL AND (benchmark OR evaluation) AND language model"
185
+ })
186
+ ```
187
+
188
+ ## Rate Limits
189
+
190
+ - arXiv API: **1 request per 3 seconds** for automated access
191
+ - The `arxiv` Python client handles rate limiting automatically
192
+ - For large-scale downloads, add explicit delays: `time.sleep(3)`
193
+ - Respect [arXiv API Terms of Use](https://info.arxiv.org/help/api/tou.html)
194
+
195
+ ## References
196
+
197
+ - [arxiv Python Client](https://github.com/lukasschwab/arxiv.py)
198
+ - [arXiv API User Manual](https://info.arxiv.org/help/api/user-manual.html)
199
+ - [arXiv Category Taxonomy](https://arxiv.org/category_taxonomy)
@@ -0,0 +1,141 @@
1
+ ---
2
+ name: arxiv-paper-processor
3
+ description: "Process and analyze arXiv papers systematically for research workflows"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "⚙️"
7
+ category: "literature"
8
+ subcategory: "search"
9
+ keywords: ["arxiv", "paper processing", "PDF parsing", "metadata extraction", "preprint analysis", "research pipeline"]
10
+ source: "https://github.com/tatsu-lab/gpt_paper_assistant"
11
+ ---
12
+
13
+ # arXiv Paper Processor
14
+
15
+ ## Overview
16
+
17
+ The arXiv Paper Processor skill provides a complete pipeline for downloading, parsing, and analyzing arXiv papers programmatically. While the arXiv API provides metadata, researchers often need to work with the full text—extracting sections, equations, figures, and references for deeper analysis.
18
+
19
+ This skill covers the entire processing chain: retrieving papers by ID or search query, downloading PDF and LaTeX source files, extracting structured content, and producing analysis-ready outputs. It is particularly valuable for researchers conducting large-scale literature analysis, building training datasets from academic text, or automating evidence extraction for systematic reviews.
20
+
21
+ The pipeline handles common challenges in academic PDF processing including multi-column layouts, mathematical notation, table extraction, and reference parsing. It integrates with tools like GROBID for PDF parsing and can work directly with arXiv LaTeX sources for higher-fidelity extraction.
22
+
23
+ ## Paper Retrieval and Download
24
+
25
+ ### Fetching by arXiv ID
26
+
27
+ The most reliable method is to fetch papers by their arXiv identifier:
28
+
29
+ ```python
30
+ import urllib.request
31
+ import feedparser
32
+
33
+ # Fetch metadata via Atom feed
34
+ arxiv_id = "2301.07041"
35
+ url = f"http://export.arxiv.org/api/query?id_list={arxiv_id}"
36
+ response = urllib.request.urlopen(url)
37
+ feed = feedparser.parse(response.read())
38
+
39
+ entry = feed.entries[0]
40
+ title = entry.title
41
+ abstract = entry.summary
42
+ authors = [a.name for a in entry.authors]
43
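+ pdf_url = next(l.href for l in entry.links if l.get("type") == "application/pdf") # select the PDF link by MIME type rather than by position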
44
+ ```
45
+
46
+ ### Downloading Source Files
47
+
48
+ arXiv stores LaTeX source files for most papers. These provide much richer structure than PDFs:
49
+
50
+ ```bash
51
+ # Download LaTeX source (typically a .tar.gz)
52
+ wget https://arxiv.org/e-print/2301.07041 -O paper_source.tar.gz
53
+ mkdir -p paper_source && tar -xzf paper_source.tar.gz -C paper_source/
54
+ ```
55
+
56
+ Source files contain the original `.tex` files, figures, bibliography files, and any custom style files. Parsing LaTeX directly gives you access to section structure, equations in their original notation, citation keys, and figure captions without the ambiguity of PDF extraction.
57
+
58
+ ### Batch Download Guidelines
59
+
60
+ When downloading multiple papers, respect arXiv's usage policies:
61
+
62
+ - Limit requests to 1 per 3 seconds for API calls
63
+ - Use the arXiv bulk data access (S3 or GCS) for large-scale processing (1000+ papers)
64
+ - Cache all downloaded files locally and check before re-downloading
65
+ - Include a descriptive User-Agent header in your HTTP requests
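The caching, delay, and User-Agent guidelines can be combined in one download helper. The sketch below is an illustration under stated assumptions: `fetch_eprint`, the cache file naming, and the `USER_AGENT` string are hypothetical. The `https://arxiv.org/e-print/<id>` URL is the source endpoint used elsewhere in this guide. The opener and sleep function are injectable so the logic can be tested offline.

```python
import time
import urllib.request
from pathlib import Path

# Hypothetical identifier; replace with your project name and contact address.
USER_AGENT = "my-lit-pipeline/0.1 (mailto:you@example.org)"


def fetch_eprint(arxiv_id, cache_dir, opener=urllib.request.urlopen,
                 delay=3.0, sleep=time.sleep):
    """Download the source archive for `arxiv_id` unless it is already cached."""
    cache_dir = Path(cache_dir)
    cache_dir.mkdir(parents=True, exist_ok=True)
    # Old-style IDs contain "/", which is not safe in a filename.
    target = cache_dir / f"{arxiv_id.replace('/', '_')}.tar.gz"
    if target.exists():
        return target  # cache hit: no request, no delay

    request = urllib.request.Request(
        f"https://arxiv.org/e-print/{arxiv_id}",
        headers={"User-Agent": USER_AGENT},
    )
    with opener(request) as response:
        target.write_bytes(response.read())
    sleep(delay)  # pause after each real request to stay under the rate limit
    return target


# Usage (performs a real network request):
# path = fetch_eprint("2301.07041", "~/.cache/arxiv-src")
```

Note that `Path` does not expand `~`; call `Path(...).expanduser()` or pass an absolute path if you use a home-relative cache directory.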
66
+
67
+ ## Content Extraction Pipeline
68
+
69
+ ### PDF Extraction with GROBID
70
+
71
+ For papers where only PDF is available, use GROBID (GeneRation Of BIbliographic Data) for structured extraction:
72
+
73
+ ```bash
74
+ # Run GROBID as a local service
75
+ docker run --rm -p 8070:8070 grobid/grobid:0.8.0
76
+
77
+ # Process a PDF
78
+ curl -X POST "http://localhost:8070/api/processFulltextDocument" \
79
+ -F "input=@paper.pdf" \
80
+ -F "consolidateHeader=1" \
81
+ -F "consolidateCitations=1" \
82
+ > paper_tei.xml
83
+ ```
84
+
85
+ GROBID outputs TEI-XML with structured sections including:
86
+ - Header metadata (title, authors, affiliations, abstract)
87
+ - Body text with section hierarchy
88
+ - Equations (as MathML or raw text)
89
+ - Figure and table references
90
+ - Parsed bibliography entries with DOIs where available
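Once GROBID has produced TEI-XML, the title and section text can be pulled out with the standard library. The sketch below assumes the usual TEI layout (a `titleStmt/title` in the header and `div` elements with a `head` and `p` children in the body). `summarize_tei` is a hypothetical helper, and real GROBID output may nest elements differently.

```python
import xml.etree.ElementTree as ET
from typing import Dict

TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}


def summarize_tei(tei_xml) -> Dict:
    """Extract the title and per-section text from a TEI document (str or bytes)."""
    root = ET.fromstring(tei_xml)
    title = root.find(".//tei:titleStmt/tei:title", TEI_NS)

    sections = []
    for div in root.iterfind(".//tei:body//tei:div", TEI_NS):
        head = div.find("tei:head", TEI_NS)
        # itertext() flattens inline markup such as <ref> inside paragraphs.
        paragraphs = ["".join(p.itertext()).strip() for p in div.findall("tei:p", TEI_NS)]
        sections.append({
            "heading": "".join(head.itertext()).strip() if head is not None else None,
            "text": " ".join(paragraphs),
        })

    return {
        "title": "".join(title.itertext()).strip() if title is not None else None,
        "sections": sections,
    }


# Usage with the file written by the curl command:
# from pathlib import Path
# summary = summarize_tei(Path("paper_tei.xml").read_bytes())
```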
91
+
92
+ ### LaTeX Source Parsing
93
+
94
+ When LaTeX source is available, parse it directly for higher fidelity:
95
+
96
+ 1. Identify the main `.tex` file (look for `\documentclass` or `\begin{document}`)
97
+ 2. Resolve `\input{}` and `\include{}` directives to build the complete document
98
+ 3. Extract sections using `\section{}`, `\subsection{}` markers
99
+ 4. Extract equations from `equation`, `align`, `gather` environments
100
+ 5. Parse `\cite{}` commands and cross-reference with the `.bib` file
101
+ 6. Extract figure captions from `\caption{}` commands
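Several of the steps above (finding the main file, then pulling out sections, citation keys, and captions) can be approximated with regular expressions. This is a rough sketch with hypothetical helper names. It does not resolve `\input{}`, strip comments, or handle nested braces inside headings and captions, so a production pipeline would need a proper LaTeX parser.

```python
import re
from typing import Dict, List, Optional

SECTION_RE = re.compile(r"\\(section|subsection)\*?\{([^{}]*)\}")
# Matches \cite, \citep, \citet, ... with optional [..] arguments.
CITE_RE = re.compile(r"\\cite[a-zA-Z]*\*?(?:\[[^\]]*\])*\{([^{}]*)\}")
CAPTION_RE = re.compile(r"\\caption\{([^{}]*)\}")


def find_main_tex(files: Dict[str, str]) -> Optional[str]:
    """Return the first file name whose content declares a document class."""
    for name, text in files.items():
        if "\\documentclass" in text:
            return name
    return None


def parse_tex(tex: str) -> Dict[str, List]:
    """Extract section headings, unique citation keys, and figure captions."""
    sections = [
        {"heading": m.group(2).strip(), "level": 1 if m.group(1) == "section" else 2}
        for m in SECTION_RE.finditer(tex)
    ]
    keys: List[str] = []
    for m in CITE_RE.finditer(tex):
        for key in m.group(1).split(","):
            key = key.strip()
            if key and key not in keys:  # keep first-seen order, drop repeats
                keys.append(key)
    captions = [m.group(1).strip() for m in CAPTION_RE.finditer(tex)]
    return {"sections": sections, "citation_keys": keys, "captions": captions}
```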
102
+
103
+ ### Structured Output Schema
104
+
105
+ Produce a standardized JSON output for each processed paper:
106
+
107
+ ```json
108
+ {
109
+ "arxiv_id": "2301.07041",
110
+ "title": "Paper Title",
111
+ "authors": ["Author One", "Author Two"],
112
+ "abstract": "...",
113
+ "sections": [
114
+ {"heading": "Introduction", "level": 1, "text": "..."},
115
+ {"heading": "Related Work", "level": 1, "text": "..."}
116
+ ],
117
+ "equations": ["E = mc^2", "..."],
118
+ "figures": [{"id": "fig1", "caption": "..."}],
119
+ "references": [{"key": "smith2020", "title": "...", "doi": "..."}],
120
+ "processed_date": "2026-03-10"
121
+ }
122
+ ```
123
+
124
+ ## Analysis and Integration
125
+
126
+ Once papers are processed into structured format, several downstream analyses become possible:
127
+
128
+ - **Section-level search**: Search across the methods sections of hundreds of papers to find specific techniques.
129
+ - **Equation extraction**: Build a database of mathematical formulations used in your subfield.
130
+ - **Citation graph construction**: Map which papers cite which, using extracted reference lists.
131
+ - **Terminology tracking**: Monitor how specific terms evolve in usage frequency over time.
132
+ - **Dataset identification**: Extract mentions of datasets and benchmarks from experimental sections.
133
+
134
+ Integrate processed outputs with your reference manager by generating BibTeX entries enriched with extracted metadata, or feed structured JSON into a local search index for full-text retrieval across your paper collection.
135
+
136
+ ## References
137
+
138
+ - arXiv API: https://info.arxiv.org/help/api/index.html
139
+ - GROBID: https://github.com/kermitt2/grobid
140
+ - GPT Paper Assistant: https://github.com/tatsu-lab/gpt_paper_assistant
141
+ - arXiv bulk data access: https://info.arxiv.org/help/bulk_data/index.html