@wentorai/research-plugins 1.0.0 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (415) hide show
  1. package/README.md +22 -22
  2. package/curated/analysis/README.md +82 -56
  3. package/curated/domains/README.md +225 -69
  4. package/curated/literature/README.md +115 -46
  5. package/curated/research/README.md +106 -58
  6. package/curated/tools/README.md +107 -87
  7. package/curated/writing/README.md +92 -45
  8. package/mcp-configs/academic-db/alphafold-mcp.json +20 -0
  9. package/mcp-configs/academic-db/brightspace-mcp.json +21 -0
  10. package/mcp-configs/academic-db/climatiq-mcp.json +20 -0
  11. package/mcp-configs/academic-db/gibs-mcp.json +20 -0
  12. package/mcp-configs/academic-db/gis-mcp-server.json +22 -0
  13. package/mcp-configs/academic-db/google-earth-engine-mcp.json +21 -0
  14. package/mcp-configs/academic-db/m4-clinical-mcp.json +21 -0
  15. package/mcp-configs/academic-db/medical-mcp.json +21 -0
  16. package/mcp-configs/academic-db/nexonco-mcp.json +20 -0
  17. package/mcp-configs/academic-db/omop-mcp.json +20 -0
  18. package/mcp-configs/academic-db/onekgpd-mcp.json +20 -0
  19. package/mcp-configs/academic-db/openedu-mcp.json +20 -0
  20. package/mcp-configs/academic-db/opengenes-mcp.json +20 -0
  21. package/mcp-configs/academic-db/openstax-mcp.json +21 -0
  22. package/mcp-configs/academic-db/openstreetmap-mcp.json +21 -0
  23. package/mcp-configs/academic-db/opentargets-mcp.json +21 -0
  24. package/mcp-configs/academic-db/pdb-mcp.json +21 -0
  25. package/mcp-configs/academic-db/smithsonian-mcp.json +20 -0
  26. package/mcp-configs/ai-platform/magi-researchers.json +21 -0
  27. package/mcp-configs/ai-platform/mcp-academic-researcher.json +22 -0
  28. package/mcp-configs/ai-platform/open-paper-machine.json +21 -0
  29. package/mcp-configs/ai-platform/paper-intelligence.json +21 -0
  30. package/mcp-configs/ai-platform/paper-reader.json +21 -0
  31. package/mcp-configs/ai-platform/paperdebugger.json +21 -0
  32. package/mcp-configs/browser/exa-mcp.json +20 -0
  33. package/mcp-configs/browser/mcp-searxng.json +21 -0
  34. package/mcp-configs/browser/mcp-webresearch.json +20 -0
  35. package/mcp-configs/cloud-docs/confluence-mcp.json +37 -0
  36. package/mcp-configs/cloud-docs/google-drive-mcp.json +35 -0
  37. package/mcp-configs/cloud-docs/notion-mcp.json +29 -0
  38. package/mcp-configs/communication/discord-mcp.json +29 -0
  39. package/mcp-configs/communication/discourse-mcp.json +21 -0
  40. package/mcp-configs/communication/slack-mcp.json +29 -0
  41. package/mcp-configs/communication/telegram-mcp.json +28 -0
  42. package/mcp-configs/data-platform/automl-stat-mcp.json +21 -0
  43. package/mcp-configs/data-platform/jefferson-stats-mcp.json +22 -0
  44. package/mcp-configs/data-platform/mcp-excel-server.json +21 -0
  45. package/mcp-configs/data-platform/mcp-stata.json +21 -0
  46. package/mcp-configs/data-platform/mcpstack-jupyter.json +21 -0
  47. package/mcp-configs/data-platform/ml-mcp.json +21 -0
  48. package/mcp-configs/data-platform/nasdaq-data-link-mcp.json +20 -0
  49. package/mcp-configs/data-platform/numpy-mcp.json +21 -0
  50. package/mcp-configs/database/neo4j-mcp.json +37 -0
  51. package/mcp-configs/database/postgres-mcp.json +28 -0
  52. package/mcp-configs/database/sqlite-mcp.json +29 -0
  53. package/mcp-configs/dev-platform/geogebra-mcp.json +21 -0
  54. package/mcp-configs/dev-platform/github-mcp.json +31 -0
  55. package/mcp-configs/dev-platform/gitlab-mcp.json +34 -0
  56. package/mcp-configs/dev-platform/latex-mcp-server.json +21 -0
  57. package/mcp-configs/dev-platform/manim-mcp.json +20 -0
  58. package/mcp-configs/dev-platform/mcp-echarts.json +20 -0
  59. package/mcp-configs/dev-platform/panel-viz-mcp.json +20 -0
  60. package/mcp-configs/dev-platform/paperbanana.json +20 -0
  61. package/mcp-configs/dev-platform/texflow-mcp.json +20 -0
  62. package/mcp-configs/dev-platform/texmcp.json +20 -0
  63. package/mcp-configs/dev-platform/typst-mcp.json +21 -0
  64. package/mcp-configs/dev-platform/vizro-mcp.json +20 -0
  65. package/mcp-configs/email/email-mcp.json +40 -0
  66. package/mcp-configs/email/gmail-mcp.json +37 -0
  67. package/mcp-configs/note-knowledge/local-faiss-mcp.json +21 -0
  68. package/mcp-configs/note-knowledge/mcp-memory-service.json +21 -0
  69. package/mcp-configs/note-knowledge/mcp-obsidian.json +23 -0
  70. package/mcp-configs/note-knowledge/mcp-ragdocs.json +20 -0
  71. package/mcp-configs/note-knowledge/mcp-summarizer.json +21 -0
  72. package/mcp-configs/note-knowledge/mediawiki-mcp.json +21 -0
  73. package/mcp-configs/note-knowledge/openzim-mcp.json +20 -0
  74. package/mcp-configs/note-knowledge/zettelkasten-mcp.json +21 -0
  75. package/mcp-configs/reference-mgr/academic-paper-mcp-http.json +20 -0
  76. package/mcp-configs/reference-mgr/academix.json +20 -0
  77. package/mcp-configs/reference-mgr/arxiv-research-mcp.json +21 -0
  78. package/mcp-configs/reference-mgr/google-scholar-abstract-mcp.json +19 -0
  79. package/mcp-configs/reference-mgr/google-scholar-mcp.json +20 -0
  80. package/mcp-configs/reference-mgr/mcp-paperswithcode.json +21 -0
  81. package/mcp-configs/reference-mgr/mcp-scholarly.json +20 -0
  82. package/mcp-configs/reference-mgr/mcp-simple-arxiv.json +20 -0
  83. package/mcp-configs/reference-mgr/mcp-simple-pubmed.json +20 -0
  84. package/mcp-configs/reference-mgr/mcp-zotero.json +21 -0
  85. package/mcp-configs/reference-mgr/mendeley-mcp.json +20 -0
  86. package/mcp-configs/reference-mgr/ncbi-mcp-server.json +22 -0
  87. package/mcp-configs/reference-mgr/onecite.json +21 -0
  88. package/mcp-configs/reference-mgr/paper-search-mcp.json +21 -0
  89. package/mcp-configs/reference-mgr/pubmed-search-mcp.json +21 -0
  90. package/mcp-configs/reference-mgr/scholar-mcp.json +21 -0
  91. package/mcp-configs/reference-mgr/scholar-multi-mcp.json +21 -0
  92. package/mcp-configs/reference-mgr/seerai.json +21 -0
  93. package/mcp-configs/reference-mgr/semantic-scholar-fastmcp.json +21 -0
  94. package/mcp-configs/reference-mgr/sourcelibrary.json +20 -0
  95. package/mcp-configs/registry.json +178 -149
  96. package/mcp-configs/repository/dataverse-mcp.json +33 -0
  97. package/mcp-configs/repository/huggingface-mcp.json +29 -0
  98. package/openclaw.plugin.json +2 -2
  99. package/package.json +2 -2
  100. package/skills/analysis/dataviz/algorithm-visualizer-guide/SKILL.md +259 -0
  101. package/skills/analysis/dataviz/bokeh-visualization-guide/SKILL.md +270 -0
  102. package/skills/analysis/dataviz/chart-image-generator/SKILL.md +229 -0
  103. package/skills/analysis/dataviz/citation-map-guide/SKILL.md +184 -0
  104. package/skills/analysis/dataviz/d3-visualization-guide/SKILL.md +281 -0
  105. package/skills/analysis/dataviz/data-visualization-principles/SKILL.md +171 -0
  106. package/skills/analysis/dataviz/echarts-visualization-guide/SKILL.md +250 -0
  107. package/skills/analysis/dataviz/metabase-analytics-guide/SKILL.md +242 -0
  108. package/skills/analysis/dataviz/plotly-interactive-guide/SKILL.md +266 -0
  109. package/skills/analysis/dataviz/redash-analytics-guide/SKILL.md +284 -0
  110. package/skills/analysis/econometrics/econml-causal-guide/SKILL.md +163 -0
  111. package/skills/analysis/econometrics/empirical-paper-analysis/SKILL.md +192 -0
  112. package/skills/analysis/econometrics/mostly-harmless-guide/SKILL.md +139 -0
  113. package/skills/analysis/econometrics/panel-data-analyst/SKILL.md +259 -0
  114. package/skills/analysis/econometrics/panel-data-regression-workflow/SKILL.md +267 -0
  115. package/skills/analysis/econometrics/python-causality-guide/SKILL.md +134 -0
  116. package/skills/analysis/econometrics/stata-accounting-guide/SKILL.md +269 -0
  117. package/skills/analysis/econometrics/stata-analyst-guide/SKILL.md +245 -0
  118. package/skills/analysis/econometrics/stata-reference-guide/SKILL.md +293 -0
  119. package/skills/analysis/statistics/data-anomaly-detection/SKILL.md +157 -0
  120. package/skills/analysis/statistics/general-statistics-guide/SKILL.md +226 -0
  121. package/skills/analysis/statistics/infiagent-benchmark-guide/SKILL.md +106 -0
  122. package/skills/analysis/statistics/ml-experiment-tracker/SKILL.md +212 -0
  123. package/skills/analysis/statistics/pywayne-statistics-guide/SKILL.md +192 -0
  124. package/skills/analysis/statistics/quantitative-methods-guide/SKILL.md +193 -0
  125. package/skills/analysis/statistics/senior-data-scientist-guide/SKILL.md +223 -0
  126. package/skills/analysis/wrangling/claude-data-analysis-guide/SKILL.md +100 -0
  127. package/skills/analysis/wrangling/csv-data-analyzer/SKILL.md +170 -0
  128. package/skills/analysis/wrangling/data-cleaning-pipeline/SKILL.md +266 -0
  129. package/skills/analysis/wrangling/data-cog-guide/SKILL.md +178 -0
  130. package/skills/analysis/wrangling/open-data-scientist-guide/SKILL.md +197 -0
  131. package/skills/analysis/wrangling/stata-data-cleaning/SKILL.md +276 -0
  132. package/skills/analysis/wrangling/streamline-analyst-guide/SKILL.md +119 -0
  133. package/skills/analysis/wrangling/survey-data-processing/SKILL.md +298 -0
  134. package/skills/domains/ai-ml/ai-agent-papers-guide/SKILL.md +146 -0
  135. package/skills/domains/ai-ml/ai-model-benchmarking/SKILL.md +209 -0
  136. package/skills/domains/ai-ml/annotated-dl-papers-guide/SKILL.md +159 -0
  137. package/skills/domains/ai-ml/anomaly-detection-papers-guide/SKILL.md +167 -0
  138. package/skills/domains/ai-ml/autonomous-agents-papers-guide/SKILL.md +178 -0
  139. package/skills/domains/ai-ml/dl-transformer-finetune/SKILL.md +239 -0
  140. package/skills/domains/ai-ml/domain-adaptation-papers-guide/SKILL.md +173 -0
  141. package/skills/domains/ai-ml/generative-ai-guide/SKILL.md +146 -0
  142. package/skills/domains/ai-ml/graph-learning-papers-guide/SKILL.md +125 -0
  143. package/skills/domains/ai-ml/huggingface-inference-guide/SKILL.md +196 -0
  144. package/skills/domains/ai-ml/keras-deep-learning/SKILL.md +210 -0
  145. package/skills/domains/ai-ml/kolmogorov-arnold-networks-guide/SKILL.md +185 -0
  146. package/skills/domains/ai-ml/llm-from-scratch-guide/SKILL.md +124 -0
  147. package/skills/domains/ai-ml/ml-pipeline-guide/SKILL.md +295 -0
  148. package/skills/domains/ai-ml/nlp-toolkit-guide/SKILL.md +247 -0
  149. package/skills/domains/ai-ml/npcpy-research-guide/SKILL.md +137 -0
  150. package/skills/domains/ai-ml/pytorch-guide/SKILL.md +281 -0
  151. package/skills/domains/ai-ml/pytorch-lightning-guide/SKILL.md +244 -0
  152. package/skills/domains/ai-ml/responsible-ai-guide/SKILL.md +126 -0
  153. package/skills/domains/ai-ml/tensorflow-guide/SKILL.md +241 -0
  154. package/skills/domains/ai-ml/vmas-simulator-guide/SKILL.md +129 -0
  155. package/skills/domains/biomedical/bioagents-guide/SKILL.md +308 -0
  156. package/skills/domains/biomedical/clawbio-guide/SKILL.md +167 -0
  157. package/skills/domains/biomedical/clinical-dialogue-agents-guide/SKILL.md +145 -0
  158. package/skills/domains/biomedical/ena-sequence-api/SKILL.md +175 -0
  159. package/skills/domains/biomedical/genomas-guide/SKILL.md +126 -0
  160. package/skills/domains/biomedical/genotex-benchmark-guide/SKILL.md +125 -0
  161. package/skills/domains/biomedical/med-researcher-guide/SKILL.md +161 -0
  162. package/skills/domains/biomedical/med-researcher-r1-guide/SKILL.md +146 -0
  163. package/skills/domains/biomedical/medgeclaw-guide/SKILL.md +345 -0
  164. package/skills/domains/biomedical/medical-imaging-guide/SKILL.md +305 -0
  165. package/skills/domains/biomedical/ncbi-blast-api/SKILL.md +195 -0
  166. package/skills/domains/biomedical/ncbi-datasets-api/SKILL.md +220 -0
  167. package/skills/domains/biomedical/quickgo-api/SKILL.md +181 -0
  168. package/skills/domains/business/architecture-design-guide/SKILL.md +279 -0
  169. package/skills/domains/business/innovation-management-guide/SKILL.md +257 -0
  170. package/skills/domains/business/operations-research-guide/SKILL.md +258 -0
  171. package/skills/domains/business/xpert-bi-guide/SKILL.md +84 -0
  172. package/skills/domains/chemistry/cactus-cheminformatics-guide/SKILL.md +89 -0
  173. package/skills/domains/chemistry/chemeagle-guide/SKILL.md +147 -0
  174. package/skills/domains/chemistry/chemgraph-agent-guide/SKILL.md +120 -0
  175. package/skills/domains/chemistry/molecular-dynamics-guide/SKILL.md +237 -0
  176. package/skills/domains/chemistry/pubchem-api-guide/SKILL.md +180 -0
  177. package/skills/domains/chemistry/spectroscopy-analysis-guide/SKILL.md +290 -0
  178. package/skills/domains/cs/ai-security-papers-guide/SKILL.md +103 -0
  179. package/skills/domains/cs/code-llm-papers-guide/SKILL.md +131 -0
  180. package/skills/domains/cs/distributed-systems-guide/SKILL.md +268 -0
  181. package/skills/domains/cs/formal-verification-guide/SKILL.md +298 -0
  182. package/skills/domains/cs/gaussian-splatting-papers-guide/SKILL.md +158 -0
  183. package/skills/domains/cs/llm-aiops-guide/SKILL.md +70 -0
  184. package/skills/domains/cs/software-heritage-api/SKILL.md +200 -0
  185. package/skills/domains/ecology/species-distribution-guide/SKILL.md +343 -0
  186. package/skills/domains/economics/imf-data-api-guide/SKILL.md +174 -0
  187. package/skills/domains/economics/nber-working-papers-api/SKILL.md +177 -0
  188. package/skills/domains/economics/post-labor-economics/SKILL.md +254 -0
  189. package/skills/domains/economics/pricing-psychology-guide/SKILL.md +273 -0
  190. package/skills/domains/economics/repec-economics-api/SKILL.md +188 -0
  191. package/skills/domains/economics/world-bank-data-guide/SKILL.md +179 -0
  192. package/skills/domains/education/academic-study-methods/SKILL.md +228 -0
  193. package/skills/domains/education/assessment-design-guide/SKILL.md +213 -0
  194. package/skills/domains/education/educational-research-methods/SKILL.md +179 -0
  195. package/skills/domains/education/edumcp-guide/SKILL.md +74 -0
  196. package/skills/domains/education/mooc-analytics-guide/SKILL.md +206 -0
  197. package/skills/domains/education/open-syllabus-api/SKILL.md +171 -0
  198. package/skills/domains/finance/akshare-finance-data/SKILL.md +207 -0
  199. package/skills/domains/finance/finsight-research-guide/SKILL.md +113 -0
  200. package/skills/domains/finance/options-analytics-agent-guide/SKILL.md +117 -0
  201. package/skills/domains/finance/portfolio-optimization-guide/SKILL.md +279 -0
  202. package/skills/domains/finance/risk-modeling-guide/SKILL.md +260 -0
  203. package/skills/domains/finance/stata-accounting-research/SKILL.md +372 -0
  204. package/skills/domains/geoscience/climate-modeling-guide/SKILL.md +215 -0
  205. package/skills/domains/geoscience/pangaea-data-api/SKILL.md +197 -0
  206. package/skills/domains/geoscience/satellite-remote-sensing/SKILL.md +193 -0
  207. package/skills/domains/geoscience/seismology-data-guide/SKILL.md +208 -0
  208. package/skills/domains/humanities/digital-humanities-methods/SKILL.md +232 -0
  209. package/skills/domains/humanities/ethical-philosophy-guide/SKILL.md +244 -0
  210. package/skills/domains/humanities/history-research-guide/SKILL.md +260 -0
  211. package/skills/domains/humanities/political-history-guide/SKILL.md +241 -0
  212. package/skills/domains/law/caselaw-access-api/SKILL.md +149 -0
  213. package/skills/domains/law/legal-agent-skills-guide/SKILL.md +132 -0
  214. package/skills/domains/law/legal-nlp-guide/SKILL.md +236 -0
  215. package/skills/domains/law/legal-research-methods/SKILL.md +190 -0
  216. package/skills/domains/law/opencontracts-guide/SKILL.md +168 -0
  217. package/skills/domains/law/patent-analysis-guide/SKILL.md +257 -0
  218. package/skills/domains/law/regulatory-compliance-guide/SKILL.md +267 -0
  219. package/skills/domains/math/lean-theorem-proving-guide/SKILL.md +140 -0
  220. package/skills/domains/math/symbolic-computation-guide/SKILL.md +263 -0
  221. package/skills/domains/math/topology-data-analysis/SKILL.md +305 -0
  222. package/skills/domains/pharma/clinical-trial-design-guide/SKILL.md +271 -0
  223. package/skills/domains/pharma/drug-target-interaction/SKILL.md +242 -0
  224. package/skills/domains/pharma/madd-drug-discovery-guide/SKILL.md +153 -0
  225. package/skills/domains/pharma/pharmacovigilance-guide/SKILL.md +216 -0
  226. package/skills/domains/physics/astrophysics-data-guide/SKILL.md +305 -0
  227. package/skills/domains/physics/particle-physics-guide/SKILL.md +287 -0
  228. package/skills/domains/social-science/ipums-microdata-api/SKILL.md +211 -0
  229. package/skills/domains/social-science/network-analysis-guide/SKILL.md +310 -0
  230. package/skills/domains/social-science/psychology-research-guide/SKILL.md +270 -0
  231. package/skills/domains/social-science/sociology-research-guide/SKILL.md +238 -0
  232. package/skills/domains/social-science/sociology-research-methods/SKILL.md +181 -0
  233. package/skills/literature/discovery/arxiv-paper-monitoring/SKILL.md +233 -0
  234. package/skills/literature/discovery/paper-recommendation-guide/SKILL.md +120 -0
  235. package/skills/literature/discovery/papers-we-love-guide/SKILL.md +169 -0
  236. package/skills/literature/discovery/semantic-paper-radar/SKILL.md +144 -0
  237. package/skills/literature/discovery/zotero-arxiv-daily-guide/SKILL.md +94 -0
  238. package/skills/literature/fulltext/bioc-pmc-api/SKILL.md +146 -0
  239. package/skills/literature/fulltext/core-api-guide/SKILL.md +144 -0
  240. package/skills/literature/fulltext/dataverse-api/SKILL.md +215 -0
  241. package/skills/literature/fulltext/hal-archive-api/SKILL.md +218 -0
  242. package/skills/literature/fulltext/institutional-repository-guide/SKILL.md +212 -0
  243. package/skills/literature/fulltext/open-access-mining-guide/SKILL.md +341 -0
  244. package/skills/literature/fulltext/osf-api/SKILL.md +212 -0
  245. package/skills/literature/fulltext/pmc-ftp-bulk-download/SKILL.md +182 -0
  246. package/skills/literature/fulltext/zotero-ai-butler-guide/SKILL.md +166 -0
  247. package/skills/literature/fulltext/zotero-scihub-guide/SKILL.md +168 -0
  248. package/skills/literature/metadata/academic-paper-summarizer/SKILL.md +101 -0
  249. package/skills/literature/metadata/bibliometrix-guide/SKILL.md +164 -0
  250. package/skills/literature/metadata/crossref-event-data-api/SKILL.md +183 -0
  251. package/skills/literature/metadata/doi-content-negotiation/SKILL.md +202 -0
  252. package/skills/literature/metadata/orkg-api/SKILL.md +153 -0
  253. package/skills/literature/metadata/plumx-metrics-api/SKILL.md +188 -0
  254. package/skills/literature/metadata/ror-organization-api/SKILL.md +208 -0
  255. package/skills/literature/metadata/sophosia-reference-guide/SKILL.md +110 -0
  256. package/skills/literature/metadata/viaf-authority-api/SKILL.md +209 -0
  257. package/skills/literature/metadata/wikidata-api-guide/SKILL.md +156 -0
  258. package/skills/literature/metadata/zoplicate-dedup-guide/SKILL.md +147 -0
  259. package/skills/literature/metadata/zotero-actions-tags-guide/SKILL.md +212 -0
  260. package/skills/literature/metadata/zotmoov-guide/SKILL.md +120 -0
  261. package/skills/literature/metadata/zutilo-guide/SKILL.md +140 -0
  262. package/skills/literature/search/arxiv-batch-reporting/SKILL.md +133 -0
  263. package/skills/literature/search/arxiv-cli-tools/SKILL.md +172 -0
  264. package/skills/literature/search/arxiv-osiris/SKILL.md +199 -0
  265. package/skills/literature/search/arxiv-paper-processor/SKILL.md +141 -0
  266. package/skills/literature/search/baidu-scholar-guide/SKILL.md +110 -0
  267. package/skills/literature/search/base-academic-search/SKILL.md +196 -0
  268. package/skills/literature/search/chatpaper-guide/SKILL.md +122 -0
  269. package/skills/literature/search/citeseerx-api/SKILL.md +183 -0
  270. package/skills/literature/search/deep-literature-search/SKILL.md +149 -0
  271. package/skills/literature/search/deepgit-search-guide/SKILL.md +147 -0
  272. package/skills/literature/search/eric-education-api/SKILL.md +199 -0
  273. package/skills/literature/search/findpapers-guide/SKILL.md +177 -0
  274. package/skills/literature/search/ieee-xplore-api/SKILL.md +177 -0
  275. package/skills/literature/search/lens-scholarly-api/SKILL.md +211 -0
  276. package/skills/literature/search/multi-database-literature-search/SKILL.md +198 -0
  277. package/skills/literature/search/open-library-api/SKILL.md +196 -0
  278. package/skills/literature/search/open-semantic-search-guide/SKILL.md +190 -0
  279. package/skills/literature/search/openaire-api/SKILL.md +141 -0
  280. package/skills/literature/search/paper-search-mcp-guide/SKILL.md +107 -0
  281. package/skills/literature/search/papers-chat-guide/SKILL.md +194 -0
  282. package/skills/literature/search/pasa-paper-search-guide/SKILL.md +138 -0
  283. package/skills/literature/search/plos-open-access-api/SKILL.md +203 -0
  284. package/skills/literature/search/scielo-api/SKILL.md +182 -0
  285. package/skills/literature/search/share-research-api/SKILL.md +129 -0
  286. package/skills/literature/search/worldcat-search-api/SKILL.md +224 -0
  287. package/skills/research/automation/ai-scientist-v2-guide/SKILL.md +284 -0
  288. package/skills/research/automation/aim-experiment-guide/SKILL.md +234 -0
  289. package/skills/research/automation/claude-academic-workflow-guide/SKILL.md +202 -0
  290. package/skills/research/automation/coexist-ai-guide/SKILL.md +149 -0
  291. package/skills/research/automation/datagen-research-guide/SKILL.md +131 -0
  292. package/skills/research/automation/foam-agent-guide/SKILL.md +203 -0
  293. package/skills/research/automation/kedro-pipeline-guide/SKILL.md +216 -0
  294. package/skills/research/automation/mle-agent-guide/SKILL.md +139 -0
  295. package/skills/research/automation/paper-to-agent-guide/SKILL.md +116 -0
  296. package/skills/research/automation/rd-agent-guide/SKILL.md +246 -0
  297. package/skills/research/automation/research-paper-orchestrator/SKILL.md +254 -0
  298. package/skills/research/deep-research/academic-deep-research/SKILL.md +190 -0
  299. package/skills/research/deep-research/auto-deep-research-guide/SKILL.md +141 -0
  300. package/skills/research/deep-research/cognitive-kernel-guide/SKILL.md +200 -0
  301. package/skills/research/deep-research/corvus-research-guide/SKILL.md +132 -0
  302. package/skills/research/deep-research/deep-research-pro/SKILL.md +213 -0
  303. package/skills/research/deep-research/deep-research-work/SKILL.md +204 -0
  304. package/skills/research/deep-research/deep-searcher-guide/SKILL.md +253 -0
  305. package/skills/research/deep-research/gpt-researcher-guide/SKILL.md +191 -0
  306. package/skills/research/deep-research/in-depth-research-guide/SKILL.md +205 -0
  307. package/skills/research/deep-research/khoj-research-guide/SKILL.md +200 -0
  308. package/skills/research/deep-research/kosmos-scientist-guide/SKILL.md +185 -0
  309. package/skills/research/deep-research/llm-scientific-discovery-guide/SKILL.md +178 -0
  310. package/skills/research/deep-research/local-deep-research-guide/SKILL.md +253 -0
  311. package/skills/research/deep-research/open-researcher-guide/SKILL.md +138 -0
  312. package/skills/research/deep-research/tongyi-deep-research-guide/SKILL.md +217 -0
  313. package/skills/research/funding/eu-horizon-guide/SKILL.md +244 -0
  314. package/skills/research/funding/grant-budget-guide/SKILL.md +284 -0
  315. package/skills/research/funding/nih-reporter-api-guide/SKILL.md +166 -0
  316. package/skills/research/funding/nsf-award-api-guide/SKILL.md +133 -0
  317. package/skills/research/methodology/academic-mentor-guide/SKILL.md +169 -0
  318. package/skills/research/methodology/claude-scientific-guide/SKILL.md +122 -0
  319. package/skills/research/methodology/deep-innovator-guide/SKILL.md +242 -0
  320. package/skills/research/methodology/osf-api-guide/SKILL.md +165 -0
  321. package/skills/research/methodology/parsifal-slr-guide/SKILL.md +154 -0
  322. package/skills/research/methodology/research-paper-kb/SKILL.md +263 -0
  323. package/skills/research/methodology/research-pipeline-units-guide/SKILL.md +169 -0
  324. package/skills/research/methodology/research-town-guide/SKILL.md +263 -0
  325. package/skills/research/methodology/slr-automation-guide/SKILL.md +235 -0
  326. package/skills/research/paper-review/automated-review-guide/SKILL.md +281 -0
  327. package/skills/research/paper-review/latte-review-guide/SKILL.md +175 -0
  328. package/skills/research/paper-review/paper-compare-guide/SKILL.md +238 -0
  329. package/skills/research/paper-review/paper-critique-framework/SKILL.md +181 -0
  330. package/skills/research/paper-review/paper-digest-guide/SKILL.md +240 -0
  331. package/skills/research/paper-review/paper-research-assistant/SKILL.md +231 -0
  332. package/skills/research/paper-review/research-quality-filter/SKILL.md +261 -0
  333. package/skills/research/paper-review/review-response-guide/SKILL.md +275 -0
  334. package/skills/tools/code-exec/contextplus-mcp-guide/SKILL.md +110 -0
  335. package/skills/tools/code-exec/google-colab-guide/SKILL.md +276 -0
  336. package/skills/tools/code-exec/kaggle-api-guide/SKILL.md +216 -0
  337. package/skills/tools/code-exec/overleaf-cli-guide/SKILL.md +279 -0
  338. package/skills/tools/diagram/clawphd-guide/SKILL.md +149 -0
  339. package/skills/tools/diagram/code-flow-visualizer/SKILL.md +197 -0
  340. package/skills/tools/diagram/excalidraw-diagram-guide/SKILL.md +170 -0
  341. package/skills/tools/diagram/json-data-visualizer/SKILL.md +270 -0
  342. package/skills/tools/diagram/kroki-diagram-api/SKILL.md +198 -0
  343. package/skills/tools/diagram/mermaid-architect-guide/SKILL.md +219 -0
  344. package/skills/tools/diagram/scientific-graphical-abstract/SKILL.md +201 -0
  345. package/skills/tools/diagram/tldraw-whiteboard-guide/SKILL.md +397 -0
  346. package/skills/tools/document/docsgpt-guide/SKILL.md +130 -0
  347. package/skills/tools/document/large-document-reader/SKILL.md +202 -0
  348. package/skills/tools/document/md2pdf-xelatex/SKILL.md +212 -0
  349. package/skills/tools/document/openpaper-guide/SKILL.md +232 -0
  350. package/skills/tools/document/paper-parse-guide/SKILL.md +243 -0
  351. package/skills/tools/document/weknora-guide/SKILL.md +216 -0
  352. package/skills/tools/document/zotero-addon-market-guide/SKILL.md +108 -0
  353. package/skills/tools/document/zotero-night-theme-guide/SKILL.md +142 -0
  354. package/skills/tools/document/zotero-style-guide/SKILL.md +217 -0
  355. package/skills/tools/knowledge-graph/citation-network-builder/SKILL.md +244 -0
  356. package/skills/tools/knowledge-graph/concept-map-generator/SKILL.md +284 -0
  357. package/skills/tools/knowledge-graph/graphiti-guide/SKILL.md +219 -0
  358. package/skills/tools/knowledge-graph/mimir-memory-guide/SKILL.md +135 -0
  359. package/skills/tools/knowledge-graph/notero-zotero-notion-guide/SKILL.md +187 -0
  360. package/skills/tools/knowledge-graph/open-webui-tools-guide/SKILL.md +156 -0
  361. package/skills/tools/knowledge-graph/openspg-guide/SKILL.md +210 -0
  362. package/skills/tools/knowledge-graph/paperpile-notion-guide/SKILL.md +84 -0
  363. package/skills/tools/knowledge-graph/zotero-markdb-connect-guide/SKILL.md +162 -0
  364. package/skills/tools/ocr-translate/latex-translation-guide/SKILL.md +176 -0
  365. package/skills/tools/ocr-translate/math-equation-renderer/SKILL.md +198 -0
  366. package/skills/tools/ocr-translate/pdf-math-translate-guide/SKILL.md +141 -0
  367. package/skills/tools/ocr-translate/zotero-pdf-translate-guide/SKILL.md +95 -0
  368. package/skills/tools/ocr-translate/zotero-pdf2zh-guide/SKILL.md +143 -0
  369. package/skills/tools/scraping/dataset-finder-guide/SKILL.md +253 -0
  370. package/skills/tools/scraping/easy-spider-guide/SKILL.md +250 -0
  371. package/skills/tools/scraping/google-scholar-scraper/SKILL.md +255 -0
  372. package/skills/tools/scraping/repository-harvesting-guide/SKILL.md +310 -0
  373. package/skills/writing/citation/academic-citation-manager/SKILL.md +314 -0
  374. package/skills/writing/citation/academic-citation-manager-guide/SKILL.md +182 -0
  375. package/skills/writing/citation/citation-assistant-skill/SKILL.md +192 -0
  376. package/skills/writing/citation/jabref-reference-guide/SKILL.md +127 -0
  377. package/skills/writing/citation/jasminum-zotero-guide/SKILL.md +103 -0
  378. package/skills/writing/citation/mendeley-api/SKILL.md +231 -0
  379. package/skills/writing/citation/obsidian-citation-guide/SKILL.md +164 -0
  380. package/skills/writing/citation/obsidian-zotero-guide/SKILL.md +137 -0
  381. package/skills/writing/citation/onecite-reference-guide/SKILL.md +168 -0
  382. package/skills/writing/citation/papersgpt-zotero-guide/SKILL.md +132 -0
  383. package/skills/writing/citation/papis-cli-guide/SKILL.md +213 -0
  384. package/skills/writing/citation/zotero-better-bibtex-guide/SKILL.md +107 -0
  385. package/skills/writing/citation/zotero-better-notes-guide/SKILL.md +121 -0
  386. package/skills/writing/citation/zotero-gpt-guide/SKILL.md +111 -0
  387. package/skills/writing/citation/zotero-mcp-guide/SKILL.md +164 -0
  388. package/skills/writing/citation/zotero-mdnotes-guide/SKILL.md +162 -0
  389. package/skills/writing/citation/zotero-reference-guide/SKILL.md +139 -0
  390. package/skills/writing/citation/zotero-scholar-guide/SKILL.md +294 -0
  391. package/skills/writing/citation/zotfile-attachment-guide/SKILL.md +140 -0
  392. package/skills/writing/composition/ml-paper-writing/SKILL.md +163 -0
  393. package/skills/writing/composition/opendraft-thesis-guide/SKILL.md +200 -0
  394. package/skills/writing/composition/paper-debugger-guide/SKILL.md +143 -0
  395. package/skills/writing/composition/paperforge-guide/SKILL.md +205 -0
  396. package/skills/writing/composition/research-paper-writer/SKILL.md +226 -0
  397. package/skills/writing/composition/scientific-writing-resources/SKILL.md +151 -0
  398. package/skills/writing/composition/scientific-writing-wrapper/SKILL.md +153 -0
  399. package/skills/writing/latex/academic-writing-latex/SKILL.md +285 -0
  400. package/skills/writing/latex/latex-drawing-collection/SKILL.md +154 -0
  401. package/skills/writing/latex/latex-templates-collection/SKILL.md +159 -0
  402. package/skills/writing/latex/md-to-pdf-academic/SKILL.md +230 -0
  403. package/skills/writing/latex/tex-render-guide/SKILL.md +243 -0
  404. package/skills/writing/polish/academic-tone-guide/SKILL.md +209 -0
  405. package/skills/writing/polish/chinese-text-humanizer/SKILL.md +140 -0
  406. package/skills/writing/polish/conciseness-editing-guide/SKILL.md +225 -0
  407. package/skills/writing/polish/paper-polish-guide/SKILL.md +160 -0
  408. package/skills/writing/templates/arxiv-preprint-template/SKILL.md +184 -0
  409. package/skills/writing/templates/elegant-paper-template/SKILL.md +141 -0
  410. package/skills/writing/templates/graphical-abstract-guide/SKILL.md +183 -0
  411. package/skills/writing/templates/novathesis-guide/SKILL.md +152 -0
  412. package/skills/writing/templates/scientific-article-pdf/SKILL.md +261 -0
  413. package/skills/writing/templates/sjtuthesis-guide/SKILL.md +197 -0
  414. package/skills/writing/templates/thuthesis-guide/SKILL.md +181 -0
  415. package/skills/literature/fulltext/repository-harvesting-guide/SKILL.md +0 -207
@@ -0,0 +1,204 @@
1
+ ---
2
+ name: deep-research-work
3
+ description: "Combine web search, content analysis, and source verification"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "🌐"
7
+ category: "research"
8
+ subcategory: "deep-research"
9
+ keywords: ["web research", "content analysis", "source verification", "information extraction", "research workflow", "web scraping"]
10
+ source: "https://github.com/AcademicSkills/deep-research-work"
11
+ ---
12
+
13
+ # Deep Research Work
14
+
15
+ A practical deep research workflow that combines web search, structured content analysis, and systematic source verification to produce comprehensive, trustworthy research outputs. Optimized for researchers who need to rapidly synthesize information from diverse online sources while maintaining academic standards of evidence quality.
16
+
17
+ ## Overview
18
+
19
+ Academic research increasingly requires synthesizing information from beyond traditional journal databases: government datasets, technical documentation, industry reports, software repositories, news coverage, and expert commentary. Deep Research Work provides an operational workflow for conducting this kind of multi-source research efficiently while maintaining rigor. It covers search strategy formulation, content extraction, structured note-taking, source credibility assessment, and synthesis into coherent research narratives.
20
+
21
+ The workflow is designed to be completed in a single focused session (2-8 hours) and produces a structured research document with full source attribution. It is particularly useful for rapid literature scans, technology landscape assessments, policy research, and interdisciplinary investigations where no single database covers the full scope.
22
+
23
+ ## Search Strategy Design
24
+
25
+ ### Query Expansion Technique
26
+
27
+ ```python
28
+ def generate_search_queries(topic: str, context: dict) -> list:
29
+ """
30
+ Generate a comprehensive set of search queries using
31
+ systematic query expansion.
32
+
33
+ Expansion strategies:
34
+ 1. Synonym expansion: use alternative terminology
35
+ 2. Scope expansion: broaden/narrow the topic
36
+ 3. Perspective expansion: different stakeholder views
37
+ 4. Temporal expansion: historical and forward-looking
38
+ 5. Geographic expansion: regional variations
39
+ """
40
+ base_queries = [topic]
41
+
42
+ # Synonym expansion
43
+ synonyms = context.get('synonyms', [])
44
+ for syn in synonyms:
45
+ base_queries.append(syn)
46
+
47
+ # Scope expansion
48
+ broader = context.get('broader_topic', '')
49
+ narrower = context.get('sub_topics', [])
50
+ if broader:
51
+ base_queries.append(f"{broader} {topic}")
52
+ for sub in narrower:
53
+ base_queries.append(f"{topic} {sub}")
54
+
55
+ # Perspective expansion
56
+ perspectives = ['benefits', 'risks', 'challenges', 'future',
57
+ 'criticism', 'comparison', 'case study']
58
+ for p in perspectives:
59
+ base_queries.append(f"{topic} {p}")
60
+
61
+ # Source-type targeting
62
+ source_types = ['systematic review', 'meta-analysis', 'white paper',
63
+ 'technical report', 'dataset', 'open source']
64
+ for st in source_types:
65
+ base_queries.append(f"{topic} {st}")
66
+
67
+ return list(set(base_queries))
68
+ ```
69
+
70
+ ### Database Selection Matrix
71
+
72
+ | Source Type | Best Databases | When to Use |
73
+ |------------|---------------|-------------|
74
+ | Peer-reviewed articles | Google Scholar, Semantic Scholar, PubMed | Core academic evidence |
75
+ | Preprints | arXiv, bioRxiv, SSRN, medRxiv | Cutting-edge, pre-review findings |
76
+ | Government/institutional | Data.gov, WHO, OECD, national statistics | Official data, policy context |
77
+ | Technical documentation | GitHub, ReadTheDocs, official docs | Software, tools, methods |
78
+ | Industry reports | McKinsey, Gartner, CB Insights | Market context, trends |
79
+ | Patent databases | Google Patents, USPTO, Espacenet | Innovation landscape |
80
+ | News and media | Google News, specialized trade press | Current events, context |
81
+
82
+ ## Content Extraction and Note-Taking
83
+
84
+ ### Structured Extraction Template
85
+
86
+ For each source reviewed, extract the following:
87
+
88
+ ```yaml
89
+ source_entry:
90
+ id: "S001"
91
+ url: "https://..."
92
+ title: "Title of the Source"
93
+ authors: ["Author A", "Author B"]
94
+ date: "2025-06"
95
+ type: "journal_article" # or preprint, report, blog, etc.
96
+
97
+ extraction:
98
+ main_claim: "One sentence summarizing the key claim or finding"
99
+ evidence_type: "empirical" # empirical, theoretical, anecdotal, opinion
100
+ methodology: "Randomized controlled trial, n=500"
101
+ key_data_points:
102
+ - "Finding 1: X increased by 23% (p < 0.01)"
103
+ - "Finding 2: No significant effect on Y"
104
+ limitations_noted: "Small sample from single institution"
105
+ relevant_quotes:
106
+ - page: 12
107
+ text: "Our results suggest that..."
108
+
109
+ assessment:
110
+ credibility: "high" # high, medium, low
111
+ relevance: "high" # high, medium, low
112
+ novelty: "medium" # high, medium, low
113
+ bias_concerns: "Funded by industry; potential conflict of interest"
114
+ ```
115
+
116
+ ### Progressive Summarization
117
+
118
+ Apply a layered note-taking approach:
119
+
120
+ 1. **Layer 1 - Capture**: Save the full source with metadata (URL, date, authors).
121
+ 2. **Layer 2 - Bold**: Highlight the most important passages (key findings, methods, conclusions).
122
+ 3. **Layer 3 - Highlight**: From the bolded text, mark the essential takeaways for your research question.
123
+ 4. **Layer 4 - Summary**: Write a 2-3 sentence summary in your own words.
124
+ 5. **Layer 5 - Remix**: Connect the finding to your other sources and your research question.
125
+
126
+ ## Source Verification Protocol
127
+
128
+ ### Credibility Assessment Checklist
129
+
130
+ For each source, evaluate:
131
+
132
+ - [ ] **Authority**: Who is the author/organization? What are their credentials?
133
+ - [ ] **Accuracy**: Are claims supported by evidence? Can you verify the data?
134
+ - [ ] **Currency**: When was it published? Is the information still valid?
135
+ - [ ] **Coverage**: Does it address your question sufficiently?
136
+ - [ ] **Objectivity**: Is there apparent bias? Who funded the work?
137
+ - [ ] **Corroboration**: Do other independent sources support the same claims?
138
+
139
+ ### Red Flags for Low-Quality Sources
140
+
141
+ | Red Flag | Action |
142
+ |----------|--------|
143
+ | No author attribution | Downgrade credibility; seek alternative source |
144
+ | No date published | Treat as potentially outdated |
145
+ | Extraordinary claims without evidence | Require independent corroboration |
146
+ | Known predatory journal | Exclude from primary evidence |
147
+ | Single anonymous blog post | Use only as lead to find primary sources |
148
+ | Circular citations | Trace back to the original source |
149
+
150
+ ## Synthesis Workflow
151
+
152
+ ### From Notes to Narrative
153
+
154
+ ```
155
+ Step 1: Cluster
156
+ Group extracted notes by theme or sub-question.
157
+ Use tags from your extraction template.
158
+
159
+ Step 2: Compare
160
+ Within each cluster, compare findings across sources.
161
+ Note agreements, contradictions, and gaps.
162
+
163
+ Step 3: Evaluate
164
+ Weight evidence by source credibility and recency.
165
+ Higher-quality sources take precedence when sources conflict.
166
+
167
+ Step 4: Narrate
168
+ Write a synthesis paragraph for each cluster that:
169
+ - States the overall finding
170
+ - Cites the supporting sources
171
+ - Notes any caveats or contradictions
172
+ - Identifies remaining gaps
173
+
174
+ Step 5: Integrate
175
+ Connect clusters into a coherent narrative.
176
+ Highlight cross-cutting themes and implications.
177
+ ```
178
+
179
+ ### Output Quality Checklist
180
+
181
+ Before finalizing your research output:
182
+
183
+ - [ ] Every factual claim has at least one source citation
184
+ - [ ] Contradictory evidence is explicitly acknowledged
185
+ - [ ] Source quality is visible (not all sources treated equally)
186
+ - [ ] Gaps in knowledge are clearly identified
187
+ - [ ] The search methodology is documented for reproducibility
188
+ - [ ] Dates of all searches are recorded
189
+ - [ ] The output answers the original research question
190
+
191
+ ## Best Practices
192
+
193
+ - Set a time limit before starting. Research can expand indefinitely without constraints.
194
+ - Use a reference manager (Zotero, Mendeley) from the start, even for informal research.
195
+ - Save web pages as PDF or archive snapshots (Wayback Machine) to prevent link rot.
196
+ - Distinguish between primary sources (original data/study) and secondary sources (reporting on the study).
197
+ - When a source cites a finding, always try to trace back to the original source.
198
+ - Document negative results: sources searched that did not yield relevant information.
199
+
200
+ ## References
201
+
202
+ - Booth, A., Sutton, A., & Papaioannou, D. (2016). *Systematic Approaches to a Successful Literature Review* (2nd ed.). Sage.
203
+ - Forte, T. (2022). *Building a Second Brain*. Atria Books.
204
+ - Machi, L. A. & McEvoy, B. T. (2016). *The Literature Review* (3rd ed.). Corwin Press.
@@ -0,0 +1,253 @@
1
+ ---
2
+ name: deep-searcher-guide
3
+ description: "Open deep research alternative for private data with vector search"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "🔍"
7
+ category: "research"
8
+ subcategory: "deep-research"
9
+ keywords: ["deep-search", "private-data", "milvus", "vector-search", "rag", "document-retrieval"]
10
+ source: "https://github.com/zilliztech/deep-searcher"
11
+ ---
12
+
13
+ # Deep Searcher Guide
14
+
15
+ ## Overview
16
+
17
+ Deep Searcher is an open-source deep research tool developed by Zilliz with over 8,000 GitHub stars, designed to be an open alternative to proprietary deep research systems like OpenAI's Deep Research and Gemini Deep Research. What distinguishes Deep Searcher is its focus on private data -- it enables researchers to conduct deep, iterative research over their own document collections, databases, and institutional knowledge bases rather than being limited to public web content.
18
+
19
+ The system combines vector search via Milvus (or other vector databases) with agentic RAG (Retrieval-Augmented Generation) to decompose complex research questions, retrieve relevant passages from your document collection, reason over the retrieved content, and iteratively refine its search until it can produce a comprehensive answer. This makes it particularly valuable for researchers who work with proprietary datasets, unpublished manuscripts, internal reports, or specialized domain corpora that are not available through web search.
20
+
21
+ Deep Searcher supports multiple LLM providers and embedding models, and can be deployed entirely on-premises for organizations with strict data privacy requirements. It is built on top of Milvus, the high-performance open-source vector database also created by Zilliz, ensuring scalable and efficient similarity search across large document collections.
22
+
23
+ ## Installation and Setup
24
+
25
+ ```bash
26
+ # Install Deep Searcher
27
+ pip install deepsearcher
28
+
29
+ # Or clone for development
30
+ git clone https://github.com/zilliztech/deep-searcher.git
31
+ cd deep-searcher
32
+ pip install -e .
33
+ ```
34
+
35
+ ### Dependencies Setup
36
+
37
+ Deep Searcher requires a vector database and LLM access:
38
+
39
+ ```bash
40
+ # Option 1: Milvus Lite (embedded, no separate server needed)
41
+ pip install pymilvus[model]
42
+
43
+ # Option 2: Full Milvus via Docker
44
+ docker run -d --name milvus \
45
+ -p 19530:19530 \
46
+ -p 9091:9091 \
47
+ milvusio/milvus:latest standalone
48
+
49
+ # Configure LLM access
50
+ export OPENAI_API_KEY=$OPENAI_API_KEY
51
+ # Or for local LLMs
52
+ export OLLAMA_BASE_URL=http://localhost:11434
53
+ ```
54
+
55
+ ### Configuration
56
+
57
+ Create a configuration file for your research setup:
58
+
59
+ ```python
60
+ from deepsearcher import DeepSearcher
61
+ from deepsearcher.config import Config
62
+
63
+ config = Config(
64
+ # Vector database settings
65
+ vector_db="milvus_lite", # or "milvus", "zilliz_cloud"
66
+ collection_name="research_papers",
67
+
68
+ # LLM settings
69
+ llm_provider="openai",
70
+ llm_model="gpt-4o",
71
+
72
+ # Embedding settings
73
+ embedding_model="text-embedding-3-small",
74
+
75
+ # Research settings
76
+ max_iterations=10,
77
+ chunk_size=1000,
78
+ chunk_overlap=200,
79
+ )
80
+
81
+ searcher = DeepSearcher(config)
82
+ ```
83
+
84
+ ## Document Ingestion
85
+
86
+ ### Loading Research Documents
87
+
88
+ Ingest your research documents into the vector database for searchable access:
89
+
90
+ ```python
91
+ # Load individual files
92
+ searcher.load_document("path/to/paper.pdf")
93
+ searcher.load_document("path/to/notes.md")
94
+
95
+ # Load entire directories
96
+ searcher.load_directory(
97
+ "path/to/papers/",
98
+ file_types=["pdf", "md", "txt", "docx"],
99
+ recursive=True,
100
+ )
101
+
102
+ # Load with metadata for filtering
103
+ searcher.load_document(
104
+ "path/to/paper.pdf",
105
+ metadata={
106
+ "author": "Smith et al.",
107
+ "year": 2024,
108
+ "topic": "transformer efficiency",
109
+ "venue": "NeurIPS",
110
+ }
111
+ )
112
+ ```
113
+
114
+ ### Supported Document Types
115
+
116
+ Deep Searcher supports a wide range of document formats commonly used in academic research:
117
+
118
+ - **PDF**: Research papers, textbooks, reports (with OCR support for scanned documents)
119
+ - **Markdown**: Research notes, documentation, wikis
120
+ - **Plain text**: Data files, logs, transcripts
121
+ - **DOCX/DOC**: Word documents, manuscripts
122
+ - **HTML**: Web pages, saved articles
123
+ - **LaTeX**: TeX source files with equation extraction
124
+ - **Jupyter Notebooks**: Code and analysis notebooks
125
+
126
+ ## Deep Research Workflow
127
+
128
+ ### Basic Research Query
129
+
130
+ ```python
131
+ # Ask a research question over your document collection
132
+ result = searcher.research(
133
+ query="What methods have been proposed for reducing the "
134
+ "computational complexity of self-attention in transformers?",
135
+ )
136
+
137
+ print(result.answer)
138
+ print(f"Sources: {len(result.sources)}")
139
+ for source in result.sources:
140
+ print(f" - {source.document}: {source.chunk_preview[:100]}...")
141
+ ```
142
+
143
+ ### Iterative Research Process
144
+
145
+ Deep Searcher follows an iterative research pipeline:
146
+
147
+ 1. **Query decomposition**: The research question is broken into sub-queries
148
+ 2. **Initial retrieval**: Vector search retrieves relevant passages for each sub-query
149
+ 3. **Analysis**: The LLM analyzes retrieved content and identifies information gaps
150
+ 4. **Refined search**: New queries are generated to fill gaps, with the search refined based on what has been found
151
+ 5. **Synthesis**: All gathered information is synthesized into a comprehensive answer with citations
152
+
153
+ ```python
154
+ # Watch the iterative research process
155
+ result = searcher.research(
156
+ query="Compare the approaches to efficient attention in the papers "
157
+ "I have collected, focusing on trade-offs between speed and quality",
158
+ verbose=True, # Print each research iteration
159
+ max_iterations=8,
160
+ )
161
+
162
+ # Access the research trace
163
+ for step in result.trace:
164
+ print(f"Iteration {step.iteration}:")
165
+ print(f" Sub-query: {step.query}")
166
+ print(f" Documents found: {step.num_results}")
167
+ print(f" Gap identified: {step.gap}")
168
+ ```
169
+
170
+ ### Filtered Research
171
+
172
+ Narrow your research to specific subsets of your collection:
173
+
174
+ ```python
175
+ # Research only within papers from a specific venue
176
+ result = searcher.research(
177
+ query="Novel loss functions for contrastive learning",
178
+ filters={"venue": "ICML", "year": {"$gte": 2023}},
179
+ )
180
+
181
+ # Research across specific document groups
182
+ result = searcher.research(
183
+ query="How do the baseline methods compare across my experiment logs?",
184
+ filters={"topic": "baseline-comparison"},
185
+ )
186
+ ```
187
+
188
+ ## Integration with Research Tools
189
+
190
+ ### Combining Private and Public Data
191
+
192
+ Deep Searcher can be combined with web search for comprehensive research that covers both your private collection and public sources:
193
+
194
+ ```python
195
+ from deepsearcher.sources import WebSearchSource
196
+
197
+ # Add web search as an additional source
198
+ config.add_source(WebSearchSource(
199
+ provider="tavily",
200
+ api_key_env="TAVILY_API_KEY",
201
+ ))
202
+
203
+ # Research now spans both private documents and the web
204
+ result = searcher.research(
205
+ query="Recent advances in protein folding prediction",
206
+ sources=["private", "web"],
207
+ )
208
+ ```
209
+
210
+ ### Export and Sharing
211
+
212
+ Export research results in formats suitable for academic use:
213
+
214
+ ```python
215
+ # Export as markdown report
216
+ result.export_markdown("research_report.md")
217
+
218
+ # Export citations in BibTeX format
219
+ result.export_citations("references.bib")
220
+
221
+ # Export the full research trace for reproducibility
222
+ result.export_trace("research_trace.json")
223
+ ```
224
+
225
+ ### API Server
226
+
227
+ Run Deep Searcher as a service for team-wide access:
228
+
229
+ ```bash
230
+ # Start the API server
231
+ deepsearcher serve --host 0.0.0.0 --port 8000
232
+
233
+ # Query via REST API
234
+ curl -X POST http://localhost:8000/research \
235
+ -H "Content-Type: application/json" \
236
+ -d '{"query": "What are the key findings in our latest experiments?"}'
237
+ ```
238
+
239
+ ## Performance and Scalability
240
+
241
+ Deep Searcher leverages Milvus for high-performance vector search, which means it can handle document collections ranging from hundreds to millions of documents efficiently. Key performance considerations include:
242
+
243
+ - **Indexing**: Milvus uses HNSW or IVF indexes for fast approximate nearest neighbor search
244
+ - **Chunking strategy**: Adjustable chunk size and overlap to balance retrieval precision and recall
245
+ - **Embedding caching**: Previously computed embeddings are cached to avoid redundant computation
246
+ - **Batch processing**: Documents can be ingested in parallel for faster indexing
247
+
248
+ ## References
249
+
250
+ - Repository: https://github.com/zilliztech/deep-searcher
251
+ - Milvus vector database: https://milvus.io/
252
+ - Zilliz Cloud (managed Milvus): https://zilliz.com/
253
+ - Milvus documentation: https://milvus.io/docs/
@@ -0,0 +1,191 @@
1
+ ---
2
+ name: gpt-researcher-guide
3
+ description: "Autonomous agent for comprehensive deep research on any topic"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "🔬"
7
+ category: "research"
8
+ subcategory: "deep-research"
9
+ keywords: ["deep-research", "autonomous-agent", "web-search", "report-generation", "literature-review"]
10
+ source: "https://github.com/assafelovic/gpt-researcher"
11
+ ---
12
+
13
+ # GPT Researcher Guide
14
+
15
+ ## Overview
16
+
17
+ GPT Researcher is an autonomous research agent with over 26,000 GitHub stars that conducts comprehensive online research on any given topic. Developed by Assaf Elovic, it generates detailed, factual, and unbiased research reports by planning research questions, searching multiple sources, scraping and filtering relevant content, and synthesizing findings into well-structured reports with citations.
18
+
19
+ The agent addresses a fundamental challenge in AI-assisted research: generating accurate, comprehensive reports rather than relying on a single LLM's potentially outdated or hallucinated knowledge. GPT Researcher uses a multi-agent architecture where a planner agent decomposes the research query into sub-questions, multiple retriever agents gather information from diverse sources, and a writer agent synthesizes everything into a coherent report.
20
+
21
+ For academic researchers, GPT Researcher is valuable for conducting preliminary literature surveys, exploring unfamiliar research domains, gathering background information for grant proposals, and generating initial drafts of review sections. The agent can be configured to search specific domains, use academic search engines, and output reports in various formats including markdown and PDF.
22
+
23
+ ## Installation and Setup
24
+
25
+ ```bash
26
+ # Install from PyPI
27
+ pip install gpt-researcher
28
+
29
+ # Or clone for development
30
+ git clone https://github.com/assafelovic/gpt-researcher.git
31
+ cd gpt-researcher
32
+ pip install -e .
33
+ ```
34
+
35
+ Configure your environment with API keys using environment variables:
36
+
37
+ ```bash
38
+ # Required: LLM provider (choose one)
39
+ export OPENAI_API_KEY=$OPENAI_API_KEY
40
+ # Or use other providers
41
+ export ANTHROPIC_API_KEY=$ANTHROPIC_API_KEY
42
+
43
+ # Required: Search provider (choose one)
44
+ export TAVILY_API_KEY=$TAVILY_API_KEY
45
+ # Or alternatives
46
+ export SERPER_API_KEY=$SERPER_API_KEY
47
+ export SEARX_URL=$SEARX_URL
48
+ ```
49
+
50
+ For a fully local setup without external API dependencies, you can configure local LLMs and search engines:
51
+
52
+ ```bash
53
+ # Use local LLM via Ollama
54
+ export OPENAI_BASE_URL=http://localhost:11434/v1
55
+ export LLM_PROVIDER=ollama
56
+ export FAST_LLM=llama3
57
+ export SMART_LLM=llama3
58
+
59
+ # Use local search via SearXNG
60
+ export SEARX_URL=http://localhost:8888
61
+ export SEARCH_PROVIDER=searx
62
+ ```
63
+
64
+ ## Core Research Workflow
65
+
66
+ ### Basic Research Report
67
+
68
+ Generate a research report with a single function call:
69
+
70
+ ```python
71
+ from gpt_researcher import GPTResearcher
72
+ import asyncio
73
+
74
+ async def run_research():
75
+ query = "Recent advances in protein structure prediction using deep learning"
76
+ researcher = GPTResearcher(query=query, report_type="research_report")
77
+
78
+ # Conduct research (searches, scrapes, analyzes sources)
79
+ research_result = await researcher.conduct_research()
80
+
81
+ # Generate the final report
82
+ report = await researcher.write_report()
83
+
84
+ # Access sources used
85
+ sources = researcher.get_source_urls()
86
+ print(f"Report based on {len(sources)} sources")
87
+ print(report)
88
+
89
+ asyncio.run(run_research())
90
+ ```
91
+
92
+ ### Report Types
93
+
94
+ GPT Researcher supports multiple report types tailored to different needs:
95
+
96
+ - **research_report**: Comprehensive report with findings and analysis (default)
97
+ - **detailed_report**: Extended multi-page report with deeper analysis
98
+ - **resource_report**: Curated list of sources with summaries and relevance scores
99
+ - **outline_report**: Structured outline for further manual research
100
+ - **subtopic_report**: Focused report on a specific subtopic within a broader area
101
+
102
+ ```python
103
+ # Generate a detailed multi-page report
104
+ researcher = GPTResearcher(
105
+ query="Transformer architectures for scientific document understanding",
106
+ report_type="detailed_report",
107
+ max_subtopics=5,
108
+ )
109
+ ```
110
+
111
+ ### Multi-Agent Architecture
112
+
113
+ The research process follows a sophisticated multi-agent pipeline:
114
+
115
+ 1. **Planner Agent**: Decomposes the research query into 4-6 focused sub-questions
116
+ 2. **Retriever Agents**: Each sub-question is researched independently by a dedicated agent that searches, scrapes, and filters content
117
+ 3. **Ranker Agent**: Evaluates and ranks gathered sources by relevance and quality
118
+ 4. **Writer Agent**: Synthesizes all findings into a coherent, well-structured report with inline citations
119
+
120
+ ```python
121
+ # Customize the research configuration
122
+ researcher = GPTResearcher(
123
+ query="Impact of climate change on marine biodiversity",
124
+ report_type="research_report",
125
+ source_urls=None, # Or provide specific URLs to research
126
+ config_path=None, # Or path to custom config
127
+ max_search_results_per_query=5,
128
+ verbose=True,
129
+ )
130
+ ```
131
+
132
+ ## Advanced Configuration
133
+
134
+ ### Custom Source Restrictions
135
+
136
+ Restrict research to specific domains or provide seed URLs:
137
+
138
+ ```python
139
+ # Research only from specific academic sources
140
+ researcher = GPTResearcher(
141
+ query="CRISPR gene editing safety profiles",
142
+ source_urls=[
143
+ "https://pubmed.ncbi.nlm.nih.gov/",
144
+ "https://www.nature.com/",
145
+ "https://www.science.org/",
146
+ ],
147
+ )
148
+ ```
149
+
150
+ ### LLM Configuration
151
+
152
+ Configure different LLMs for different stages of the research pipeline:
153
+
154
+ ```python
155
+ # Use a fast model for planning and a powerful model for writing
156
+ # Set via environment variables
157
+ # FAST_LLM: Used for sub-question generation and filtering
158
+ # SMART_LLM: Used for report synthesis and writing
159
+ ```
160
+
161
+ ### Integration with FastAPI
162
+
163
+ GPT Researcher includes a web interface and API server:
164
+
165
+ ```bash
166
+ # Start the web UI and API server
167
+ cd gpt-researcher
168
+ pip install -r requirements.txt
169
+ python -m uvicorn main:app --host 0.0.0.0 --port 8000
170
+ ```
171
+
172
+ The API exposes WebSocket endpoints for streaming research progress and REST endpoints for report management, making it easy to integrate into existing research platforms.
173
+
174
+ ## Academic Research Applications
175
+
176
+ GPT Researcher can be adapted for several academic use cases:
177
+
178
+ - **Preliminary literature surveys**: Quickly scan the landscape of a new research area before conducting a formal systematic review
179
+ - **Grant proposal background**: Gather recent developments and state-of-the-art results to strengthen research proposals
180
+ - **Conference talk preparation**: Generate comprehensive overviews of related work for presentations
181
+ - **Cross-disciplinary exploration**: Investigate adjacent fields to identify potential collaboration opportunities or interdisciplinary approaches
182
+ - **Fact-checking and verification**: Cross-reference claims across multiple sources to validate research findings
183
+
184
+ The reports include full citations with URLs, making it straightforward to verify sources and follow up with deeper reading of primary literature.
185
+
186
+ ## References
187
+
188
+ - Repository: https://github.com/assafelovic/gpt-researcher
189
+ - Documentation: https://docs.gptr.dev/
190
+ - Tavily Search API: https://tavily.com/
191
+ - Research paper: https://arxiv.org/abs/2305.04091