@wentorai/research-plugins 1.0.0 → 1.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (415) hide show
  1. package/README.md +22 -22
  2. package/curated/analysis/README.md +82 -56
  3. package/curated/domains/README.md +225 -69
  4. package/curated/literature/README.md +115 -46
  5. package/curated/research/README.md +106 -58
  6. package/curated/tools/README.md +107 -87
  7. package/curated/writing/README.md +92 -45
  8. package/mcp-configs/academic-db/alphafold-mcp.json +20 -0
  9. package/mcp-configs/academic-db/brightspace-mcp.json +21 -0
  10. package/mcp-configs/academic-db/climatiq-mcp.json +20 -0
  11. package/mcp-configs/academic-db/gibs-mcp.json +20 -0
  12. package/mcp-configs/academic-db/gis-mcp-server.json +22 -0
  13. package/mcp-configs/academic-db/google-earth-engine-mcp.json +21 -0
  14. package/mcp-configs/academic-db/m4-clinical-mcp.json +21 -0
  15. package/mcp-configs/academic-db/medical-mcp.json +21 -0
  16. package/mcp-configs/academic-db/nexonco-mcp.json +20 -0
  17. package/mcp-configs/academic-db/omop-mcp.json +20 -0
  18. package/mcp-configs/academic-db/onekgpd-mcp.json +20 -0
  19. package/mcp-configs/academic-db/openedu-mcp.json +20 -0
  20. package/mcp-configs/academic-db/opengenes-mcp.json +20 -0
  21. package/mcp-configs/academic-db/openstax-mcp.json +21 -0
  22. package/mcp-configs/academic-db/openstreetmap-mcp.json +21 -0
  23. package/mcp-configs/academic-db/opentargets-mcp.json +21 -0
  24. package/mcp-configs/academic-db/pdb-mcp.json +21 -0
  25. package/mcp-configs/academic-db/smithsonian-mcp.json +20 -0
  26. package/mcp-configs/ai-platform/magi-researchers.json +21 -0
  27. package/mcp-configs/ai-platform/mcp-academic-researcher.json +22 -0
  28. package/mcp-configs/ai-platform/open-paper-machine.json +21 -0
  29. package/mcp-configs/ai-platform/paper-intelligence.json +21 -0
  30. package/mcp-configs/ai-platform/paper-reader.json +21 -0
  31. package/mcp-configs/ai-platform/paperdebugger.json +21 -0
  32. package/mcp-configs/browser/exa-mcp.json +20 -0
  33. package/mcp-configs/browser/mcp-searxng.json +21 -0
  34. package/mcp-configs/browser/mcp-webresearch.json +20 -0
  35. package/mcp-configs/cloud-docs/confluence-mcp.json +37 -0
  36. package/mcp-configs/cloud-docs/google-drive-mcp.json +35 -0
  37. package/mcp-configs/cloud-docs/notion-mcp.json +29 -0
  38. package/mcp-configs/communication/discord-mcp.json +29 -0
  39. package/mcp-configs/communication/discourse-mcp.json +21 -0
  40. package/mcp-configs/communication/slack-mcp.json +29 -0
  41. package/mcp-configs/communication/telegram-mcp.json +28 -0
  42. package/mcp-configs/data-platform/automl-stat-mcp.json +21 -0
  43. package/mcp-configs/data-platform/jefferson-stats-mcp.json +22 -0
  44. package/mcp-configs/data-platform/mcp-excel-server.json +21 -0
  45. package/mcp-configs/data-platform/mcp-stata.json +21 -0
  46. package/mcp-configs/data-platform/mcpstack-jupyter.json +21 -0
  47. package/mcp-configs/data-platform/ml-mcp.json +21 -0
  48. package/mcp-configs/data-platform/nasdaq-data-link-mcp.json +20 -0
  49. package/mcp-configs/data-platform/numpy-mcp.json +21 -0
  50. package/mcp-configs/database/neo4j-mcp.json +37 -0
  51. package/mcp-configs/database/postgres-mcp.json +28 -0
  52. package/mcp-configs/database/sqlite-mcp.json +29 -0
  53. package/mcp-configs/dev-platform/geogebra-mcp.json +21 -0
  54. package/mcp-configs/dev-platform/github-mcp.json +31 -0
  55. package/mcp-configs/dev-platform/gitlab-mcp.json +34 -0
  56. package/mcp-configs/dev-platform/latex-mcp-server.json +21 -0
  57. package/mcp-configs/dev-platform/manim-mcp.json +20 -0
  58. package/mcp-configs/dev-platform/mcp-echarts.json +20 -0
  59. package/mcp-configs/dev-platform/panel-viz-mcp.json +20 -0
  60. package/mcp-configs/dev-platform/paperbanana.json +20 -0
  61. package/mcp-configs/dev-platform/texflow-mcp.json +20 -0
  62. package/mcp-configs/dev-platform/texmcp.json +20 -0
  63. package/mcp-configs/dev-platform/typst-mcp.json +21 -0
  64. package/mcp-configs/dev-platform/vizro-mcp.json +20 -0
  65. package/mcp-configs/email/email-mcp.json +40 -0
  66. package/mcp-configs/email/gmail-mcp.json +37 -0
  67. package/mcp-configs/note-knowledge/local-faiss-mcp.json +21 -0
  68. package/mcp-configs/note-knowledge/mcp-memory-service.json +21 -0
  69. package/mcp-configs/note-knowledge/mcp-obsidian.json +23 -0
  70. package/mcp-configs/note-knowledge/mcp-ragdocs.json +20 -0
  71. package/mcp-configs/note-knowledge/mcp-summarizer.json +21 -0
  72. package/mcp-configs/note-knowledge/mediawiki-mcp.json +21 -0
  73. package/mcp-configs/note-knowledge/openzim-mcp.json +20 -0
  74. package/mcp-configs/note-knowledge/zettelkasten-mcp.json +21 -0
  75. package/mcp-configs/reference-mgr/academic-paper-mcp-http.json +20 -0
  76. package/mcp-configs/reference-mgr/academix.json +20 -0
  77. package/mcp-configs/reference-mgr/arxiv-research-mcp.json +21 -0
  78. package/mcp-configs/reference-mgr/google-scholar-abstract-mcp.json +19 -0
  79. package/mcp-configs/reference-mgr/google-scholar-mcp.json +20 -0
  80. package/mcp-configs/reference-mgr/mcp-paperswithcode.json +21 -0
  81. package/mcp-configs/reference-mgr/mcp-scholarly.json +20 -0
  82. package/mcp-configs/reference-mgr/mcp-simple-arxiv.json +20 -0
  83. package/mcp-configs/reference-mgr/mcp-simple-pubmed.json +20 -0
  84. package/mcp-configs/reference-mgr/mcp-zotero.json +21 -0
  85. package/mcp-configs/reference-mgr/mendeley-mcp.json +20 -0
  86. package/mcp-configs/reference-mgr/ncbi-mcp-server.json +22 -0
  87. package/mcp-configs/reference-mgr/onecite.json +21 -0
  88. package/mcp-configs/reference-mgr/paper-search-mcp.json +21 -0
  89. package/mcp-configs/reference-mgr/pubmed-search-mcp.json +21 -0
  90. package/mcp-configs/reference-mgr/scholar-mcp.json +21 -0
  91. package/mcp-configs/reference-mgr/scholar-multi-mcp.json +21 -0
  92. package/mcp-configs/reference-mgr/seerai.json +21 -0
  93. package/mcp-configs/reference-mgr/semantic-scholar-fastmcp.json +21 -0
  94. package/mcp-configs/reference-mgr/sourcelibrary.json +20 -0
  95. package/mcp-configs/registry.json +178 -149
  96. package/mcp-configs/repository/dataverse-mcp.json +33 -0
  97. package/mcp-configs/repository/huggingface-mcp.json +29 -0
  98. package/openclaw.plugin.json +2 -2
  99. package/package.json +2 -2
  100. package/skills/analysis/dataviz/algorithm-visualizer-guide/SKILL.md +259 -0
  101. package/skills/analysis/dataviz/bokeh-visualization-guide/SKILL.md +270 -0
  102. package/skills/analysis/dataviz/chart-image-generator/SKILL.md +229 -0
  103. package/skills/analysis/dataviz/citation-map-guide/SKILL.md +184 -0
  104. package/skills/analysis/dataviz/d3-visualization-guide/SKILL.md +281 -0
  105. package/skills/analysis/dataviz/data-visualization-principles/SKILL.md +171 -0
  106. package/skills/analysis/dataviz/echarts-visualization-guide/SKILL.md +250 -0
  107. package/skills/analysis/dataviz/metabase-analytics-guide/SKILL.md +242 -0
  108. package/skills/analysis/dataviz/plotly-interactive-guide/SKILL.md +266 -0
  109. package/skills/analysis/dataviz/redash-analytics-guide/SKILL.md +284 -0
  110. package/skills/analysis/econometrics/econml-causal-guide/SKILL.md +163 -0
  111. package/skills/analysis/econometrics/empirical-paper-analysis/SKILL.md +192 -0
  112. package/skills/analysis/econometrics/mostly-harmless-guide/SKILL.md +139 -0
  113. package/skills/analysis/econometrics/panel-data-analyst/SKILL.md +259 -0
  114. package/skills/analysis/econometrics/panel-data-regression-workflow/SKILL.md +267 -0
  115. package/skills/analysis/econometrics/python-causality-guide/SKILL.md +134 -0
  116. package/skills/analysis/econometrics/stata-accounting-guide/SKILL.md +269 -0
  117. package/skills/analysis/econometrics/stata-analyst-guide/SKILL.md +245 -0
  118. package/skills/analysis/econometrics/stata-reference-guide/SKILL.md +293 -0
  119. package/skills/analysis/statistics/data-anomaly-detection/SKILL.md +157 -0
  120. package/skills/analysis/statistics/general-statistics-guide/SKILL.md +226 -0
  121. package/skills/analysis/statistics/infiagent-benchmark-guide/SKILL.md +106 -0
  122. package/skills/analysis/statistics/ml-experiment-tracker/SKILL.md +212 -0
  123. package/skills/analysis/statistics/pywayne-statistics-guide/SKILL.md +192 -0
  124. package/skills/analysis/statistics/quantitative-methods-guide/SKILL.md +193 -0
  125. package/skills/analysis/statistics/senior-data-scientist-guide/SKILL.md +223 -0
  126. package/skills/analysis/wrangling/claude-data-analysis-guide/SKILL.md +100 -0
  127. package/skills/analysis/wrangling/csv-data-analyzer/SKILL.md +170 -0
  128. package/skills/analysis/wrangling/data-cleaning-pipeline/SKILL.md +266 -0
  129. package/skills/analysis/wrangling/data-cog-guide/SKILL.md +178 -0
  130. package/skills/analysis/wrangling/open-data-scientist-guide/SKILL.md +197 -0
  131. package/skills/analysis/wrangling/stata-data-cleaning/SKILL.md +276 -0
  132. package/skills/analysis/wrangling/streamline-analyst-guide/SKILL.md +119 -0
  133. package/skills/analysis/wrangling/survey-data-processing/SKILL.md +298 -0
  134. package/skills/domains/ai-ml/ai-agent-papers-guide/SKILL.md +146 -0
  135. package/skills/domains/ai-ml/ai-model-benchmarking/SKILL.md +209 -0
  136. package/skills/domains/ai-ml/annotated-dl-papers-guide/SKILL.md +159 -0
  137. package/skills/domains/ai-ml/anomaly-detection-papers-guide/SKILL.md +167 -0
  138. package/skills/domains/ai-ml/autonomous-agents-papers-guide/SKILL.md +178 -0
  139. package/skills/domains/ai-ml/dl-transformer-finetune/SKILL.md +239 -0
  140. package/skills/domains/ai-ml/domain-adaptation-papers-guide/SKILL.md +173 -0
  141. package/skills/domains/ai-ml/generative-ai-guide/SKILL.md +146 -0
  142. package/skills/domains/ai-ml/graph-learning-papers-guide/SKILL.md +125 -0
  143. package/skills/domains/ai-ml/huggingface-inference-guide/SKILL.md +196 -0
  144. package/skills/domains/ai-ml/keras-deep-learning/SKILL.md +210 -0
  145. package/skills/domains/ai-ml/kolmogorov-arnold-networks-guide/SKILL.md +185 -0
  146. package/skills/domains/ai-ml/llm-from-scratch-guide/SKILL.md +124 -0
  147. package/skills/domains/ai-ml/ml-pipeline-guide/SKILL.md +295 -0
  148. package/skills/domains/ai-ml/nlp-toolkit-guide/SKILL.md +247 -0
  149. package/skills/domains/ai-ml/npcpy-research-guide/SKILL.md +137 -0
  150. package/skills/domains/ai-ml/pytorch-guide/SKILL.md +281 -0
  151. package/skills/domains/ai-ml/pytorch-lightning-guide/SKILL.md +244 -0
  152. package/skills/domains/ai-ml/responsible-ai-guide/SKILL.md +126 -0
  153. package/skills/domains/ai-ml/tensorflow-guide/SKILL.md +241 -0
  154. package/skills/domains/ai-ml/vmas-simulator-guide/SKILL.md +129 -0
  155. package/skills/domains/biomedical/bioagents-guide/SKILL.md +308 -0
  156. package/skills/domains/biomedical/clawbio-guide/SKILL.md +167 -0
  157. package/skills/domains/biomedical/clinical-dialogue-agents-guide/SKILL.md +145 -0
  158. package/skills/domains/biomedical/ena-sequence-api/SKILL.md +175 -0
  159. package/skills/domains/biomedical/genomas-guide/SKILL.md +126 -0
  160. package/skills/domains/biomedical/genotex-benchmark-guide/SKILL.md +125 -0
  161. package/skills/domains/biomedical/med-researcher-guide/SKILL.md +161 -0
  162. package/skills/domains/biomedical/med-researcher-r1-guide/SKILL.md +146 -0
  163. package/skills/domains/biomedical/medgeclaw-guide/SKILL.md +345 -0
  164. package/skills/domains/biomedical/medical-imaging-guide/SKILL.md +305 -0
  165. package/skills/domains/biomedical/ncbi-blast-api/SKILL.md +195 -0
  166. package/skills/domains/biomedical/ncbi-datasets-api/SKILL.md +220 -0
  167. package/skills/domains/biomedical/quickgo-api/SKILL.md +181 -0
  168. package/skills/domains/business/architecture-design-guide/SKILL.md +279 -0
  169. package/skills/domains/business/innovation-management-guide/SKILL.md +257 -0
  170. package/skills/domains/business/operations-research-guide/SKILL.md +258 -0
  171. package/skills/domains/business/xpert-bi-guide/SKILL.md +84 -0
  172. package/skills/domains/chemistry/cactus-cheminformatics-guide/SKILL.md +89 -0
  173. package/skills/domains/chemistry/chemeagle-guide/SKILL.md +147 -0
  174. package/skills/domains/chemistry/chemgraph-agent-guide/SKILL.md +120 -0
  175. package/skills/domains/chemistry/molecular-dynamics-guide/SKILL.md +237 -0
  176. package/skills/domains/chemistry/pubchem-api-guide/SKILL.md +180 -0
  177. package/skills/domains/chemistry/spectroscopy-analysis-guide/SKILL.md +290 -0
  178. package/skills/domains/cs/ai-security-papers-guide/SKILL.md +103 -0
  179. package/skills/domains/cs/code-llm-papers-guide/SKILL.md +131 -0
  180. package/skills/domains/cs/distributed-systems-guide/SKILL.md +268 -0
  181. package/skills/domains/cs/formal-verification-guide/SKILL.md +298 -0
  182. package/skills/domains/cs/gaussian-splatting-papers-guide/SKILL.md +158 -0
  183. package/skills/domains/cs/llm-aiops-guide/SKILL.md +70 -0
  184. package/skills/domains/cs/software-heritage-api/SKILL.md +200 -0
  185. package/skills/domains/ecology/species-distribution-guide/SKILL.md +343 -0
  186. package/skills/domains/economics/imf-data-api-guide/SKILL.md +174 -0
  187. package/skills/domains/economics/nber-working-papers-api/SKILL.md +177 -0
  188. package/skills/domains/economics/post-labor-economics/SKILL.md +254 -0
  189. package/skills/domains/economics/pricing-psychology-guide/SKILL.md +273 -0
  190. package/skills/domains/economics/repec-economics-api/SKILL.md +188 -0
  191. package/skills/domains/economics/world-bank-data-guide/SKILL.md +179 -0
  192. package/skills/domains/education/academic-study-methods/SKILL.md +228 -0
  193. package/skills/domains/education/assessment-design-guide/SKILL.md +213 -0
  194. package/skills/domains/education/educational-research-methods/SKILL.md +179 -0
  195. package/skills/domains/education/edumcp-guide/SKILL.md +74 -0
  196. package/skills/domains/education/mooc-analytics-guide/SKILL.md +206 -0
  197. package/skills/domains/education/open-syllabus-api/SKILL.md +171 -0
  198. package/skills/domains/finance/akshare-finance-data/SKILL.md +207 -0
  199. package/skills/domains/finance/finsight-research-guide/SKILL.md +113 -0
  200. package/skills/domains/finance/options-analytics-agent-guide/SKILL.md +117 -0
  201. package/skills/domains/finance/portfolio-optimization-guide/SKILL.md +279 -0
  202. package/skills/domains/finance/risk-modeling-guide/SKILL.md +260 -0
  203. package/skills/domains/finance/stata-accounting-research/SKILL.md +372 -0
  204. package/skills/domains/geoscience/climate-modeling-guide/SKILL.md +215 -0
  205. package/skills/domains/geoscience/pangaea-data-api/SKILL.md +197 -0
  206. package/skills/domains/geoscience/satellite-remote-sensing/SKILL.md +193 -0
  207. package/skills/domains/geoscience/seismology-data-guide/SKILL.md +208 -0
  208. package/skills/domains/humanities/digital-humanities-methods/SKILL.md +232 -0
  209. package/skills/domains/humanities/ethical-philosophy-guide/SKILL.md +244 -0
  210. package/skills/domains/humanities/history-research-guide/SKILL.md +260 -0
  211. package/skills/domains/humanities/political-history-guide/SKILL.md +241 -0
  212. package/skills/domains/law/caselaw-access-api/SKILL.md +149 -0
  213. package/skills/domains/law/legal-agent-skills-guide/SKILL.md +132 -0
  214. package/skills/domains/law/legal-nlp-guide/SKILL.md +236 -0
  215. package/skills/domains/law/legal-research-methods/SKILL.md +190 -0
  216. package/skills/domains/law/opencontracts-guide/SKILL.md +168 -0
  217. package/skills/domains/law/patent-analysis-guide/SKILL.md +257 -0
  218. package/skills/domains/law/regulatory-compliance-guide/SKILL.md +267 -0
  219. package/skills/domains/math/lean-theorem-proving-guide/SKILL.md +140 -0
  220. package/skills/domains/math/symbolic-computation-guide/SKILL.md +263 -0
  221. package/skills/domains/math/topology-data-analysis/SKILL.md +305 -0
  222. package/skills/domains/pharma/clinical-trial-design-guide/SKILL.md +271 -0
  223. package/skills/domains/pharma/drug-target-interaction/SKILL.md +242 -0
  224. package/skills/domains/pharma/madd-drug-discovery-guide/SKILL.md +153 -0
  225. package/skills/domains/pharma/pharmacovigilance-guide/SKILL.md +216 -0
  226. package/skills/domains/physics/astrophysics-data-guide/SKILL.md +305 -0
  227. package/skills/domains/physics/particle-physics-guide/SKILL.md +287 -0
  228. package/skills/domains/social-science/ipums-microdata-api/SKILL.md +211 -0
  229. package/skills/domains/social-science/network-analysis-guide/SKILL.md +310 -0
  230. package/skills/domains/social-science/psychology-research-guide/SKILL.md +270 -0
  231. package/skills/domains/social-science/sociology-research-guide/SKILL.md +238 -0
  232. package/skills/domains/social-science/sociology-research-methods/SKILL.md +181 -0
  233. package/skills/literature/discovery/arxiv-paper-monitoring/SKILL.md +233 -0
  234. package/skills/literature/discovery/paper-recommendation-guide/SKILL.md +120 -0
  235. package/skills/literature/discovery/papers-we-love-guide/SKILL.md +169 -0
  236. package/skills/literature/discovery/semantic-paper-radar/SKILL.md +144 -0
  237. package/skills/literature/discovery/zotero-arxiv-daily-guide/SKILL.md +94 -0
  238. package/skills/literature/fulltext/bioc-pmc-api/SKILL.md +146 -0
  239. package/skills/literature/fulltext/core-api-guide/SKILL.md +144 -0
  240. package/skills/literature/fulltext/dataverse-api/SKILL.md +215 -0
  241. package/skills/literature/fulltext/hal-archive-api/SKILL.md +218 -0
  242. package/skills/literature/fulltext/institutional-repository-guide/SKILL.md +212 -0
  243. package/skills/literature/fulltext/open-access-mining-guide/SKILL.md +341 -0
  244. package/skills/literature/fulltext/osf-api/SKILL.md +212 -0
  245. package/skills/literature/fulltext/pmc-ftp-bulk-download/SKILL.md +182 -0
  246. package/skills/literature/fulltext/zotero-ai-butler-guide/SKILL.md +166 -0
  247. package/skills/literature/fulltext/zotero-scihub-guide/SKILL.md +168 -0
  248. package/skills/literature/metadata/academic-paper-summarizer/SKILL.md +101 -0
  249. package/skills/literature/metadata/bibliometrix-guide/SKILL.md +164 -0
  250. package/skills/literature/metadata/crossref-event-data-api/SKILL.md +183 -0
  251. package/skills/literature/metadata/doi-content-negotiation/SKILL.md +202 -0
  252. package/skills/literature/metadata/orkg-api/SKILL.md +153 -0
  253. package/skills/literature/metadata/plumx-metrics-api/SKILL.md +188 -0
  254. package/skills/literature/metadata/ror-organization-api/SKILL.md +208 -0
  255. package/skills/literature/metadata/sophosia-reference-guide/SKILL.md +110 -0
  256. package/skills/literature/metadata/viaf-authority-api/SKILL.md +209 -0
  257. package/skills/literature/metadata/wikidata-api-guide/SKILL.md +156 -0
  258. package/skills/literature/metadata/zoplicate-dedup-guide/SKILL.md +147 -0
  259. package/skills/literature/metadata/zotero-actions-tags-guide/SKILL.md +212 -0
  260. package/skills/literature/metadata/zotmoov-guide/SKILL.md +120 -0
  261. package/skills/literature/metadata/zutilo-guide/SKILL.md +140 -0
  262. package/skills/literature/search/arxiv-batch-reporting/SKILL.md +133 -0
  263. package/skills/literature/search/arxiv-cli-tools/SKILL.md +172 -0
  264. package/skills/literature/search/arxiv-osiris/SKILL.md +199 -0
  265. package/skills/literature/search/arxiv-paper-processor/SKILL.md +141 -0
  266. package/skills/literature/search/baidu-scholar-guide/SKILL.md +110 -0
  267. package/skills/literature/search/base-academic-search/SKILL.md +196 -0
  268. package/skills/literature/search/chatpaper-guide/SKILL.md +122 -0
  269. package/skills/literature/search/citeseerx-api/SKILL.md +183 -0
  270. package/skills/literature/search/deep-literature-search/SKILL.md +149 -0
  271. package/skills/literature/search/deepgit-search-guide/SKILL.md +147 -0
  272. package/skills/literature/search/eric-education-api/SKILL.md +199 -0
  273. package/skills/literature/search/findpapers-guide/SKILL.md +177 -0
  274. package/skills/literature/search/ieee-xplore-api/SKILL.md +177 -0
  275. package/skills/literature/search/lens-scholarly-api/SKILL.md +211 -0
  276. package/skills/literature/search/multi-database-literature-search/SKILL.md +198 -0
  277. package/skills/literature/search/open-library-api/SKILL.md +196 -0
  278. package/skills/literature/search/open-semantic-search-guide/SKILL.md +190 -0
  279. package/skills/literature/search/openaire-api/SKILL.md +141 -0
  280. package/skills/literature/search/paper-search-mcp-guide/SKILL.md +107 -0
  281. package/skills/literature/search/papers-chat-guide/SKILL.md +194 -0
  282. package/skills/literature/search/pasa-paper-search-guide/SKILL.md +138 -0
  283. package/skills/literature/search/plos-open-access-api/SKILL.md +203 -0
  284. package/skills/literature/search/scielo-api/SKILL.md +182 -0
  285. package/skills/literature/search/share-research-api/SKILL.md +129 -0
  286. package/skills/literature/search/worldcat-search-api/SKILL.md +224 -0
  287. package/skills/research/automation/ai-scientist-v2-guide/SKILL.md +284 -0
  288. package/skills/research/automation/aim-experiment-guide/SKILL.md +234 -0
  289. package/skills/research/automation/claude-academic-workflow-guide/SKILL.md +202 -0
  290. package/skills/research/automation/coexist-ai-guide/SKILL.md +149 -0
  291. package/skills/research/automation/datagen-research-guide/SKILL.md +131 -0
  292. package/skills/research/automation/foam-agent-guide/SKILL.md +203 -0
  293. package/skills/research/automation/kedro-pipeline-guide/SKILL.md +216 -0
  294. package/skills/research/automation/mle-agent-guide/SKILL.md +139 -0
  295. package/skills/research/automation/paper-to-agent-guide/SKILL.md +116 -0
  296. package/skills/research/automation/rd-agent-guide/SKILL.md +246 -0
  297. package/skills/research/automation/research-paper-orchestrator/SKILL.md +254 -0
  298. package/skills/research/deep-research/academic-deep-research/SKILL.md +190 -0
  299. package/skills/research/deep-research/auto-deep-research-guide/SKILL.md +141 -0
  300. package/skills/research/deep-research/cognitive-kernel-guide/SKILL.md +200 -0
  301. package/skills/research/deep-research/corvus-research-guide/SKILL.md +132 -0
  302. package/skills/research/deep-research/deep-research-pro/SKILL.md +213 -0
  303. package/skills/research/deep-research/deep-research-work/SKILL.md +204 -0
  304. package/skills/research/deep-research/deep-searcher-guide/SKILL.md +253 -0
  305. package/skills/research/deep-research/gpt-researcher-guide/SKILL.md +191 -0
  306. package/skills/research/deep-research/in-depth-research-guide/SKILL.md +205 -0
  307. package/skills/research/deep-research/khoj-research-guide/SKILL.md +200 -0
  308. package/skills/research/deep-research/kosmos-scientist-guide/SKILL.md +185 -0
  309. package/skills/research/deep-research/llm-scientific-discovery-guide/SKILL.md +178 -0
  310. package/skills/research/deep-research/local-deep-research-guide/SKILL.md +253 -0
  311. package/skills/research/deep-research/open-researcher-guide/SKILL.md +138 -0
  312. package/skills/research/deep-research/tongyi-deep-research-guide/SKILL.md +217 -0
  313. package/skills/research/funding/eu-horizon-guide/SKILL.md +244 -0
  314. package/skills/research/funding/grant-budget-guide/SKILL.md +284 -0
  315. package/skills/research/funding/nih-reporter-api-guide/SKILL.md +166 -0
  316. package/skills/research/funding/nsf-award-api-guide/SKILL.md +133 -0
  317. package/skills/research/methodology/academic-mentor-guide/SKILL.md +169 -0
  318. package/skills/research/methodology/claude-scientific-guide/SKILL.md +122 -0
  319. package/skills/research/methodology/deep-innovator-guide/SKILL.md +242 -0
  320. package/skills/research/methodology/osf-api-guide/SKILL.md +165 -0
  321. package/skills/research/methodology/parsifal-slr-guide/SKILL.md +154 -0
  322. package/skills/research/methodology/research-paper-kb/SKILL.md +263 -0
  323. package/skills/research/methodology/research-pipeline-units-guide/SKILL.md +169 -0
  324. package/skills/research/methodology/research-town-guide/SKILL.md +263 -0
  325. package/skills/research/methodology/slr-automation-guide/SKILL.md +235 -0
  326. package/skills/research/paper-review/automated-review-guide/SKILL.md +281 -0
  327. package/skills/research/paper-review/latte-review-guide/SKILL.md +175 -0
  328. package/skills/research/paper-review/paper-compare-guide/SKILL.md +238 -0
  329. package/skills/research/paper-review/paper-critique-framework/SKILL.md +181 -0
  330. package/skills/research/paper-review/paper-digest-guide/SKILL.md +240 -0
  331. package/skills/research/paper-review/paper-research-assistant/SKILL.md +231 -0
  332. package/skills/research/paper-review/research-quality-filter/SKILL.md +261 -0
  333. package/skills/research/paper-review/review-response-guide/SKILL.md +275 -0
  334. package/skills/tools/code-exec/contextplus-mcp-guide/SKILL.md +110 -0
  335. package/skills/tools/code-exec/google-colab-guide/SKILL.md +276 -0
  336. package/skills/tools/code-exec/kaggle-api-guide/SKILL.md +216 -0
  337. package/skills/tools/code-exec/overleaf-cli-guide/SKILL.md +279 -0
  338. package/skills/tools/diagram/clawphd-guide/SKILL.md +149 -0
  339. package/skills/tools/diagram/code-flow-visualizer/SKILL.md +197 -0
  340. package/skills/tools/diagram/excalidraw-diagram-guide/SKILL.md +170 -0
  341. package/skills/tools/diagram/json-data-visualizer/SKILL.md +270 -0
  342. package/skills/tools/diagram/kroki-diagram-api/SKILL.md +198 -0
  343. package/skills/tools/diagram/mermaid-architect-guide/SKILL.md +219 -0
  344. package/skills/tools/diagram/scientific-graphical-abstract/SKILL.md +201 -0
  345. package/skills/tools/diagram/tldraw-whiteboard-guide/SKILL.md +397 -0
  346. package/skills/tools/document/docsgpt-guide/SKILL.md +130 -0
  347. package/skills/tools/document/large-document-reader/SKILL.md +202 -0
  348. package/skills/tools/document/md2pdf-xelatex/SKILL.md +212 -0
  349. package/skills/tools/document/openpaper-guide/SKILL.md +232 -0
  350. package/skills/tools/document/paper-parse-guide/SKILL.md +243 -0
  351. package/skills/tools/document/weknora-guide/SKILL.md +216 -0
  352. package/skills/tools/document/zotero-addon-market-guide/SKILL.md +108 -0
  353. package/skills/tools/document/zotero-night-theme-guide/SKILL.md +142 -0
  354. package/skills/tools/document/zotero-style-guide/SKILL.md +217 -0
  355. package/skills/tools/knowledge-graph/citation-network-builder/SKILL.md +244 -0
  356. package/skills/tools/knowledge-graph/concept-map-generator/SKILL.md +284 -0
  357. package/skills/tools/knowledge-graph/graphiti-guide/SKILL.md +219 -0
  358. package/skills/tools/knowledge-graph/mimir-memory-guide/SKILL.md +135 -0
  359. package/skills/tools/knowledge-graph/notero-zotero-notion-guide/SKILL.md +187 -0
  360. package/skills/tools/knowledge-graph/open-webui-tools-guide/SKILL.md +156 -0
  361. package/skills/tools/knowledge-graph/openspg-guide/SKILL.md +210 -0
  362. package/skills/tools/knowledge-graph/paperpile-notion-guide/SKILL.md +84 -0
  363. package/skills/tools/knowledge-graph/zotero-markdb-connect-guide/SKILL.md +162 -0
  364. package/skills/tools/ocr-translate/latex-translation-guide/SKILL.md +176 -0
  365. package/skills/tools/ocr-translate/math-equation-renderer/SKILL.md +198 -0
  366. package/skills/tools/ocr-translate/pdf-math-translate-guide/SKILL.md +141 -0
  367. package/skills/tools/ocr-translate/zotero-pdf-translate-guide/SKILL.md +95 -0
  368. package/skills/tools/ocr-translate/zotero-pdf2zh-guide/SKILL.md +143 -0
  369. package/skills/tools/scraping/dataset-finder-guide/SKILL.md +253 -0
  370. package/skills/tools/scraping/easy-spider-guide/SKILL.md +250 -0
  371. package/skills/tools/scraping/google-scholar-scraper/SKILL.md +255 -0
  372. package/skills/tools/scraping/repository-harvesting-guide/SKILL.md +310 -0
  373. package/skills/writing/citation/academic-citation-manager/SKILL.md +314 -0
  374. package/skills/writing/citation/academic-citation-manager-guide/SKILL.md +182 -0
  375. package/skills/writing/citation/citation-assistant-skill/SKILL.md +192 -0
  376. package/skills/writing/citation/jabref-reference-guide/SKILL.md +127 -0
  377. package/skills/writing/citation/jasminum-zotero-guide/SKILL.md +103 -0
  378. package/skills/writing/citation/mendeley-api/SKILL.md +231 -0
  379. package/skills/writing/citation/obsidian-citation-guide/SKILL.md +164 -0
  380. package/skills/writing/citation/obsidian-zotero-guide/SKILL.md +137 -0
  381. package/skills/writing/citation/onecite-reference-guide/SKILL.md +168 -0
  382. package/skills/writing/citation/papersgpt-zotero-guide/SKILL.md +132 -0
  383. package/skills/writing/citation/papis-cli-guide/SKILL.md +213 -0
  384. package/skills/writing/citation/zotero-better-bibtex-guide/SKILL.md +107 -0
  385. package/skills/writing/citation/zotero-better-notes-guide/SKILL.md +121 -0
  386. package/skills/writing/citation/zotero-gpt-guide/SKILL.md +111 -0
  387. package/skills/writing/citation/zotero-mcp-guide/SKILL.md +164 -0
  388. package/skills/writing/citation/zotero-mdnotes-guide/SKILL.md +162 -0
  389. package/skills/writing/citation/zotero-reference-guide/SKILL.md +139 -0
  390. package/skills/writing/citation/zotero-scholar-guide/SKILL.md +294 -0
  391. package/skills/writing/citation/zotfile-attachment-guide/SKILL.md +140 -0
  392. package/skills/writing/composition/ml-paper-writing/SKILL.md +163 -0
  393. package/skills/writing/composition/opendraft-thesis-guide/SKILL.md +200 -0
  394. package/skills/writing/composition/paper-debugger-guide/SKILL.md +143 -0
  395. package/skills/writing/composition/paperforge-guide/SKILL.md +205 -0
  396. package/skills/writing/composition/research-paper-writer/SKILL.md +226 -0
  397. package/skills/writing/composition/scientific-writing-resources/SKILL.md +151 -0
  398. package/skills/writing/composition/scientific-writing-wrapper/SKILL.md +153 -0
  399. package/skills/writing/latex/academic-writing-latex/SKILL.md +285 -0
  400. package/skills/writing/latex/latex-drawing-collection/SKILL.md +154 -0
  401. package/skills/writing/latex/latex-templates-collection/SKILL.md +159 -0
  402. package/skills/writing/latex/md-to-pdf-academic/SKILL.md +230 -0
  403. package/skills/writing/latex/tex-render-guide/SKILL.md +243 -0
  404. package/skills/writing/polish/academic-tone-guide/SKILL.md +209 -0
  405. package/skills/writing/polish/chinese-text-humanizer/SKILL.md +140 -0
  406. package/skills/writing/polish/conciseness-editing-guide/SKILL.md +225 -0
  407. package/skills/writing/polish/paper-polish-guide/SKILL.md +160 -0
  408. package/skills/writing/templates/arxiv-preprint-template/SKILL.md +184 -0
  409. package/skills/writing/templates/elegant-paper-template/SKILL.md +141 -0
  410. package/skills/writing/templates/graphical-abstract-guide/SKILL.md +183 -0
  411. package/skills/writing/templates/novathesis-guide/SKILL.md +152 -0
  412. package/skills/writing/templates/scientific-article-pdf/SKILL.md +261 -0
  413. package/skills/writing/templates/sjtuthesis-guide/SKILL.md +197 -0
  414. package/skills/writing/templates/thuthesis-guide/SKILL.md +181 -0
  415. package/skills/literature/fulltext/repository-harvesting-guide/SKILL.md +0 -207
@@ -0,0 +1,146 @@
1
+ ---
2
+ name: generative-ai-guide
3
+ description: "Curated guide to generative AI covering LLMs and diffusion models"
4
+ version: 1.0.0
5
+ author: wentor-community
6
+ source: https://github.com/aishwaryanr/awesome-generative-ai-guide
7
+ metadata:
8
+ openclaw:
9
+ category: "domains"
10
+ subcategory: "ai-ml"
11
+ keywords:
12
+ - generative-ai
13
+ - large-language-models
14
+ - diffusion-models
15
+ - transformers
16
+ - prompt-engineering
17
+ - ai-research
18
+ ---
19
+
20
+ # Generative AI Guide
21
+
22
+ A skill providing a comprehensive, curated guide to generative AI research and practice, covering large language models (LLMs), diffusion models, transformer architectures, prompt engineering, and evaluation methodologies. Based on the awesome-generative-ai-guide repository (25K stars), this skill equips researchers with structured knowledge of the rapidly evolving generative AI landscape.
23
+
24
+ ## Overview
25
+
26
+ Generative AI has become one of the most active areas of research across computer science, with implications spanning natural language processing, computer vision, audio synthesis, code generation, scientific discovery, and creative applications. The pace of development makes it challenging for researchers to maintain a current understanding of the field. This skill provides a structured map of the generative AI landscape, organized by topic and application area, with guidance on key papers, methods, and practical considerations.
27
+
28
+ Whether you are an AI researcher staying current with the field, a domain scientist exploring how generative AI can accelerate your work, or a student entering the field, this skill provides the orientation and resources needed to navigate the space effectively.
29
+
30
+ ## Large Language Models
31
+
32
+ **Architecture Foundations**
33
+ - Transformer architecture: self-attention mechanism, positional encoding, layer normalization
34
+ - Scaling laws: the relationship between model size, data, compute, and performance
35
+ - Training objectives: causal language modeling, masked language modeling, instruction tuning
36
+ - Context windows: evolution from 512 tokens to 100K+ tokens and associated techniques
37
+ - Mixture of Experts (MoE): sparse activation for efficient scaling
38
+
39
+ **Key Model Families**
40
+ - GPT series (OpenAI): decoder-only architecture, scaling-driven approach
41
+ - Claude series (Anthropic): emphasis on safety, instruction following, and long context
42
+ - Llama series (Meta): open-weight models enabling community research
43
+ - Gemini series (Google): multimodal from the ground up
44
+ - Open-source ecosystem: Mistral, Qwen, DeepSeek, and community fine-tunes
45
+
46
+ **Training Pipeline**
47
+ - Pre-training: large-scale unsupervised learning on web-scale text corpora
48
+ - Supervised fine-tuning (SFT): training on high-quality instruction-response pairs
49
+ - Reinforcement learning from human feedback (RLHF): aligning outputs with human preferences
50
+ - Direct preference optimization (DPO): simplified alignment without reward models
51
+ - Constitutional AI: self-improvement using principle-based critique
52
+
53
+ **Inference Optimization**
54
+ - Quantization: reducing model precision (FP16, INT8, INT4) for faster inference
55
+ - KV-cache optimization: efficient memory management for long sequences
56
+ - Speculative decoding: using small models to draft and large models to verify
57
+ - Batching strategies: continuous batching for throughput optimization
58
+ - Serving frameworks: vLLM, TGI, and other high-performance inference engines
59
+
60
+ ## Diffusion Models
61
+
62
+ **Core Concepts**
63
+ - Forward process: gradually adding noise to data until reaching pure noise
64
+ - Reverse process: learning to denoise step by step to generate new data
65
+ - Score matching: estimating the gradient of the data distribution
66
+ - Classifier-free guidance: controlling generation fidelity and diversity
67
+ - Latent diffusion: operating in compressed latent space for efficiency
68
+
69
+ **Key Architectures**
70
+ - DDPM (Denoising Diffusion Probabilistic Models): foundational formulation
71
+ - Stable Diffusion: latent space diffusion with text conditioning
72
+ - DALL-E series: text-to-image generation with CLIP-based conditioning
73
+ - Imagen: text-to-image with cascaded diffusion models
74
+ - Video diffusion models: extending to temporal generation
75
+
76
+ **Applications in Research**
77
+ - Molecular generation: designing new drug candidates and materials
78
+ - Protein structure prediction: generating plausible protein conformations
79
+ - Scientific data augmentation: creating synthetic training data
80
+ - Image restoration: denoising, super-resolution, inpainting for microscopy
81
+ - Simulation acceleration: approximating expensive physical simulations
82
+
83
+ ## Prompt Engineering
84
+
85
+ **Fundamental Techniques**
86
+ - Zero-shot prompting: direct instruction without examples
87
+ - Few-shot prompting: providing examples to establish the desired pattern
88
+ - Chain-of-thought (CoT): requesting step-by-step reasoning
89
+ - Self-consistency: sampling multiple reasoning chains and selecting the majority
90
+ - Tree of thought: exploring multiple reasoning branches systematically
91
+
92
+ **Advanced Strategies**
93
+ - ReAct (Reasoning + Acting): interleaving reasoning with tool use
94
+ - Retrieval-augmented generation (RAG): grounding responses in retrieved documents
95
+ - Program-aided language models: generating and executing code for precise computation
96
+ - Structured output: constraining generation to valid JSON, XML, or other formats
97
+ - Multi-agent prompting: orchestrating multiple LLM instances for complex tasks
98
+
99
+ **Research-Specific Prompting**
100
+ - Literature synthesis: prompting for balanced integration of multiple sources
101
+ - Hypothesis generation: structured prompts for creative scientific reasoning
102
+ - Code debugging: providing error context and asking for systematic diagnosis
103
+ - Data analysis: chaining prompts through exploratory analysis to interpretation
104
+ - Writing assistance: iterative refinement prompts that preserve the author's voice
105
+
106
+ ## Evaluation and Benchmarks
107
+
108
+ **Language Model Evaluation**
109
+ - Perplexity: intrinsic measure of model quality on held-out text
110
+ - MMLU: massive multi-task language understanding across 57 subjects
111
+ - HumanEval: code generation benchmark with function completion tasks
112
+ - MT-Bench: multi-turn conversation quality assessment
113
+ - Arena Elo: head-to-head comparison ratings from human preferences
114
+
115
+ **Generation Quality Metrics**
116
+ - FID (Frechet Inception Distance): image generation quality and diversity
117
+ - CLIP score: text-image alignment for conditional generation
118
+ - BLEU, ROUGE: text generation overlap metrics (limited but widely used)
119
+ - Human evaluation: gold standard requiring careful protocol design
120
+ - Calibration: measuring whether model confidence matches actual accuracy
121
+
122
+ **Safety and Alignment Evaluation**
123
+ - Red-teaming: adversarial testing for harmful outputs
124
+ - Bias benchmarks: measuring demographic and cultural biases
125
+ - Hallucination detection: identifying fabricated facts in generated text
126
+ - Instruction following: measuring compliance with complex multi-step instructions
127
+ - Robustness testing: evaluating consistency under paraphrased inputs
128
+
129
+ ## Integration with Research-Claw
130
+
131
+ This skill provides the Research-Claw agent with generative AI domain expertise:
132
+
133
+ - Help researchers understand and apply generative AI techniques to their domain
134
+ - Guide model selection based on task requirements and resource constraints
135
+ - Assist with prompt engineering for research-specific applications
136
+ - Connect with analysis skills for evaluating generative model outputs
137
+ - Support writing skills with knowledge of the latest developments for literature reviews
138
+
139
+ ## Best Practices
140
+
141
+ - Stay current by monitoring key conferences (NeurIPS, ICML, ICLR, ACL, CVPR) and arXiv
142
+ - Distinguish between benchmark performance and real-world applicability
143
+ - Consider computational costs and environmental impact when selecting models
144
+ - Evaluate models on your specific task rather than relying solely on leaderboard rankings
145
+ - Document prompt strategies and model versions for reproducibility
146
+ - Be aware of the limitations: hallucination, bias, and sensitivity to prompt phrasing
@@ -0,0 +1,125 @@
1
+ ---
2
+ name: graph-learning-papers-guide
3
+ description: "Conference papers on graph neural networks and graph learning"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "📊"
7
+ category: "domains"
8
+ subcategory: "ai-ml"
9
+ keywords: ["graph neural network", "GNN", "graph learning", "graph transformer", "message passing", "node classification"]
10
+ source: "https://github.com/doujiang-zheng/Awesome-Graph-Learning-Papers-List"
11
+ ---
12
+
13
+ # Graph Learning Papers Guide
14
+
15
+ ## Overview
16
+
17
+ A curated list of graph learning papers from top AI/ML conferences (NeurIPS, ICML, ICLR, KDD, WWW, AAAI). Covers graph neural networks, graph transformers, spectral methods, message passing, and applications in molecular science, social networks, and recommendation systems. Organized by venue, year, and topic for systematic tracking.
18
+
19
+ ## Topic Taxonomy
20
+
21
+ ```
22
+ Graph Learning
23
+ ├── Graph Neural Networks
24
+ │ ├── Message Passing (GCN, GAT, GraphSAGE, GIN)
25
+ │ ├── Spectral (ChebNet, CayleyNet)
26
+ │ ├── Graph Transformers (Graphormer, GPS)
27
+ │ └── Equivariant GNNs (EGNN, SE(3)-Transformers)
28
+ ├── Graph Generation
29
+ │ ├── VAE-based (GraphVAE)
30
+ │ ├── Autoregressive (GraphRNN)
31
+ │ ├── Diffusion (GDSS, DiGress)
32
+ │ └── Flow-based (GraphFlow)
33
+ ├── Self-supervised Learning
34
+ │ ├── Contrastive (GraphCL, GCA)
35
+ │ ├── Generative (GraphMAE)
36
+ │ └── Predictive (GPT-GNN)
37
+ ├── Scalability
38
+ │ ├── Sampling (GraphSAINT, ClusterGCN)
39
+ │ ├── Knowledge distillation
40
+ │ └── Graph condensation
41
+ ├── Temporal Graphs
42
+ │ ├── Dynamic GNNs
43
+ │ ├── Temporal interaction
44
+ │ └── Evolving graphs
45
+ └── Applications
46
+ ├── Molecular property prediction
47
+ ├── Drug discovery
48
+ ├── Social network analysis
49
+ ├── Recommendation systems
50
+ └── Traffic forecasting
51
+ ```
52
+
53
+ ## Key Models
54
+
55
+ | Model | Year | Innovation |
56
+ |-------|------|-----------|
57
+ | **GCN** | 2017 | Spectral convolution simplified |
58
+ | **GraphSAGE** | 2017 | Inductive with sampling |
59
+ | **GAT** | 2018 | Attention over neighbors |
60
+ | **GIN** | 2019 | WL-test as powerful as possible |
61
+ | **Graphormer** | 2021 | Transformer on graphs |
62
+ | **GPS** | 2022 | General, powerful, scalable recipe |
63
+ | **GraphMAE** | 2022 | Masked autoencoding on graphs |
64
+
65
+ ## Paper Search
66
+
67
+ ```python
68
+ import arxiv
69
+
70
+ def find_gnn_papers(topic="graph neural network", max_results=20):
71
+ """Find recent GNN papers."""
72
+ search = arxiv.Search(
73
+ query=f"abs:{topic}",
74
+ max_results=max_results,
75
+ sort_by=arxiv.SortCriterion.SubmittedDate,
76
+ )
77
+
78
+ for r in search.results():
79
+ print(f"[{r.published.strftime('%Y-%m-%d')}] {r.title}")
80
+
81
+ find_gnn_papers("graph transformer")
82
+ find_gnn_papers("molecular graph generation")
83
+ ```
84
+
85
+ ## Benchmark Datasets
86
+
87
+ ```python
88
+ datasets = {
89
+ "Node Classification": {
90
+ "Cora": "Citation network, 7 classes",
91
+ "PubMed": "Medical citation, 3 classes",
92
+ "ogbn-arxiv": "arXiv papers, 40 classes",
93
+ "ogbn-papers100M": "100M papers (large-scale)",
94
+ },
95
+ "Graph Classification": {
96
+ "ZINC": "Molecular graphs, regression",
97
+ "ogbg-molpcba": "128 molecular tasks",
98
+ "PROTEINS": "Protein function prediction",
99
+ },
100
+ "Link Prediction": {
101
+ "ogbl-collab": "Author collaborations",
102
+ "ogbl-citation2": "Citation prediction",
103
+ },
104
+ }
105
+
106
+ for task, ds in datasets.items():
107
+ print(f"\n{task}:")
108
+ for name, desc in ds.items():
109
+ print(f" {name}: {desc}")
110
+ ```
111
+
112
+ ## Use Cases
113
+
114
+ 1. **Literature survey**: Track GNN research across top venues
115
+ 2. **Method comparison**: Compare GNN architectures and results
116
+ 3. **Research planning**: Identify trends and open problems
117
+ 4. **Course preparation**: Curate reading lists for GNN courses
118
+ 5. **Benchmark tracking**: Monitor SOTA on OGB leaderboards
119
+
120
+ ## References
121
+
122
+ - [Awesome-Graph-Learning-Papers-List](https://github.com/doujiang-zheng/Awesome-Graph-Learning-Papers-List)
123
+ - [Open Graph Benchmark](https://ogb.stanford.edu/)
124
+ - [PyG (PyTorch Geometric)](https://pyg.org/)
125
+ - [DGL (Deep Graph Library)](https://www.dgl.ai/)
@@ -0,0 +1,196 @@
1
+ ---
2
+ name: huggingface-inference-guide
3
+ description: "Run NLP and CV model inference via Hugging Face free-tier API"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "🤗"
7
+ category: "domains"
8
+ subcategory: "ai-ml"
9
+ keywords: ["huggingface", "inference", "nlp", "machine-learning", "transformers", "models"]
10
+ source: "https://huggingface.co/docs/api-inference/index"
11
+ ---
12
+
13
+ # Hugging Face Inference API Guide
14
+
15
+ ## Overview
16
+
17
+ The Hugging Face Inference API provides instant access to thousands of pre-trained machine learning models for natural language processing, computer vision, audio processing, and multimodal tasks. Researchers can run inference on state-of-the-art models without managing infrastructure, GPU resources, or complex deployment pipelines.
18
+
19
+ The API hosts models from the Hugging Face Hub, which contains over 500,000 models contributed by the research community. This includes transformer models for text classification, named entity recognition, summarization, translation, question answering, text generation, and image classification. For academic researchers, the Inference API is invaluable for rapid prototyping, benchmark evaluation, and integrating ML capabilities into research workflows without dedicated compute resources.
20
+
21
+ The free tier provides access to a broad selection of models with rate limits suitable for development and small-scale research. An API token is required for authentication, available for free at huggingface.co.
22
+
23
+ ## Authentication
24
+
25
+ A free Hugging Face API token is required. Create an account and generate a token at https://huggingface.co/settings/tokens.
26
+
27
+ Store your token securely in an environment variable:
28
+
29
+ ```bash
30
+ export HF_API_TOKEN=$HF_API_TOKEN
31
+ ```
32
+
33
+ ```bash
34
+ curl -X POST "https://api-inference.huggingface.co/models/bert-base-uncased" \
35
+ -H "Authorization: Bearer $HF_API_TOKEN" \
36
+ -H "Content-Type: application/json" \
37
+ -d '{"inputs": "The goal of life is [MASK]."}'
38
+ ```
39
+
40
+ ## Core Endpoints
41
+
42
+ ### Text Classification (Sentiment Analysis)
43
+
44
+ ```
45
+ POST https://api-inference.huggingface.co/models/{model_id}
46
+ ```
47
+
48
+ ```bash
49
+ curl -s -X POST \
50
+ "https://api-inference.huggingface.co/models/distilbert-base-uncased-finetuned-sst-2-english" \
51
+ -H "Authorization: Bearer $HF_API_TOKEN" \
52
+ -H "Content-Type: application/json" \
53
+ -d '{"inputs": "This research methodology provides robust and reproducible results."}' \
54
+ | python3 -m json.tool
55
+ ```
56
+
57
+ ### Named Entity Recognition
58
+
59
+ ```bash
60
+ curl -s -X POST \
61
+ "https://api-inference.huggingface.co/models/dslim/bert-base-NER" \
62
+ -H "Authorization: Bearer $HF_API_TOKEN" \
63
+ -H "Content-Type: application/json" \
64
+ -d '{"inputs": "Dr. Marie Curie conducted research at the University of Paris on radioactivity."}' \
65
+ | python3 -m json.tool
66
+ ```
67
+
68
+ ### Text Summarization
69
+
70
+ ```bash
71
+ curl -s -X POST \
72
+ "https://api-inference.huggingface.co/models/facebook/bart-large-cnn" \
73
+ -H "Authorization: Bearer $HF_API_TOKEN" \
74
+ -H "Content-Type: application/json" \
75
+ -d '{
76
+ "inputs": "The study of quantum computing has seen tremendous advances in the past decade. Researchers have demonstrated quantum supremacy with processors containing over 100 qubits. Error correction remains a significant challenge, but recent breakthroughs in topological qubits and surface codes suggest viable paths forward. Applications in drug discovery, materials science, and cryptography are expected to be among the first practical use cases.",
77
+ "parameters": {"max_length": 80, "min_length": 30}
78
+ }' | python3 -m json.tool
79
+ ```
80
+
81
+ ### Zero-Shot Classification
82
+
83
+ Classify text into arbitrary categories without fine-tuning.
84
+
85
+ ```bash
86
+ curl -s -X POST \
87
+ "https://api-inference.huggingface.co/models/facebook/bart-large-mnli" \
88
+ -H "Authorization: Bearer $HF_API_TOKEN" \
89
+ -H "Content-Type: application/json" \
90
+ -d '{
91
+ "inputs": "New CRISPR technique enables precise gene editing in human stem cells",
92
+ "parameters": {"candidate_labels": ["biology", "computer science", "physics", "economics"]}
93
+ }' | python3 -m json.tool
94
+ ```
95
+
96
+ ### Python Example: Batch Sentiment Analysis of Paper Abstracts
97
+
98
+ ```python
99
+ import requests
100
+ import os
101
+ import time
102
+
103
+ API_URL = "https://api-inference.huggingface.co/models/distilbert-base-uncased-finetuned-sst-2-english"
104
+ HEADERS = {"Authorization": f"Bearer {os.environ['HF_API_TOKEN']}"}
105
+
106
+ def classify_sentiment(texts):
107
+ """Classify sentiment for a batch of texts."""
108
+ response = requests.post(API_URL, headers=HEADERS, json={"inputs": texts})
109
+ if response.status_code == 503:
110
+ # Model is loading, wait and retry
111
+ wait_time = response.json().get("estimated_time", 20)
112
+ print(f"Model loading, waiting {wait_time:.0f}s...")
113
+ time.sleep(wait_time)
114
+ response = requests.post(API_URL, headers=HEADERS, json={"inputs": texts})
115
+ response.raise_for_status()
116
+ return response.json()
117
+
118
+ abstracts = [
119
+ "Our results demonstrate a significant improvement over baseline methods.",
120
+ "The proposed approach failed to achieve meaningful gains on the benchmark.",
121
+ "We present preliminary findings that warrant further investigation.",
122
+ ]
123
+
124
+ results = classify_sentiment(abstracts)
125
+ for abstract, result in zip(abstracts, results):
126
+ top = max(result, key=lambda x: x["score"])
127
+ print(f"Sentiment: {top['label']} ({top['score']:.3f})")
128
+ print(f" Text: {abstract[:80]}...")
129
+ print()
130
+ ```
131
+
132
+ ### Python Example: Research Paper Topic Classification
133
+
134
+ ```python
135
+ import requests
136
+ import os
137
+
138
+ ZSC_URL = "https://api-inference.huggingface.co/models/facebook/bart-large-mnli"
139
+ HEADERS = {"Authorization": f"Bearer {os.environ['HF_API_TOKEN']}"}
140
+
141
+ def classify_paper(abstract, categories):
142
+ """Classify a paper abstract into research categories."""
143
+ payload = {
144
+ "inputs": abstract,
145
+ "parameters": {"candidate_labels": categories}
146
+ }
147
+ resp = requests.post(ZSC_URL, headers=HEADERS, json=payload)
148
+ resp.raise_for_status()
149
+ return resp.json()
150
+
151
+ categories = [
152
+ "machine learning",
153
+ "computational biology",
154
+ "natural language processing",
155
+ "computer vision",
156
+ "reinforcement learning",
157
+ "quantum computing"
158
+ ]
159
+
160
+ abstract = "We propose a novel transformer architecture for protein structure prediction that achieves state-of-the-art results on CASP benchmarks."
161
+ result = classify_paper(abstract, categories)
162
+
163
+ print("Topic classification:")
164
+ for label, score in zip(result["labels"], result["scores"]):
165
+ bar = "#" * int(score * 40)
166
+ print(f" {label:<30} {score:.3f} {bar}")
167
+ ```
168
+
169
+ ## Common Research Patterns
170
+
171
+ **Literature Screening:** Use zero-shot classification to automatically categorize and filter large collections of paper abstracts by research topic, methodology, or relevance to a specific research question.
172
+
173
+ **Sentiment and Stance Detection:** Analyze the tone and conclusions of research papers, review comments, or social media discussions about scientific topics using sentiment analysis models.
174
+
175
+ **Named Entity Extraction:** Extract researcher names, institutions, chemical compounds, gene names, and other domain-specific entities from unstructured text in papers and reports.
176
+
177
+ **Automated Summarization:** Generate concise summaries of lengthy research papers or grant proposals to accelerate literature review workflows.
178
+
179
+ **Multilingual Research:** Use translation and multilingual models to access and analyze research published in languages other than English.
180
+
181
+ ## Rate Limits and Best Practices
182
+
183
+ - **Free tier:** Rate-limited; approximately 1,000 requests per day depending on model and load
184
+ - **Model loading:** Cold models may take 20-60 seconds to load; handle 503 responses with retry logic
185
+ - **Batch inputs:** Send multiple texts as an array in a single request to improve throughput
186
+ - **Model selection:** Use distilled or smaller variants (e.g., `distilbert` instead of `bert-large`) for faster inference
187
+ - **Timeouts:** Set request timeouts to 60+ seconds for large models or first requests after cold start
188
+ - **Caching:** Cache inference results for identical inputs to avoid redundant API calls
189
+ - **Pro tier:** For production workloads, consider the Inference Endpoints or Pro subscription for dedicated resources
190
+
191
+ ## References
192
+
193
+ - Hugging Face Inference API Documentation: https://huggingface.co/docs/api-inference/index
194
+ - Hugging Face Model Hub: https://huggingface.co/models
195
+ - Hugging Face API Token Settings: https://huggingface.co/settings/tokens
196
+ - Hugging Face Tasks Overview: https://huggingface.co/tasks
@@ -0,0 +1,210 @@
1
+ ---
2
+ name: keras-deep-learning
3
+ description: "Build and debug deep learning models with Keras and TensorFlow backend"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "🔬"
7
+ category: "domains"
8
+ subcategory: "ai-ml"
9
+ keywords: ["Keras", "deep learning", "neural network", "model training", "TensorFlow", "classification"]
10
+ source: "https://github.com/fchollet/deep-learning-with-python-notebooks"
11
+ ---
12
+
13
+ # Keras Deep Learning Guide
14
+
15
+ ## Overview
16
+
17
+ Keras is the high-level deep learning API that ships as part of TensorFlow 2.x and is the recommended interface for building, training, and deploying neural networks. Its Sequential and Functional APIs provide a progressive disclosure of complexity: beginners can stack layers in minutes, while researchers can build arbitrary DAG architectures, custom training loops, and multi-output models with the same framework.
18
+
19
+ This guide covers practical patterns for academic research with Keras, from image classification and sequence modeling to custom loss functions and experiment reproducibility. The focus is on patterns that appear repeatedly in published work -- data loading pipelines, callback orchestration, hyperparameter search, and model introspection -- rather than toy examples.
20
+
21
+ Keras is particularly strong in rapid prototyping for research papers. Its integration with TensorBoard, Weights & Biases, and tf.data pipelines makes it straightforward to go from idea to reproducible experiment to publication-quality results.
22
+
23
+ ## Model Architecture Patterns
24
+
25
+ ### Sequential API for Standard Architectures
26
+
27
+ ```python
28
+ import tensorflow as tf
29
+ from tensorflow import keras
30
+ from tensorflow.keras import layers
31
+
32
+ # Image classification baseline
33
+ model = keras.Sequential([
34
+ layers.Input(shape=(224, 224, 3)),
35
+ layers.Rescaling(1.0 / 255),
36
+ layers.Conv2D(32, 3, activation="relu", padding="same"),
37
+ layers.BatchNormalization(),
38
+ layers.MaxPooling2D(2),
39
+ layers.Conv2D(64, 3, activation="relu", padding="same"),
40
+ layers.BatchNormalization(),
41
+ layers.MaxPooling2D(2),
42
+ layers.Conv2D(128, 3, activation="relu", padding="same"),
43
+ layers.GlobalAveragePooling2D(),
44
+ layers.Dropout(0.3),
45
+ layers.Dense(256, activation="relu"),
46
+ layers.Dense(10, activation="softmax"),
47
+ ])
48
+
49
+ model.compile(
50
+ optimizer=keras.optimizers.AdamW(learning_rate=1e-3, weight_decay=1e-4),
51
+ loss="sparse_categorical_crossentropy",
52
+ metrics=["accuracy"],
53
+ )
54
+ ```
55
+
56
+ ### Functional API for Multi-Input/Multi-Output Models
57
+
58
+ ```python
59
+ # Multi-input model for multimodal research
60
+ image_input = keras.Input(shape=(224, 224, 3), name="image")
61
+ text_input = keras.Input(shape=(128,), dtype="int32", name="text")
62
+
63
+ # Image branch
64
+ x_img = keras.applications.EfficientNetV2B0(
65
+ include_top=False, weights="imagenet", input_tensor=image_input
66
+ ).output
67
+ x_img = layers.GlobalAveragePooling2D()(x_img)
68
+
69
+ # Text branch
70
+ x_txt = layers.Embedding(10000, 128)(text_input)
71
+ x_txt = layers.Bidirectional(layers.LSTM(64))(x_txt)
72
+
73
+ # Merge
74
+ merged = layers.Concatenate()([x_img, x_txt])
75
+ merged = layers.Dense(256, activation="relu")(merged)
76
+ merged = layers.Dropout(0.4)(merged)
77
+ output = layers.Dense(5, activation="softmax", name="classification")(merged)
78
+
79
+ model = keras.Model(inputs=[image_input, text_input], outputs=output)
80
+ ```
81
+
82
+ ## Data Pipeline with tf.data
83
+
84
+ Efficient data loading is critical for GPU utilization in research experiments:
85
+
86
+ ```python
87
+ def build_dataset(file_pattern, batch_size=32, training=True):
88
+ """Build a tf.data pipeline with augmentation for research experiments."""
89
+ dataset = tf.data.Dataset.list_files(file_pattern, shuffle=training)
90
+
91
+ def parse_image(path):
92
+ img = tf.io.read_file(path)
93
+ img = tf.image.decode_jpeg(img, channels=3)
94
+ img = tf.image.resize(img, [256, 256])
95
+ label = tf.strings.split(path, os.sep)[-2]
96
+ return img, label
97
+
98
+ dataset = dataset.map(parse_image, num_parallel_calls=tf.data.AUTOTUNE)
99
+
100
+ if training:
101
+ dataset = dataset.shuffle(1000)
102
+ dataset = dataset.map(
103
+ lambda x, y: (tf.image.random_flip_left_right(x), y),
104
+ num_parallel_calls=tf.data.AUTOTUNE,
105
+ )
106
+
107
+ dataset = dataset.batch(batch_size)
108
+ dataset = dataset.prefetch(tf.data.AUTOTUNE)
109
+ return dataset
110
+ ```
111
+
112
+ ## Training and Callback Orchestration
113
+
114
+ ### Reproducible Training Setup
115
+
116
+ ```python
117
+ import os
118
+ import random
119
+ import numpy as np
120
+
121
+ def set_seed(seed=42):
122
+ """Ensure reproducibility across runs for paper results."""
123
+ os.environ["PYTHONHASHSEED"] = str(seed)
124
+ random.seed(seed)
125
+ np.random.seed(seed)
126
+ tf.random.set_seed(seed)
127
+
128
+ set_seed(42)
129
+
130
+ callbacks = [
131
+ keras.callbacks.ModelCheckpoint(
132
+ "best_model.keras", monitor="val_loss", save_best_only=True
133
+ ),
134
+ keras.callbacks.EarlyStopping(
135
+ monitor="val_loss", patience=10, restore_best_weights=True
136
+ ),
137
+ keras.callbacks.ReduceLROnPlateau(
138
+ monitor="val_loss", factor=0.5, patience=5, min_lr=1e-6
139
+ ),
140
+ keras.callbacks.TensorBoard(log_dir="./logs", histogram_freq=1),
141
+ keras.callbacks.CSVLogger("training_log.csv"),
142
+ ]
143
+
144
+ history = model.fit(
145
+ train_dataset,
146
+ validation_data=val_dataset,
147
+ epochs=100,
148
+ callbacks=callbacks,
149
+ )
150
+ ```
151
+
152
+ ### Custom Training Loop for Research
153
+
154
+ ```python
155
+ @tf.function
156
+ def train_step(model, optimizer, x, y, loss_fn):
157
+ with tf.GradientTape() as tape:
158
+ predictions = model(x, training=True)
159
+ loss = loss_fn(y, predictions)
160
+ gradients = tape.gradient(loss, model.trainable_variables)
161
+ optimizer.apply_gradients(zip(gradients, model.trainable_variables))
162
+ return loss
163
+
164
+ # Custom metric tracking
165
+ train_loss = keras.metrics.Mean(name="train_loss")
166
+ for epoch in range(num_epochs):
167
+ train_loss.reset_state()
168
+ for x_batch, y_batch in train_dataset:
169
+ loss = train_step(model, optimizer, x_batch, y_batch, loss_fn)
170
+ train_loss.update_state(loss)
171
+ print(f"Epoch {epoch+1}, Loss: {train_loss.result():.4f}")
172
+ ```
173
+
174
+ ## Debugging and Common Pitfalls
175
+
176
+ | Issue | Symptom | Solution |
177
+ |-------|---------|----------|
178
+ | Exploding gradients | Loss becomes NaN | Add gradient clipping, reduce learning rate |
179
+ | Overfitting | Val loss diverges from train loss | Add Dropout, data augmentation, weight decay |
180
+ | Underfitting | Both losses plateau high | Increase model capacity, reduce regularization |
181
+ | Slow training | Low GPU utilization | Use tf.data with prefetch, increase batch size |
182
+ | Memory errors | OOM on GPU | Reduce batch size, use mixed precision |
183
+ | Non-deterministic results | Different results per run | Call `set_seed()`, set `TF_DETERMINISTIC_OPS=1` |
184
+
185
+ ### Mixed Precision Training
186
+
187
+ ```python
188
+ # Enable mixed precision for 2x speedup on modern GPUs
189
+ keras.mixed_precision.set_global_policy("mixed_float16")
190
+
191
+ # Ensure the output layer uses float32 for numerical stability
192
+ output = layers.Dense(10, activation="softmax", dtype="float32")(x)
193
+ ```
194
+
195
+ ## Best Practices for Research
196
+
197
+ - **Version pin everything.** Record `tensorflow`, `keras`, `numpy`, and `cuda` versions in your paper appendix.
198
+ - **Use `keras.utils.set_random_seed(42)`** for full determinism (TF 2.12+).
199
+ - **Save models in `.keras` format** (not HDF5) for forward compatibility.
200
+ - **Profile with TensorBoard** to identify data pipeline bottlenecks before scaling up.
201
+ - **Use `tf.debugging.enable_check_numerics()`** during development to catch NaN/Inf early.
202
+ - **Export with `tf.saved_model`** for deployment; export ONNX for cross-framework comparison.
203
+
204
+ ## References
205
+
206
+ - [Deep Learning with Python, 2nd Edition](https://www.manning.com/books/deep-learning-with-python-second-edition) -- Francois Chollet (Keras creator)
207
+ - [Keras documentation](https://keras.io/) -- Official API reference and guides
208
+ - [TensorFlow tutorials](https://www.tensorflow.org/tutorials) -- End-to-end examples
209
+ - [fchollet/deep-learning-with-python-notebooks](https://github.com/fchollet/deep-learning-with-python-notebooks) -- Code companion to the book
210
+ - [Keras examples gallery](https://keras.io/examples/) -- 100+ community-contributed examples