@wentorai/research-plugins 1.0.0 → 1.2.0

Files changed (415)
  1. package/README.md +22 -22
  2. package/curated/analysis/README.md +82 -56
  3. package/curated/domains/README.md +225 -69
  4. package/curated/literature/README.md +115 -46
  5. package/curated/research/README.md +106 -58
  6. package/curated/tools/README.md +107 -87
  7. package/curated/writing/README.md +92 -45
  8. package/mcp-configs/academic-db/alphafold-mcp.json +20 -0
  9. package/mcp-configs/academic-db/brightspace-mcp.json +21 -0
  10. package/mcp-configs/academic-db/climatiq-mcp.json +20 -0
  11. package/mcp-configs/academic-db/gibs-mcp.json +20 -0
  12. package/mcp-configs/academic-db/gis-mcp-server.json +22 -0
  13. package/mcp-configs/academic-db/google-earth-engine-mcp.json +21 -0
  14. package/mcp-configs/academic-db/m4-clinical-mcp.json +21 -0
  15. package/mcp-configs/academic-db/medical-mcp.json +21 -0
  16. package/mcp-configs/academic-db/nexonco-mcp.json +20 -0
  17. package/mcp-configs/academic-db/omop-mcp.json +20 -0
  18. package/mcp-configs/academic-db/onekgpd-mcp.json +20 -0
  19. package/mcp-configs/academic-db/openedu-mcp.json +20 -0
  20. package/mcp-configs/academic-db/opengenes-mcp.json +20 -0
  21. package/mcp-configs/academic-db/openstax-mcp.json +21 -0
  22. package/mcp-configs/academic-db/openstreetmap-mcp.json +21 -0
  23. package/mcp-configs/academic-db/opentargets-mcp.json +21 -0
  24. package/mcp-configs/academic-db/pdb-mcp.json +21 -0
  25. package/mcp-configs/academic-db/smithsonian-mcp.json +20 -0
  26. package/mcp-configs/ai-platform/magi-researchers.json +21 -0
  27. package/mcp-configs/ai-platform/mcp-academic-researcher.json +22 -0
  28. package/mcp-configs/ai-platform/open-paper-machine.json +21 -0
  29. package/mcp-configs/ai-platform/paper-intelligence.json +21 -0
  30. package/mcp-configs/ai-platform/paper-reader.json +21 -0
  31. package/mcp-configs/ai-platform/paperdebugger.json +21 -0
  32. package/mcp-configs/browser/exa-mcp.json +20 -0
  33. package/mcp-configs/browser/mcp-searxng.json +21 -0
  34. package/mcp-configs/browser/mcp-webresearch.json +20 -0
  35. package/mcp-configs/cloud-docs/confluence-mcp.json +37 -0
  36. package/mcp-configs/cloud-docs/google-drive-mcp.json +35 -0
  37. package/mcp-configs/cloud-docs/notion-mcp.json +29 -0
  38. package/mcp-configs/communication/discord-mcp.json +29 -0
  39. package/mcp-configs/communication/discourse-mcp.json +21 -0
  40. package/mcp-configs/communication/slack-mcp.json +29 -0
  41. package/mcp-configs/communication/telegram-mcp.json +28 -0
  42. package/mcp-configs/data-platform/automl-stat-mcp.json +21 -0
  43. package/mcp-configs/data-platform/jefferson-stats-mcp.json +22 -0
  44. package/mcp-configs/data-platform/mcp-excel-server.json +21 -0
  45. package/mcp-configs/data-platform/mcp-stata.json +21 -0
  46. package/mcp-configs/data-platform/mcpstack-jupyter.json +21 -0
  47. package/mcp-configs/data-platform/ml-mcp.json +21 -0
  48. package/mcp-configs/data-platform/nasdaq-data-link-mcp.json +20 -0
  49. package/mcp-configs/data-platform/numpy-mcp.json +21 -0
  50. package/mcp-configs/database/neo4j-mcp.json +37 -0
  51. package/mcp-configs/database/postgres-mcp.json +28 -0
  52. package/mcp-configs/database/sqlite-mcp.json +29 -0
  53. package/mcp-configs/dev-platform/geogebra-mcp.json +21 -0
  54. package/mcp-configs/dev-platform/github-mcp.json +31 -0
  55. package/mcp-configs/dev-platform/gitlab-mcp.json +34 -0
  56. package/mcp-configs/dev-platform/latex-mcp-server.json +21 -0
  57. package/mcp-configs/dev-platform/manim-mcp.json +20 -0
  58. package/mcp-configs/dev-platform/mcp-echarts.json +20 -0
  59. package/mcp-configs/dev-platform/panel-viz-mcp.json +20 -0
  60. package/mcp-configs/dev-platform/paperbanana.json +20 -0
  61. package/mcp-configs/dev-platform/texflow-mcp.json +20 -0
  62. package/mcp-configs/dev-platform/texmcp.json +20 -0
  63. package/mcp-configs/dev-platform/typst-mcp.json +21 -0
  64. package/mcp-configs/dev-platform/vizro-mcp.json +20 -0
  65. package/mcp-configs/email/email-mcp.json +40 -0
  66. package/mcp-configs/email/gmail-mcp.json +37 -0
  67. package/mcp-configs/note-knowledge/local-faiss-mcp.json +21 -0
  68. package/mcp-configs/note-knowledge/mcp-memory-service.json +21 -0
  69. package/mcp-configs/note-knowledge/mcp-obsidian.json +23 -0
  70. package/mcp-configs/note-knowledge/mcp-ragdocs.json +20 -0
  71. package/mcp-configs/note-knowledge/mcp-summarizer.json +21 -0
  72. package/mcp-configs/note-knowledge/mediawiki-mcp.json +21 -0
  73. package/mcp-configs/note-knowledge/openzim-mcp.json +20 -0
  74. package/mcp-configs/note-knowledge/zettelkasten-mcp.json +21 -0
  75. package/mcp-configs/reference-mgr/academic-paper-mcp-http.json +20 -0
  76. package/mcp-configs/reference-mgr/academix.json +20 -0
  77. package/mcp-configs/reference-mgr/arxiv-research-mcp.json +21 -0
  78. package/mcp-configs/reference-mgr/google-scholar-abstract-mcp.json +19 -0
  79. package/mcp-configs/reference-mgr/google-scholar-mcp.json +20 -0
  80. package/mcp-configs/reference-mgr/mcp-paperswithcode.json +21 -0
  81. package/mcp-configs/reference-mgr/mcp-scholarly.json +20 -0
  82. package/mcp-configs/reference-mgr/mcp-simple-arxiv.json +20 -0
  83. package/mcp-configs/reference-mgr/mcp-simple-pubmed.json +20 -0
  84. package/mcp-configs/reference-mgr/mcp-zotero.json +21 -0
  85. package/mcp-configs/reference-mgr/mendeley-mcp.json +20 -0
  86. package/mcp-configs/reference-mgr/ncbi-mcp-server.json +22 -0
  87. package/mcp-configs/reference-mgr/onecite.json +21 -0
  88. package/mcp-configs/reference-mgr/paper-search-mcp.json +21 -0
  89. package/mcp-configs/reference-mgr/pubmed-search-mcp.json +21 -0
  90. package/mcp-configs/reference-mgr/scholar-mcp.json +21 -0
  91. package/mcp-configs/reference-mgr/scholar-multi-mcp.json +21 -0
  92. package/mcp-configs/reference-mgr/seerai.json +21 -0
  93. package/mcp-configs/reference-mgr/semantic-scholar-fastmcp.json +21 -0
  94. package/mcp-configs/reference-mgr/sourcelibrary.json +20 -0
  95. package/mcp-configs/registry.json +178 -149
  96. package/mcp-configs/repository/dataverse-mcp.json +33 -0
  97. package/mcp-configs/repository/huggingface-mcp.json +29 -0
  98. package/openclaw.plugin.json +2 -2
  99. package/package.json +2 -2
  100. package/skills/analysis/dataviz/algorithm-visualizer-guide/SKILL.md +259 -0
  101. package/skills/analysis/dataviz/bokeh-visualization-guide/SKILL.md +270 -0
  102. package/skills/analysis/dataviz/chart-image-generator/SKILL.md +229 -0
  103. package/skills/analysis/dataviz/citation-map-guide/SKILL.md +184 -0
  104. package/skills/analysis/dataviz/d3-visualization-guide/SKILL.md +281 -0
  105. package/skills/analysis/dataviz/data-visualization-principles/SKILL.md +171 -0
  106. package/skills/analysis/dataviz/echarts-visualization-guide/SKILL.md +250 -0
  107. package/skills/analysis/dataviz/metabase-analytics-guide/SKILL.md +242 -0
  108. package/skills/analysis/dataviz/plotly-interactive-guide/SKILL.md +266 -0
  109. package/skills/analysis/dataviz/redash-analytics-guide/SKILL.md +284 -0
  110. package/skills/analysis/econometrics/econml-causal-guide/SKILL.md +163 -0
  111. package/skills/analysis/econometrics/empirical-paper-analysis/SKILL.md +192 -0
  112. package/skills/analysis/econometrics/mostly-harmless-guide/SKILL.md +139 -0
  113. package/skills/analysis/econometrics/panel-data-analyst/SKILL.md +259 -0
  114. package/skills/analysis/econometrics/panel-data-regression-workflow/SKILL.md +267 -0
  115. package/skills/analysis/econometrics/python-causality-guide/SKILL.md +134 -0
  116. package/skills/analysis/econometrics/stata-accounting-guide/SKILL.md +269 -0
  117. package/skills/analysis/econometrics/stata-analyst-guide/SKILL.md +245 -0
  118. package/skills/analysis/econometrics/stata-reference-guide/SKILL.md +293 -0
  119. package/skills/analysis/statistics/data-anomaly-detection/SKILL.md +157 -0
  120. package/skills/analysis/statistics/general-statistics-guide/SKILL.md +226 -0
  121. package/skills/analysis/statistics/infiagent-benchmark-guide/SKILL.md +106 -0
  122. package/skills/analysis/statistics/ml-experiment-tracker/SKILL.md +212 -0
  123. package/skills/analysis/statistics/pywayne-statistics-guide/SKILL.md +192 -0
  124. package/skills/analysis/statistics/quantitative-methods-guide/SKILL.md +193 -0
  125. package/skills/analysis/statistics/senior-data-scientist-guide/SKILL.md +223 -0
  126. package/skills/analysis/wrangling/claude-data-analysis-guide/SKILL.md +100 -0
  127. package/skills/analysis/wrangling/csv-data-analyzer/SKILL.md +170 -0
  128. package/skills/analysis/wrangling/data-cleaning-pipeline/SKILL.md +266 -0
  129. package/skills/analysis/wrangling/data-cog-guide/SKILL.md +178 -0
  130. package/skills/analysis/wrangling/open-data-scientist-guide/SKILL.md +197 -0
  131. package/skills/analysis/wrangling/stata-data-cleaning/SKILL.md +276 -0
  132. package/skills/analysis/wrangling/streamline-analyst-guide/SKILL.md +119 -0
  133. package/skills/analysis/wrangling/survey-data-processing/SKILL.md +298 -0
  134. package/skills/domains/ai-ml/ai-agent-papers-guide/SKILL.md +146 -0
  135. package/skills/domains/ai-ml/ai-model-benchmarking/SKILL.md +209 -0
  136. package/skills/domains/ai-ml/annotated-dl-papers-guide/SKILL.md +159 -0
  137. package/skills/domains/ai-ml/anomaly-detection-papers-guide/SKILL.md +167 -0
  138. package/skills/domains/ai-ml/autonomous-agents-papers-guide/SKILL.md +178 -0
  139. package/skills/domains/ai-ml/dl-transformer-finetune/SKILL.md +239 -0
  140. package/skills/domains/ai-ml/domain-adaptation-papers-guide/SKILL.md +173 -0
  141. package/skills/domains/ai-ml/generative-ai-guide/SKILL.md +146 -0
  142. package/skills/domains/ai-ml/graph-learning-papers-guide/SKILL.md +125 -0
  143. package/skills/domains/ai-ml/huggingface-inference-guide/SKILL.md +196 -0
  144. package/skills/domains/ai-ml/keras-deep-learning/SKILL.md +210 -0
  145. package/skills/domains/ai-ml/kolmogorov-arnold-networks-guide/SKILL.md +185 -0
  146. package/skills/domains/ai-ml/llm-from-scratch-guide/SKILL.md +124 -0
  147. package/skills/domains/ai-ml/ml-pipeline-guide/SKILL.md +295 -0
  148. package/skills/domains/ai-ml/nlp-toolkit-guide/SKILL.md +247 -0
  149. package/skills/domains/ai-ml/npcpy-research-guide/SKILL.md +137 -0
  150. package/skills/domains/ai-ml/pytorch-guide/SKILL.md +281 -0
  151. package/skills/domains/ai-ml/pytorch-lightning-guide/SKILL.md +244 -0
  152. package/skills/domains/ai-ml/responsible-ai-guide/SKILL.md +126 -0
  153. package/skills/domains/ai-ml/tensorflow-guide/SKILL.md +241 -0
  154. package/skills/domains/ai-ml/vmas-simulator-guide/SKILL.md +129 -0
  155. package/skills/domains/biomedical/bioagents-guide/SKILL.md +308 -0
  156. package/skills/domains/biomedical/clawbio-guide/SKILL.md +167 -0
  157. package/skills/domains/biomedical/clinical-dialogue-agents-guide/SKILL.md +145 -0
  158. package/skills/domains/biomedical/ena-sequence-api/SKILL.md +175 -0
  159. package/skills/domains/biomedical/genomas-guide/SKILL.md +126 -0
  160. package/skills/domains/biomedical/genotex-benchmark-guide/SKILL.md +125 -0
  161. package/skills/domains/biomedical/med-researcher-guide/SKILL.md +161 -0
  162. package/skills/domains/biomedical/med-researcher-r1-guide/SKILL.md +146 -0
  163. package/skills/domains/biomedical/medgeclaw-guide/SKILL.md +345 -0
  164. package/skills/domains/biomedical/medical-imaging-guide/SKILL.md +305 -0
  165. package/skills/domains/biomedical/ncbi-blast-api/SKILL.md +195 -0
  166. package/skills/domains/biomedical/ncbi-datasets-api/SKILL.md +220 -0
  167. package/skills/domains/biomedical/quickgo-api/SKILL.md +181 -0
  168. package/skills/domains/business/architecture-design-guide/SKILL.md +279 -0
  169. package/skills/domains/business/innovation-management-guide/SKILL.md +257 -0
  170. package/skills/domains/business/operations-research-guide/SKILL.md +258 -0
  171. package/skills/domains/business/xpert-bi-guide/SKILL.md +84 -0
  172. package/skills/domains/chemistry/cactus-cheminformatics-guide/SKILL.md +89 -0
  173. package/skills/domains/chemistry/chemeagle-guide/SKILL.md +147 -0
  174. package/skills/domains/chemistry/chemgraph-agent-guide/SKILL.md +120 -0
  175. package/skills/domains/chemistry/molecular-dynamics-guide/SKILL.md +237 -0
  176. package/skills/domains/chemistry/pubchem-api-guide/SKILL.md +180 -0
  177. package/skills/domains/chemistry/spectroscopy-analysis-guide/SKILL.md +290 -0
  178. package/skills/domains/cs/ai-security-papers-guide/SKILL.md +103 -0
  179. package/skills/domains/cs/code-llm-papers-guide/SKILL.md +131 -0
  180. package/skills/domains/cs/distributed-systems-guide/SKILL.md +268 -0
  181. package/skills/domains/cs/formal-verification-guide/SKILL.md +298 -0
  182. package/skills/domains/cs/gaussian-splatting-papers-guide/SKILL.md +158 -0
  183. package/skills/domains/cs/llm-aiops-guide/SKILL.md +70 -0
  184. package/skills/domains/cs/software-heritage-api/SKILL.md +200 -0
  185. package/skills/domains/ecology/species-distribution-guide/SKILL.md +343 -0
  186. package/skills/domains/economics/imf-data-api-guide/SKILL.md +174 -0
  187. package/skills/domains/economics/nber-working-papers-api/SKILL.md +177 -0
  188. package/skills/domains/economics/post-labor-economics/SKILL.md +254 -0
  189. package/skills/domains/economics/pricing-psychology-guide/SKILL.md +273 -0
  190. package/skills/domains/economics/repec-economics-api/SKILL.md +188 -0
  191. package/skills/domains/economics/world-bank-data-guide/SKILL.md +179 -0
  192. package/skills/domains/education/academic-study-methods/SKILL.md +228 -0
  193. package/skills/domains/education/assessment-design-guide/SKILL.md +213 -0
  194. package/skills/domains/education/educational-research-methods/SKILL.md +179 -0
  195. package/skills/domains/education/edumcp-guide/SKILL.md +74 -0
  196. package/skills/domains/education/mooc-analytics-guide/SKILL.md +206 -0
  197. package/skills/domains/education/open-syllabus-api/SKILL.md +171 -0
  198. package/skills/domains/finance/akshare-finance-data/SKILL.md +207 -0
  199. package/skills/domains/finance/finsight-research-guide/SKILL.md +113 -0
  200. package/skills/domains/finance/options-analytics-agent-guide/SKILL.md +117 -0
  201. package/skills/domains/finance/portfolio-optimization-guide/SKILL.md +279 -0
  202. package/skills/domains/finance/risk-modeling-guide/SKILL.md +260 -0
  203. package/skills/domains/finance/stata-accounting-research/SKILL.md +372 -0
  204. package/skills/domains/geoscience/climate-modeling-guide/SKILL.md +215 -0
  205. package/skills/domains/geoscience/pangaea-data-api/SKILL.md +197 -0
  206. package/skills/domains/geoscience/satellite-remote-sensing/SKILL.md +193 -0
  207. package/skills/domains/geoscience/seismology-data-guide/SKILL.md +208 -0
  208. package/skills/domains/humanities/digital-humanities-methods/SKILL.md +232 -0
  209. package/skills/domains/humanities/ethical-philosophy-guide/SKILL.md +244 -0
  210. package/skills/domains/humanities/history-research-guide/SKILL.md +260 -0
  211. package/skills/domains/humanities/political-history-guide/SKILL.md +241 -0
  212. package/skills/domains/law/caselaw-access-api/SKILL.md +149 -0
  213. package/skills/domains/law/legal-agent-skills-guide/SKILL.md +132 -0
  214. package/skills/domains/law/legal-nlp-guide/SKILL.md +236 -0
  215. package/skills/domains/law/legal-research-methods/SKILL.md +190 -0
  216. package/skills/domains/law/opencontracts-guide/SKILL.md +168 -0
  217. package/skills/domains/law/patent-analysis-guide/SKILL.md +257 -0
  218. package/skills/domains/law/regulatory-compliance-guide/SKILL.md +267 -0
  219. package/skills/domains/math/lean-theorem-proving-guide/SKILL.md +140 -0
  220. package/skills/domains/math/symbolic-computation-guide/SKILL.md +263 -0
  221. package/skills/domains/math/topology-data-analysis/SKILL.md +305 -0
  222. package/skills/domains/pharma/clinical-trial-design-guide/SKILL.md +271 -0
  223. package/skills/domains/pharma/drug-target-interaction/SKILL.md +242 -0
  224. package/skills/domains/pharma/madd-drug-discovery-guide/SKILL.md +153 -0
  225. package/skills/domains/pharma/pharmacovigilance-guide/SKILL.md +216 -0
  226. package/skills/domains/physics/astrophysics-data-guide/SKILL.md +305 -0
  227. package/skills/domains/physics/particle-physics-guide/SKILL.md +287 -0
  228. package/skills/domains/social-science/ipums-microdata-api/SKILL.md +211 -0
  229. package/skills/domains/social-science/network-analysis-guide/SKILL.md +310 -0
  230. package/skills/domains/social-science/psychology-research-guide/SKILL.md +270 -0
  231. package/skills/domains/social-science/sociology-research-guide/SKILL.md +238 -0
  232. package/skills/domains/social-science/sociology-research-methods/SKILL.md +181 -0
  233. package/skills/literature/discovery/arxiv-paper-monitoring/SKILL.md +233 -0
  234. package/skills/literature/discovery/paper-recommendation-guide/SKILL.md +120 -0
  235. package/skills/literature/discovery/papers-we-love-guide/SKILL.md +169 -0
  236. package/skills/literature/discovery/semantic-paper-radar/SKILL.md +144 -0
  237. package/skills/literature/discovery/zotero-arxiv-daily-guide/SKILL.md +94 -0
  238. package/skills/literature/fulltext/bioc-pmc-api/SKILL.md +146 -0
  239. package/skills/literature/fulltext/core-api-guide/SKILL.md +144 -0
  240. package/skills/literature/fulltext/dataverse-api/SKILL.md +215 -0
  241. package/skills/literature/fulltext/hal-archive-api/SKILL.md +218 -0
  242. package/skills/literature/fulltext/institutional-repository-guide/SKILL.md +212 -0
  243. package/skills/literature/fulltext/open-access-mining-guide/SKILL.md +341 -0
  244. package/skills/literature/fulltext/osf-api/SKILL.md +212 -0
  245. package/skills/literature/fulltext/pmc-ftp-bulk-download/SKILL.md +182 -0
  246. package/skills/literature/fulltext/zotero-ai-butler-guide/SKILL.md +166 -0
  247. package/skills/literature/fulltext/zotero-scihub-guide/SKILL.md +168 -0
  248. package/skills/literature/metadata/academic-paper-summarizer/SKILL.md +101 -0
  249. package/skills/literature/metadata/bibliometrix-guide/SKILL.md +164 -0
  250. package/skills/literature/metadata/crossref-event-data-api/SKILL.md +183 -0
  251. package/skills/literature/metadata/doi-content-negotiation/SKILL.md +202 -0
  252. package/skills/literature/metadata/orkg-api/SKILL.md +153 -0
  253. package/skills/literature/metadata/plumx-metrics-api/SKILL.md +188 -0
  254. package/skills/literature/metadata/ror-organization-api/SKILL.md +208 -0
  255. package/skills/literature/metadata/sophosia-reference-guide/SKILL.md +110 -0
  256. package/skills/literature/metadata/viaf-authority-api/SKILL.md +209 -0
  257. package/skills/literature/metadata/wikidata-api-guide/SKILL.md +156 -0
  258. package/skills/literature/metadata/zoplicate-dedup-guide/SKILL.md +147 -0
  259. package/skills/literature/metadata/zotero-actions-tags-guide/SKILL.md +212 -0
  260. package/skills/literature/metadata/zotmoov-guide/SKILL.md +120 -0
  261. package/skills/literature/metadata/zutilo-guide/SKILL.md +140 -0
  262. package/skills/literature/search/arxiv-batch-reporting/SKILL.md +133 -0
  263. package/skills/literature/search/arxiv-cli-tools/SKILL.md +172 -0
  264. package/skills/literature/search/arxiv-osiris/SKILL.md +199 -0
  265. package/skills/literature/search/arxiv-paper-processor/SKILL.md +141 -0
  266. package/skills/literature/search/baidu-scholar-guide/SKILL.md +110 -0
  267. package/skills/literature/search/base-academic-search/SKILL.md +196 -0
  268. package/skills/literature/search/chatpaper-guide/SKILL.md +122 -0
  269. package/skills/literature/search/citeseerx-api/SKILL.md +183 -0
  270. package/skills/literature/search/deep-literature-search/SKILL.md +149 -0
  271. package/skills/literature/search/deepgit-search-guide/SKILL.md +147 -0
  272. package/skills/literature/search/eric-education-api/SKILL.md +199 -0
  273. package/skills/literature/search/findpapers-guide/SKILL.md +177 -0
  274. package/skills/literature/search/ieee-xplore-api/SKILL.md +177 -0
  275. package/skills/literature/search/lens-scholarly-api/SKILL.md +211 -0
  276. package/skills/literature/search/multi-database-literature-search/SKILL.md +198 -0
  277. package/skills/literature/search/open-library-api/SKILL.md +196 -0
  278. package/skills/literature/search/open-semantic-search-guide/SKILL.md +190 -0
  279. package/skills/literature/search/openaire-api/SKILL.md +141 -0
  280. package/skills/literature/search/paper-search-mcp-guide/SKILL.md +107 -0
  281. package/skills/literature/search/papers-chat-guide/SKILL.md +194 -0
  282. package/skills/literature/search/pasa-paper-search-guide/SKILL.md +138 -0
  283. package/skills/literature/search/plos-open-access-api/SKILL.md +203 -0
  284. package/skills/literature/search/scielo-api/SKILL.md +182 -0
  285. package/skills/literature/search/share-research-api/SKILL.md +129 -0
  286. package/skills/literature/search/worldcat-search-api/SKILL.md +224 -0
  287. package/skills/research/automation/ai-scientist-v2-guide/SKILL.md +284 -0
  288. package/skills/research/automation/aim-experiment-guide/SKILL.md +234 -0
  289. package/skills/research/automation/claude-academic-workflow-guide/SKILL.md +202 -0
  290. package/skills/research/automation/coexist-ai-guide/SKILL.md +149 -0
  291. package/skills/research/automation/datagen-research-guide/SKILL.md +131 -0
  292. package/skills/research/automation/foam-agent-guide/SKILL.md +203 -0
  293. package/skills/research/automation/kedro-pipeline-guide/SKILL.md +216 -0
  294. package/skills/research/automation/mle-agent-guide/SKILL.md +139 -0
  295. package/skills/research/automation/paper-to-agent-guide/SKILL.md +116 -0
  296. package/skills/research/automation/rd-agent-guide/SKILL.md +246 -0
  297. package/skills/research/automation/research-paper-orchestrator/SKILL.md +254 -0
  298. package/skills/research/deep-research/academic-deep-research/SKILL.md +190 -0
  299. package/skills/research/deep-research/auto-deep-research-guide/SKILL.md +141 -0
  300. package/skills/research/deep-research/cognitive-kernel-guide/SKILL.md +200 -0
  301. package/skills/research/deep-research/corvus-research-guide/SKILL.md +132 -0
  302. package/skills/research/deep-research/deep-research-pro/SKILL.md +213 -0
  303. package/skills/research/deep-research/deep-research-work/SKILL.md +204 -0
  304. package/skills/research/deep-research/deep-searcher-guide/SKILL.md +253 -0
  305. package/skills/research/deep-research/gpt-researcher-guide/SKILL.md +191 -0
  306. package/skills/research/deep-research/in-depth-research-guide/SKILL.md +205 -0
  307. package/skills/research/deep-research/khoj-research-guide/SKILL.md +200 -0
  308. package/skills/research/deep-research/kosmos-scientist-guide/SKILL.md +185 -0
  309. package/skills/research/deep-research/llm-scientific-discovery-guide/SKILL.md +178 -0
  310. package/skills/research/deep-research/local-deep-research-guide/SKILL.md +253 -0
  311. package/skills/research/deep-research/open-researcher-guide/SKILL.md +138 -0
  312. package/skills/research/deep-research/tongyi-deep-research-guide/SKILL.md +217 -0
  313. package/skills/research/funding/eu-horizon-guide/SKILL.md +244 -0
  314. package/skills/research/funding/grant-budget-guide/SKILL.md +284 -0
  315. package/skills/research/funding/nih-reporter-api-guide/SKILL.md +166 -0
  316. package/skills/research/funding/nsf-award-api-guide/SKILL.md +133 -0
  317. package/skills/research/methodology/academic-mentor-guide/SKILL.md +169 -0
  318. package/skills/research/methodology/claude-scientific-guide/SKILL.md +122 -0
  319. package/skills/research/methodology/deep-innovator-guide/SKILL.md +242 -0
  320. package/skills/research/methodology/osf-api-guide/SKILL.md +165 -0
  321. package/skills/research/methodology/parsifal-slr-guide/SKILL.md +154 -0
  322. package/skills/research/methodology/research-paper-kb/SKILL.md +263 -0
  323. package/skills/research/methodology/research-pipeline-units-guide/SKILL.md +169 -0
  324. package/skills/research/methodology/research-town-guide/SKILL.md +263 -0
  325. package/skills/research/methodology/slr-automation-guide/SKILL.md +235 -0
  326. package/skills/research/paper-review/automated-review-guide/SKILL.md +281 -0
  327. package/skills/research/paper-review/latte-review-guide/SKILL.md +175 -0
  328. package/skills/research/paper-review/paper-compare-guide/SKILL.md +238 -0
  329. package/skills/research/paper-review/paper-critique-framework/SKILL.md +181 -0
  330. package/skills/research/paper-review/paper-digest-guide/SKILL.md +240 -0
  331. package/skills/research/paper-review/paper-research-assistant/SKILL.md +231 -0
  332. package/skills/research/paper-review/research-quality-filter/SKILL.md +261 -0
  333. package/skills/research/paper-review/review-response-guide/SKILL.md +275 -0
  334. package/skills/tools/code-exec/contextplus-mcp-guide/SKILL.md +110 -0
  335. package/skills/tools/code-exec/google-colab-guide/SKILL.md +276 -0
  336. package/skills/tools/code-exec/kaggle-api-guide/SKILL.md +216 -0
  337. package/skills/tools/code-exec/overleaf-cli-guide/SKILL.md +279 -0
  338. package/skills/tools/diagram/clawphd-guide/SKILL.md +149 -0
  339. package/skills/tools/diagram/code-flow-visualizer/SKILL.md +197 -0
  340. package/skills/tools/diagram/excalidraw-diagram-guide/SKILL.md +170 -0
  341. package/skills/tools/diagram/json-data-visualizer/SKILL.md +270 -0
  342. package/skills/tools/diagram/kroki-diagram-api/SKILL.md +198 -0
  343. package/skills/tools/diagram/mermaid-architect-guide/SKILL.md +219 -0
  344. package/skills/tools/diagram/scientific-graphical-abstract/SKILL.md +201 -0
  345. package/skills/tools/diagram/tldraw-whiteboard-guide/SKILL.md +397 -0
  346. package/skills/tools/document/docsgpt-guide/SKILL.md +130 -0
  347. package/skills/tools/document/large-document-reader/SKILL.md +202 -0
  348. package/skills/tools/document/md2pdf-xelatex/SKILL.md +212 -0
  349. package/skills/tools/document/openpaper-guide/SKILL.md +232 -0
  350. package/skills/tools/document/paper-parse-guide/SKILL.md +243 -0
  351. package/skills/tools/document/weknora-guide/SKILL.md +216 -0
  352. package/skills/tools/document/zotero-addon-market-guide/SKILL.md +108 -0
  353. package/skills/tools/document/zotero-night-theme-guide/SKILL.md +142 -0
  354. package/skills/tools/document/zotero-style-guide/SKILL.md +217 -0
  355. package/skills/tools/knowledge-graph/citation-network-builder/SKILL.md +244 -0
  356. package/skills/tools/knowledge-graph/concept-map-generator/SKILL.md +284 -0
  357. package/skills/tools/knowledge-graph/graphiti-guide/SKILL.md +219 -0
  358. package/skills/tools/knowledge-graph/mimir-memory-guide/SKILL.md +135 -0
  359. package/skills/tools/knowledge-graph/notero-zotero-notion-guide/SKILL.md +187 -0
  360. package/skills/tools/knowledge-graph/open-webui-tools-guide/SKILL.md +156 -0
  361. package/skills/tools/knowledge-graph/openspg-guide/SKILL.md +210 -0
  362. package/skills/tools/knowledge-graph/paperpile-notion-guide/SKILL.md +84 -0
  363. package/skills/tools/knowledge-graph/zotero-markdb-connect-guide/SKILL.md +162 -0
  364. package/skills/tools/ocr-translate/latex-translation-guide/SKILL.md +176 -0
  365. package/skills/tools/ocr-translate/math-equation-renderer/SKILL.md +198 -0
  366. package/skills/tools/ocr-translate/pdf-math-translate-guide/SKILL.md +141 -0
  367. package/skills/tools/ocr-translate/zotero-pdf-translate-guide/SKILL.md +95 -0
  368. package/skills/tools/ocr-translate/zotero-pdf2zh-guide/SKILL.md +143 -0
  369. package/skills/tools/scraping/dataset-finder-guide/SKILL.md +253 -0
  370. package/skills/tools/scraping/easy-spider-guide/SKILL.md +250 -0
  371. package/skills/tools/scraping/google-scholar-scraper/SKILL.md +255 -0
  372. package/skills/tools/scraping/repository-harvesting-guide/SKILL.md +310 -0
  373. package/skills/writing/citation/academic-citation-manager/SKILL.md +314 -0
  374. package/skills/writing/citation/academic-citation-manager-guide/SKILL.md +182 -0
  375. package/skills/writing/citation/citation-assistant-skill/SKILL.md +192 -0
  376. package/skills/writing/citation/jabref-reference-guide/SKILL.md +127 -0
  377. package/skills/writing/citation/jasminum-zotero-guide/SKILL.md +103 -0
  378. package/skills/writing/citation/mendeley-api/SKILL.md +231 -0
  379. package/skills/writing/citation/obsidian-citation-guide/SKILL.md +164 -0
  380. package/skills/writing/citation/obsidian-zotero-guide/SKILL.md +137 -0
  381. package/skills/writing/citation/onecite-reference-guide/SKILL.md +168 -0
  382. package/skills/writing/citation/papersgpt-zotero-guide/SKILL.md +132 -0
  383. package/skills/writing/citation/papis-cli-guide/SKILL.md +213 -0
  384. package/skills/writing/citation/zotero-better-bibtex-guide/SKILL.md +107 -0
  385. package/skills/writing/citation/zotero-better-notes-guide/SKILL.md +121 -0
  386. package/skills/writing/citation/zotero-gpt-guide/SKILL.md +111 -0
  387. package/skills/writing/citation/zotero-mcp-guide/SKILL.md +164 -0
  388. package/skills/writing/citation/zotero-mdnotes-guide/SKILL.md +162 -0
  389. package/skills/writing/citation/zotero-reference-guide/SKILL.md +139 -0
  390. package/skills/writing/citation/zotero-scholar-guide/SKILL.md +294 -0
  391. package/skills/writing/citation/zotfile-attachment-guide/SKILL.md +140 -0
  392. package/skills/writing/composition/ml-paper-writing/SKILL.md +163 -0
  393. package/skills/writing/composition/opendraft-thesis-guide/SKILL.md +200 -0
  394. package/skills/writing/composition/paper-debugger-guide/SKILL.md +143 -0
  395. package/skills/writing/composition/paperforge-guide/SKILL.md +205 -0
  396. package/skills/writing/composition/research-paper-writer/SKILL.md +226 -0
  397. package/skills/writing/composition/scientific-writing-resources/SKILL.md +151 -0
  398. package/skills/writing/composition/scientific-writing-wrapper/SKILL.md +153 -0
  399. package/skills/writing/latex/academic-writing-latex/SKILL.md +285 -0
  400. package/skills/writing/latex/latex-drawing-collection/SKILL.md +154 -0
  401. package/skills/writing/latex/latex-templates-collection/SKILL.md +159 -0
  402. package/skills/writing/latex/md-to-pdf-academic/SKILL.md +230 -0
  403. package/skills/writing/latex/tex-render-guide/SKILL.md +243 -0
  404. package/skills/writing/polish/academic-tone-guide/SKILL.md +209 -0
  405. package/skills/writing/polish/chinese-text-humanizer/SKILL.md +140 -0
  406. package/skills/writing/polish/conciseness-editing-guide/SKILL.md +225 -0
  407. package/skills/writing/polish/paper-polish-guide/SKILL.md +160 -0
  408. package/skills/writing/templates/arxiv-preprint-template/SKILL.md +184 -0
  409. package/skills/writing/templates/elegant-paper-template/SKILL.md +141 -0
  410. package/skills/writing/templates/graphical-abstract-guide/SKILL.md +183 -0
  411. package/skills/writing/templates/novathesis-guide/SKILL.md +152 -0
  412. package/skills/writing/templates/scientific-article-pdf/SKILL.md +261 -0
  413. package/skills/writing/templates/sjtuthesis-guide/SKILL.md +197 -0
  414. package/skills/writing/templates/thuthesis-guide/SKILL.md +181 -0
  415. package/skills/literature/fulltext/repository-harvesting-guide/SKILL.md +0 -207
package/skills/domains/ai-ml/ai-agent-papers-guide/SKILL.md
@@ -0,0 +1,146 @@
+ ---
+ name: ai-agent-papers-guide
+ description: "Curated 2024-2026 AI agent research papers collection"
+ metadata:
+   openclaw:
+     emoji: "📑"
+     category: "domains"
+     subcategory: "ai-ml"
+     keywords: ["AI agents", "agent papers", "2024 research", "LLM agents", "agent frameworks", "survey"]
+     source: "https://github.com/VoltAgent/awesome-ai-agent-papers"
+ ---
+
+ # AI Agent Papers Guide (2024-2026)
+
+ ## Overview
+
+ A focused collection of AI agent research papers from 2024-2026, tracking the latest developments in LLM-based agent systems. Unlike broader collections, it concentrates on recent work: new architectures, benchmarks, multi-agent coordination, and real-world applications. The list is updated frequently because the field moves quickly.
18
+
19
+ ## Paper Categories
20
+
21
+ ```
22
+ Recent AI Agent Research
23
+ ├── Agent Architectures
24
+ │ ├── Planning (o1-style reasoning, search-augmented)
25
+ │ ├── Memory (long-term, episodic, working)
26
+ │ └── Tool use (function calling, code execution)
27
+ ├── Multi-Agent Systems
28
+ │ ├── Collaboration (task decomposition, debate)
29
+ │ ├── Competition (red team, adversarial)
30
+ │ └── Emergence (self-organization, culture)
31
+ ├── Evaluation
32
+ │ ├── Benchmarks (SWE-bench, WebArena, GAIA)
33
+ │ ├── Safety (jailbreak, misuse, alignment)
34
+ │ └── Reliability (error recovery, hallucination)
35
+ ├── Applications
36
+ │ ├── Software engineering (coding agents)
37
+ │ ├── Scientific research (lab automation)
38
+ │ ├── Web automation (browsing, form-filling)
39
+ │ └── Enterprise (workflow, data analysis)
40
+ └── Infrastructure
41
+     ├── Frameworks (LangGraph, CrewAI, AutoGen)
+     ├── Protocols (MCP, A2A, tool standards)
+     └── Deployment (scaling, monitoring, cost)
44
+ ```
45
+
46
+ ## Highlighted Papers (2024-2025)
47
+
48
+ | Paper | Venue | Key Contribution |
49
+ |-------|-------|-----------------|
50
+ | SWE-agent | NeurIPS 2024 | Agent-computer interface design for software engineering |
51
+ | OpenHands | 2024 | Open platform for coding agents |
52
+ | AgentBench | ICLR 2024 | Multi-environment agent benchmark |
53
+ | GAIA | ICLR 2024 | General AI assistant benchmark |
54
+ | Voyager | TMLR 2024 | Lifelong learning in Minecraft |
55
+ | OS-Copilot | 2024 | Self-improving computer agent |
56
+ | AutoGen | 2024 | Multi-agent conversation framework |
57
+ | Agent-FLAN | ACL 2024 | Agent fine-tuning methodology |
58
+
59
+ ## Tracking New Papers
60
+
61
+ ```python
+ import arxiv
+ from datetime import datetime, timedelta, timezone
+
+ def find_recent_agent_papers(days=14):
+     """Print agent papers submitted to arXiv in the last `days` days."""
+     # Multi-word terms are quoted so arXiv treats them as phrases
+     queries = [
+         'ti:agent AND (ti:LLM OR ti:"language model")',
+         'abs:"autonomous agent" AND abs:"tool use"',
+         'ti:"multi-agent" AND abs:"large language"',
+         'abs:"coding agent" OR abs:"software agent"',
+     ]
+
+     # arXiv timestamps are timezone-aware (UTC), so the cutoff must be too
+     cutoff = datetime.now(timezone.utc) - timedelta(days=days)
+     client = arxiv.Client()
+     seen = set()
+     papers = []
+
+     for q in queries:
+         search = arxiv.Search(
+             query=q, max_results=15,
+             sort_by=arxiv.SortCriterion.SubmittedDate,
+         )
+         # Search.results() is deprecated in arxiv 2.x; go through a Client
+         for r in client.results(search):
+             if r.published >= cutoff and r.entry_id not in seen:
+                 seen.add(r.entry_id)
+                 papers.append({
+                     "title": r.title,
+                     "date": r.published.strftime("%Y-%m-%d"),
+                     "url": r.entry_id,
+                 })
+
+     papers.sort(key=lambda x: x["date"], reverse=True)
+     for p in papers[:20]:
+         print(f"[{p['date']}] {p['title']}")
+         print(f"    {p['url']}")
+
+ find_recent_agent_papers()
+ ```
98
+
99
+ ## Framework Comparison
100
+
101
+ ```python
102
+ frameworks = {
103
+ "LangGraph": {
104
+ "paradigm": "Graph-based workflows",
105
+ "persistence": "Built-in checkpointing",
106
+ "multi_agent": "Yes",
107
+ "language": "Python/JS",
108
+ },
109
+ "CrewAI": {
110
+ "paradigm": "Role-based agents",
111
+ "persistence": "Memory module",
112
+ "multi_agent": "Yes (crew)",
113
+ "language": "Python",
114
+ },
115
+ "AutoGen": {
116
+ "paradigm": "Conversational agents",
117
+ "persistence": "Chat history",
118
+ "multi_agent": "Yes (group chat)",
119
+ "language": "Python/.NET",
120
+ },
121
+ "OpenHands": {
122
+ "paradigm": "Computer use agent",
123
+ "persistence": "Workspace state",
124
+ "multi_agent": "No",
125
+ "language": "Python",
126
+ },
127
+ }
128
+
129
+ for name, info in frameworks.items():
+     print(f"\n{name}:")
+     for k, v in info.items():
+         print(f"  {k}: {v}")
133
+ ```
134
+
135
+ ## Use Cases
136
+
137
+ 1. **Literature tracking**: Stay current on agent research
138
+ 2. **Framework selection**: Compare agent development tools
139
+ 3. **Research planning**: Identify open problems and trends
140
+ 4. **Course material**: Teach cutting-edge agent systems
141
+ 5. **Benchmark tracking**: Compare agent capabilities
142
+
143
+ ## References
144
+
145
+ - [awesome-ai-agent-papers](https://github.com/VoltAgent/awesome-ai-agent-papers)
146
+ - [VoltAgent Framework](https://github.com/VoltAgent/voltagent)
@@ -0,0 +1,209 @@
1
+ ---
2
+ name: ai-model-benchmarking
3
+ description: "Benchmark AI models across 60+ academic evaluation suites and metrics"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "📊"
7
+ category: "domains"
8
+ subcategory: "ai-ml"
9
+ keywords: ["benchmarking", "evaluation", "LLM", "MMLU", "leaderboard", "metrics", "lm-eval"]
10
+ source: "https://github.com/EleutherAI/lm-evaluation-harness"
11
+ ---
12
+
13
+ # AI Model Benchmarking Guide
14
+
15
+ ## Overview
16
+
17
+ Rigorous evaluation is the backbone of machine learning research. A model is only as credible as its evaluation protocol: which benchmarks were used, how metrics were computed, whether results are reproducible, and how they compare to baselines. The proliferation of LLMs has made this both more important and more complex, with over 60 established benchmarks and a rapidly evolving landscape.
18
+
19
+ This guide covers the practical side of model benchmarking: how to use the EleutherAI Language Model Evaluation Harness (lm-evaluation-harness), how to select benchmarks for different research claims, how to avoid common evaluation pitfalls, and how to present results for publication. The focus is on academic rigor rather than leaderboard chasing.
20
+
21
+ Whether you are evaluating a fine-tuned model for a paper, comparing architectures for an ablation study, or reviewing a submitted manuscript's evaluation section, these patterns will help ensure the evaluation is sound.
22
+
23
+ ## The lm-evaluation-harness
24
+
25
+ The EleutherAI lm-evaluation-harness is the de facto standard for LLM evaluation in academic research, supporting 60+ tasks and used by most major LLM papers.
26
+
27
+ ### Installation and Basic Usage
28
+
29
+ ```bash
30
+ # Install
31
+ pip install lm-eval
32
+
33
+ # Run a single benchmark
34
+ lm_eval --model hf \
35
+ --model_args pretrained=meta-llama/Llama-2-7b-hf \
36
+ --tasks mmlu \
37
+ --batch_size auto \
38
+ --output_path results/llama2-7b/
39
+
40
+ # Run multiple benchmarks
41
+ lm_eval --model hf \
42
+ --model_args pretrained=meta-llama/Llama-2-7b-hf \
43
+ --tasks mmlu,hellaswag,arc_challenge,winogrande,truthfulqa_mc2 \
44
+ --batch_size auto \
45
+ --num_fewshot 5 \
46
+ --output_path results/llama2-7b/
47
+ ```
48
+
49
+ ### Programmatic API
50
+
51
+ ```python
52
+ import lm_eval
53
+
54
+ results = lm_eval.simple_evaluate(
55
+ model="hf",
56
+ model_args="pretrained=meta-llama/Llama-2-7b-hf",
57
+ tasks=["mmlu", "hellaswag", "arc_challenge"],
58
+ num_fewshot=5,
59
+ batch_size="auto",
60
+ device="cuda",
61
+ )
62
+
63
+ # Access results
64
+ for task, metrics in results["results"].items():
+     print(f"{task}: {metrics}")
66
+ ```
67
+
68
+ ## Benchmark Selection by Research Claim
69
+
70
+ | Research Claim | Required Benchmarks | Why |
71
+ |---------------|--------------------|----|
72
+ | General knowledge | MMLU, ARC, TriviaQA | Broad factual coverage |
73
+ | Reasoning | GSM8K, BBH, ARC-Challenge | Multi-step logical reasoning |
74
+ | Coding | HumanEval, MBPP, DS-1000 | Code generation and understanding |
75
+ | Instruction following | MT-Bench, AlpacaEval, IFEval | Open-ended instruction quality |
76
+ | Safety | TruthfulQA, ToxiGen, BBQ | Truthfulness, toxicity, bias |
77
+ | Multilingual | MGSM, XWinograd, FLORES | Cross-lingual transfer |
78
+ | Long context | SCROLLS, LongBench, RULER | Long document understanding |
79
+ | Domain-specific | MedQA, LegalBench, SciQ | Professional domain knowledge |
80
+
81
+ ## Core Benchmarks Deep Dive
82
+
83
+ ### MMLU (Massive Multitask Language Understanding)
84
+
85
+ ```
86
+ - 57 subjects: STEM, humanities, social sciences, professional
87
+ - 14,042 questions, multiple choice (4 options)
88
+ - Standard: 5-shot evaluation
89
+ - Metric: Accuracy (macro-averaged across subjects)
90
+ - Citation: Hendrycks et al., 2021
91
+
92
+ Score interpretation:
93
+ < 25%: Below random (model is miscalibrated)
+ 25-40%: Near random (4 choices = 25% baseline)
95
+ 40-60%: Basic knowledge
96
+ 60-70%: Strong general knowledge
97
+ 70-80%: Expert-level for most subjects
98
+ > 80%: State-of-the-art (as of 2024)
99
+ ```
100
+
101
+ ### GSM8K (Grade School Math)
102
+
103
+ ```
104
+ - 8,792 grade school math word problems
105
+ - Requires multi-step arithmetic reasoning
106
+ - Standard: 8-shot chain-of-thought
107
+ - Metric: Exact match on final numerical answer
108
+ - Citation: Cobbe et al., 2021
109
+
110
+ Common pitfalls:
111
+ - Regex matching for final answer extraction
112
+ - Calculator use vs. pure model computation
113
+ - Reporting with vs. without chain-of-thought
114
+ ```
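The answer-extraction pitfall above is easiest to see in code. The following is a minimal sketch, not the harness's own filter: `extract_final_answer` and `exact_match` are hypothetical helper names, and the rule "take the last number in the completion" is an assumption that should be stated in any paper that relies on it.

```python
import re
from typing import Optional

# Integers or decimals, optional sign, optional thousands separators.
_NUMBER = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def extract_final_answer(completion: str) -> Optional[str]:
    """Return the last number in a completion with commas removed, or None."""
    matches = _NUMBER.findall(completion)
    if not matches:
        return None
    # Removing commas also drops a trailing comma picked up from prose.
    return matches[-1].replace(",", "")


def exact_match(completion: str, reference: str) -> bool:
    """Compare the extracted answer with the reference numerically."""
    predicted = extract_final_answer(completion)
    if predicted is None:
        return False
    return float(predicted) == float(reference.replace(",", ""))
```

Comparing as floats means `18` and `18.0` count as the same answer; a stricter string comparison would score them differently, which is one reason reported GSM8K numbers can diverge between evaluation setups.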
115
+
116
+ ### HumanEval (Code Generation)
117
+
118
+ ```
119
+ - 164 Python programming problems
120
+ - Function signature + docstring -> implementation
121
+ - Metric: pass@k (k=1 standard, k=10 and k=100 also reported)
122
+ - Citation: Chen et al., 2021
123
+
124
+ pass@k computation (unbiased estimator):
125
+ pass@k = 1 - C(n-c, k) / C(n, k)
126
+ where n = total samples, c = correct samples
127
+ ```
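The estimator above can be written directly with the standard library. This is a small sketch under the stated definitions (n samples per problem, c of them correct); the function names `pass_at_k` and `mean_pass_at_k` are illustrative and not taken from any particular evaluation package.

```python
from math import comb
from typing import Iterable, Tuple


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k for one problem: 1 - C(n-c, k) / C(n, k)."""
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples: every size-k draw contains a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(per_problem: Iterable[Tuple[int, int]], k: int) -> float:
    """Average pass@k over (n, c) pairs, one pair per benchmark problem."""
    values = [pass_at_k(n, c, k) for n, c in per_problem]
    return sum(values) / len(values)
```

For example, `pass_at_k(10, 3, 1)` gives the plain success rate of 3 out of 10, while larger k rewards having at least one correct sample among the k drawn.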
128
+
129
+ ## Evaluation Pitfalls
130
+
131
+ | Pitfall | Problem | Solution |
132
+ |---------|---------|----------|
133
+ | Data contamination | Benchmark data in training set | Use canary strings, report contamination analysis |
134
+ | Prompt sensitivity | Results vary with prompt format | Report results across 3+ prompt variants |
135
+ | Few-shot selection | Cherry-picked examples boost scores | Use fixed random seed for example selection |
136
+ | Metric gaming | Optimizing for specific metrics | Report multiple metrics, include calibration |
137
+ | Incomplete reporting | Only showing best results | Report mean and std across seeds |
138
+ | Version mismatch | Different benchmark versions | Pin exact dataset version and commit hash |
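The "fixed random seed" remedy for few-shot selection can be sketched as follows. `select_fewshot` is a hypothetical helper, and the default seed value is arbitrary; the point is only that a local, seeded generator makes the chosen examples reproducible without touching global random state.

```python
import random
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def select_fewshot(pool: Sequence[T], k: int, seed: int = 1234) -> List[T]:
    """Draw k distinct few-shot examples from pool, reproducibly."""
    rng = random.Random(seed)  # local RNG, independent of random.seed()
    return rng.sample(list(pool), k)
```

Reporting the seed alongside the results lets a reader regenerate exactly the same prompt examples.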
139
+
140
+ ### Contamination Detection
141
+
142
+ ```python
143
+ def check_contamination(training_data: list, benchmark_data: list, n: int = 13) -> dict:
+     """
+     Check for n-gram overlap between training data and benchmark.
+     13-gram overlap follows the contamination analysis in the GPT-3 paper
+     (Brown et al., 2020).
+     """
+
+     def extract_ngrams(text, n):
+         # Lowercased, whitespace-tokenized word n-grams
+         words = text.lower().split()
+         return set(tuple(words[i:i+n]) for i in range(len(words) - n + 1))
+
+     # Build training n-gram index
+     train_ngrams = set()
+     for text in training_data:
+         train_ngrams.update(extract_ngrams(text, n))
+
+     # Check benchmark items
+     contaminated = []
+     for i, item in enumerate(benchmark_data):
+         item_ngrams = extract_ngrams(item, n)
+         overlap = item_ngrams & train_ngrams
+         if overlap:
+             contaminated.append({
+                 "index": i,
+                 "overlap_count": len(overlap),
+                 "overlap_ratio": len(overlap) / max(len(item_ngrams), 1),
+             })
+
+     return {
+         "total_items": len(benchmark_data),
+         "contaminated_items": len(contaminated),
+         # max(..., 1) avoids division by zero on an empty benchmark list
+         "contamination_rate": len(contaminated) / max(len(benchmark_data), 1),
+         "details": contaminated,
+     }
177
+ ```
178
+
179
+ ## Reporting Results for Publication
180
+
181
+ ### Standard Results Table Format
182
+
183
+ ```markdown
184
+ | Model | Params | MMLU | GSM8K | HumanEval | ARC-C | HellaSwag | Avg |
185
+ |-------|--------|------|-------|-----------|-------|-----------|-----|
186
+ | Baseline | 7B | 45.2 | 12.3 | 15.8 | 42.1 | 72.3 | 37.5 |
187
+ | Ours | 7B | 52.1 (+6.9) | 28.7 (+16.4) | 22.0 (+6.2) | 48.9 (+6.8) | 76.1 (+3.8) | 45.6 |
188
+ | Ours (ablation A) | 7B | 49.8 | 24.1 | 19.5 | 46.2 | 74.8 | 42.9 |
189
+
190
+ All results: 5-shot for MMLU, 8-shot CoT for GSM8K, 0-shot for HumanEval,
191
+ 25-shot for ARC-C, 10-shot for HellaSwag. Mean of 3 seeds reported.
192
+ ```
193
+
194
+ ## Best Practices
195
+
196
+ - **Always report the exact evaluation framework version** (e.g., `lm-eval v0.4.2`).
197
+ - **Use the same number of few-shot examples** as the original benchmark paper.
198
+ - **Report standard deviations** across at least 3 random seeds.
199
+ - **Include a contamination analysis** for any new model trained on web data.
200
+ - **Compare against published numbers using the same evaluation code** -- do not mix results from different frameworks.
201
+ - **Report inference details:** precision (fp16/bf16/int8), context length, decoding strategy.
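The practice of reporting standard deviations across seeds can be sketched in a few lines. The helper names `summarize_seeds` and `format_score` are illustrative assumptions; the sample standard deviation (n - 1 denominator) is used here, which is a choice that should be stated in the paper.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def summarize_seeds(scores: Sequence[float]) -> Tuple[float, float]:
    """Return (mean, sample standard deviation) over per-seed scores."""
    if len(scores) < 2:
        raise ValueError("need at least two seeds to estimate a standard deviation")
    return mean(scores), stdev(scores)


def format_score(scores: Sequence[float], digits: int = 1) -> str:
    """Format per-seed scores as 'mean ± std' for a results table cell."""
    m, s = summarize_seeds(scores)
    return f"{m:.{digits}f} ± {s:.{digits}f}"
```

A call such as `format_score([50, 52, 54])` produces a cell that can be pasted straight into the results table.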
202
+
203
+ ## References
204
+
205
+ - [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) -- EleutherAI's evaluation framework (12,000+ stars)
206
+ - [Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard) -- Hugging Face community leaderboard
207
+ - [MMLU paper](https://arxiv.org/abs/2009.03300) -- Hendrycks et al., 2021
208
+ - [Holistic Evaluation of Language Models (HELM)](https://crfm.stanford.edu/helm/) -- Stanford CRFM
209
+ - [Chatbot Arena](https://chat.lmsys.org/) -- Human preference-based evaluation (LMSYS)
@@ -0,0 +1,159 @@
1
+ ---
2
+ name: annotated-dl-papers-guide
3
+ description: "Annotated deep learning paper implementations with side-by-side notes"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "📝"
7
+ category: "domains"
8
+ subcategory: "ai-ml"
9
+ keywords: ["deep-learning", "paper-implementation", "annotations", "transformer", "gan", "diffusion"]
10
+ source: "https://github.com/labmlai/annotated_deep_learning_paper_implementations"
11
+ ---
12
+
13
+ # Annotated Deep Learning Papers Guide
14
+
15
+ ## Overview
16
+
17
+ The annotated_deep_learning_paper_implementations project, maintained by labml.ai with over 66,000 GitHub stars, provides 60+ implementations of influential deep learning papers with detailed, line-by-line annotations. Each implementation is presented as a literate programming document where the code and explanations are interwoven, making it possible to read the paper and understand the implementation simultaneously.
18
+
19
+ This project bridges the gap between reading a research paper and understanding its practical implementation. For academic researchers, this is an essential resource because many breakthrough papers omit crucial implementation details, and reproducing results from a paper description alone can take weeks. The annotated implementations cover transformers, GANs, diffusion models, reinforcement learning algorithms, optimizers, and many other core deep learning building blocks.
20
+
21
+ All implementations are written in PyTorch and are designed to be self-contained, readable, and runnable. The project also provides a web interface at nn.labml.ai where you can browse implementations with syntax-highlighted code alongside formatted annotations.
22
+
23
+ ## Installation and Setup
24
+
25
+ Install the labml packages to use the implementations and experiment tracking:
26
+
27
+ ```bash
28
+ # Install the core library
29
+ pip install labml labml-nn
30
+
31
+ # Clone for direct access to all implementations
32
+ git clone https://github.com/labmlai/annotated_deep_learning_paper_implementations.git
33
+ cd annotated_deep_learning_paper_implementations
34
+
35
+ # Install in development mode
36
+ pip install -e .
37
+ ```
38
+
39
+ Requirements:
40
+
41
+ - Python 3.8+
42
+ - PyTorch >= 1.9
43
+ - labml >= 0.5 (experiment tracking and configuration)
44
+ - numpy, einops for tensor operations
45
+
46
+ The `labml` library provides experiment tracking, configuration management, and training loop utilities that are used throughout the implementations.
47
+
48
+ ## Core Paper Categories
49
+
50
+ ### Transformers and Attention
51
+
52
+ The project includes comprehensive implementations of the transformer family:
53
+
54
+ - **Original Transformer** (Vaswani et al., 2017): Multi-head attention, positional encoding, encoder-decoder architecture
55
+ - **GPT and GPT-2**: Autoregressive language modeling with causal attention
56
+ - **BERT**: Masked language modeling and next sentence prediction
57
+ - **Vision Transformer (ViT)**: Applying transformers to image classification
58
+ - **Flash Attention**: Memory-efficient attention computation
59
+ - **Rotary Position Embeddings (RoPE)**: Position encoding used in modern LLMs
60
+ - **Mixture of Experts (MoE)**: Sparse expert routing for scaling models
61
+
62
+ ```python
63
+ # Example: Multi-head attention from the transformer implementation
64
+ from labml_nn.transformers.mha import MultiHeadAttention
65
+
66
+ # The implementation includes detailed annotations explaining
67
+ # each step of the attention computation
68
+ mha = MultiHeadAttention(
69
+ heads=8,
70
+ d_model=512,
71
+ dropout_prob=0.1
72
+ )
73
+ ```
74
+
75
+ ### Generative Models
76
+
77
+ - **GAN** (Goodfellow et al., 2014): Original generative adversarial network
78
+ - **DCGAN**: Deep convolutional GAN with architectural guidelines
79
+ - **StyleGAN**: Style-based generator architecture
80
+ - **Diffusion Models (DDPM)**: Denoising diffusion probabilistic models
81
+ - **Stable Diffusion**: Latent diffusion with CLIP conditioning
82
+ - **VAE**: Variational autoencoders with KL divergence
83
+
84
+ ### Optimization and Training
85
+
86
+ - **Adam, AdamW**: Adaptive learning rate optimizers
87
+ - **LAMB**: Large batch optimization for distributed training
88
+ - **Noam learning rate schedule**: Warmup + inverse square root decay
89
+ - **Gradient clipping**: Norm-based and value-based clipping
90
+ - **Mixed precision training**: FP16/BF16 training techniques
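As an illustration of the Noam schedule listed above (linear warmup followed by inverse square root decay), here is a minimal sketch of the formula from Vaswani et al., 2017. It is written from the published formula, not copied from the labml_nn implementation, and the `factor` parameter is an assumed optional scale.

```python
def noam_lr(step: int, d_model: int, warmup_steps: int = 4000, factor: float = 1.0) -> float:
    """Learning rate at a given step (1-indexed) under the Noam schedule.

    lr = factor * d_model^-0.5 * min(step^-0.5, step * warmup_steps^-1.5)
    """
    if step < 1:
        raise ValueError("step must be >= 1")
    return factor * d_model ** -0.5 * min(step ** -0.5, step * warmup_steps ** -1.5)
```

The two arguments of `min` are equal at `step == warmup_steps`, so the rate rises linearly up to that point and decays afterwards.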
91
+
92
+ ### Normalization and Regularization
93
+
94
+ - **Batch Normalization**: Per-batch statistics normalization
95
+ - **Layer Normalization**: Per-sample normalization for transformers
96
+ - **RMSNorm**: Simplified normalization used in LLaMA
97
+ - **Dropout and DropPath**: Stochastic regularization methods
98
+
99
+ ## Using Implementations in Research
100
+
101
+ Each implementation can be used as a building block in your own research projects. The modular design allows you to swap components easily:
102
+
103
+ ```python
104
+ import torch.nn as nn
+ from labml_nn.transformers.mha import MultiHeadAttention
+ from labml_nn.normalization.rmsnorm import RMSNorm
+
+ # Build a custom pre-norm transformer block with RMSNorm instead of LayerNorm
+ class CustomTransformerBlock(nn.Module):
+     def __init__(self, d_model, heads, d_ff):
+         super().__init__()
+         self.attention = MultiHeadAttention(heads, d_model)
+         self.norm1 = RMSNorm(d_model)
+         self.norm2 = RMSNorm(d_model)
+         self.feed_forward = nn.Sequential(
+             nn.Linear(d_model, d_ff),
+             nn.GELU(),
+             nn.Linear(d_ff, d_model)
+         )
+
+     def forward(self, x):
+         # x: [seq_len, batch_size, d_model]; normalize once and reuse for q, k, v
+         h = self.norm1(x)
+         # MultiHeadAttention.forward takes keyword-only arguments
+         x = x + self.attention(query=h, key=h, value=h, mask=None)
+         x = x + self.feed_forward(self.norm2(x))
+         return x
125
+ ```
126
+
127
+ The experiment tracking integration with labml makes it straightforward to log metrics, hyperparameters, and model checkpoints:
128
+
129
+ ```python
130
+ from labml import experiment, tracker
+
+ # Create an experiment
+ experiment.create(name="custom_transformer_ablation")
+
+ # Track metrics during training
+ # (num_epochs, dataloader, and train_step are placeholders for your own code)
+ with experiment.start():
+     for epoch in range(num_epochs):
+         for batch in dataloader:
+             loss = train_step(batch)
+             tracker.save({"loss": loss, "epoch": epoch})
140
+ ```
141
+
142
+ ## Research Workflow Integration
143
+
144
+ This project fits naturally into an academic deep learning research workflow:
145
+
146
+ 1. **Literature review**: Read the annotated implementation alongside the original paper to build deep understanding
147
+ 2. **Baseline reproduction**: Use the provided implementation as a verified baseline for comparison experiments
148
+ 3. **Architecture modification**: Fork a specific implementation and modify components for your research hypothesis
149
+ 4. **Ablation studies**: Systematically disable or replace components to measure their contribution
150
+ 5. **Paper writing**: Reference the annotated implementation for accurate method descriptions
151
+
152
+ The web interface at nn.labml.ai provides a searchable index of all implementations, organized by topic. Each page shows the paper citation, a brief summary, and the annotated code with toggleable explanations.
153
+
154
+ ## References
155
+
156
+ - Repository: https://github.com/labmlai/annotated_deep_learning_paper_implementations
157
+ - Web interface: https://nn.labml.ai/
158
+ - labml experiment tracking: https://github.com/labmlai/labml
159
+ - PyTorch documentation: https://pytorch.org/docs/stable/
@@ -0,0 +1,167 @@
1
+ ---
2
+ name: anomaly-detection-papers-guide
3
+ description: "Industrial anomaly detection methods and benchmark papers"
4
+ metadata:
5
+ openclaw:
6
+ emoji: "🔍"
7
+ category: "domains"
8
+ subcategory: "ai-ml"
9
+ keywords: ["anomaly detection", "industrial inspection", "defect detection", "MVTec", "unsupervised AD", "visual inspection"]
10
+ source: "https://github.com/M-3LAB/awesome-industrial-anomaly-detection"
11
+ ---
12
+
13
+ # Industrial Anomaly Detection Papers Guide
14
+
15
+ ## Overview
16
+
17
+ Industrial anomaly detection uses machine learning to identify defects, faults, and anomalies in manufacturing and quality inspection. This curated collection covers methods from reconstruction-based (autoencoders) to memory-bank approaches (PatchCore), normalizing flows, knowledge distillation, and foundation model-based detectors. Includes benchmark datasets, evaluation metrics, and real-world deployment considerations.
18
+
19
+ ## Method Taxonomy
20
+
21
+ ```
22
+ Anomaly Detection Methods
23
+ ├── Reconstruction-based
24
+ │ ├── Autoencoder (AE, VAE)
25
+ │ ├── GAN-based (AnoGAN, GANomaly)
26
+ │ └── Diffusion-based (AnoDDPM)
27
+ ├── Embedding-based
28
+ │ ├── Memory bank (PatchCore, PaDiM)
29
+ │ ├── Knowledge distillation (STPM, RD4AD)
30
+ │ └── Self-supervised (CutPaste, DRAEM)
31
+ ├── Normalizing Flows
32
+ │ ├── FastFlow, CFLOW-AD, CS-Flow
33
+ │ └── DifferNet
34
+ ├── Foundation Models
35
+ │ ├── CLIP-based (WinCLIP, AnomalyCLIP)
36
+ │ ├── SAM-based (GroundedSAM-AD)
37
+ │ └── Vision-language (AnomalyGPT)
38
+ └── 3D Anomaly Detection
39
+     ├── Point cloud methods
+     └── Multi-modal (RGB + 3D)
41
+ ```
42
+
43
+ ## Key Methods
44
+
45
+ | Method | Year | Approach | MVTec AUROC |
46
+ |--------|------|----------|-------------|
47
+ | **PatchCore** | 2022 | Memory bank | 99.1% |
48
+ | **PaDiM** | 2021 | Multivariate Gaussian | 97.9% |
49
+ | **RD4AD** | 2022 | Knowledge distillation | 98.5% |
50
+ | **FastFlow** | 2022 | Normalizing flow | 99.4% |
51
+ | **SimpleNet** | 2023 | Feature adaptation | 99.6% |
52
+ | **WinCLIP** | 2023 | CLIP zero-shot | 91.8% |
53
+ | **AnomalyGPT** | 2024 | Vision-language | 96.3% |
54
+
55
+ ## Benchmark Datasets
56
+
57
+ ```python
58
+ benchmarks = {
59
+ "MVTec AD": {
60
+ "categories": 15,
61
+ "images": 5354,
62
+ "type": "Product/texture defects",
63
+ "annotation": "Pixel-level masks",
64
+ },
65
+ "MVTec 3D-AD": {
66
+ "categories": 10,
67
+ "images": 4147,
68
+ "type": "3D point cloud + RGB",
69
+ },
70
+ "VisA": {
71
+ "categories": 12,
72
+ "images": 10821,
73
+ "type": "Complex structure anomalies",
74
+ },
75
+ "BTAD": {
76
+ "categories": 3,
77
+ "images": 2830,
78
+ "type": "Industrial body/surface",
79
+ },
80
+ "MPDD": {
81
+ "categories": 6,
82
+ "images": 1064,
83
+ "type": "Metal parts defects",
84
+ },
85
+ }
86
+
87
+ for name, info in benchmarks.items():
+     print(f"{name}: {info['categories']} categories, "
+           f"{info['images']} images: {info['type']}")
90
+ ```
91
+
92
+ ## Quick Implementation
93
+
94
+ ```python
95
+ # PatchCore-style anomaly detection
96
+ from anomalib.data import MVTec
97
+ from anomalib.models import Patchcore
98
+ from anomalib.engine import Engine
99
+
100
+ # Setup dataset
101
+ datamodule = MVTec(
102
+ root="./datasets/MVTec",
103
+ category="bottle",
104
+ image_size=(256, 256),
105
+ )
106
+
107
+ # Initialize model
108
+ model = Patchcore(
109
+ backbone="wide_resnet50_2",
110
+ layers=["layer2", "layer3"],
111
+ coreset_sampling_ratio=0.1,
112
+ )
113
+
114
+ # Train and test
115
+ engine = Engine()
116
+ engine.fit(model=model, datamodule=datamodule)
117
+ results = engine.test(model=model, datamodule=datamodule)
118
+ print(f"Image AUROC: {results[0]['image_AUROC']:.3f}")
119
+ print(f"Pixel AUROC: {results[0]['pixel_AUROC']:.3f}")
120
+ ```
121
+
122
+ ## Evaluation Metrics
123
+
124
+ ```python
125
+ # Standard anomaly detection metrics
126
+ from sklearn.metrics import roc_auc_score
127
+ import numpy as np
128
+
129
+ # Image-level: Is this image anomalous?
130
+ image_auroc = roc_auc_score(y_true_image, y_score_image)
131
+
132
+ # Pixel-level: Where is the anomaly?
133
+ pixel_auroc = roc_auc_score(
134
+ y_true_pixel.flatten(), y_score_pixel.flatten()
135
+ )
136
+
137
+ # PRO metric: Per-Region Overlap
138
+ # Better than pixel AUROC for small anomalies
139
+ # Weights each connected anomaly region equally
140
+ ```
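To make the AUROC numbers above concrete: AUROC is the probability that a randomly chosen anomalous sample receives a higher score than a randomly chosen normal one, with ties counted as one half. The sketch below computes it by direct pairwise comparison in pure Python; `auroc` is an illustrative helper, not part of scikit-learn or anomalib, and its quadratic cost makes it suitable for image-level scores only, not full pixel maps.

```python
from typing import Sequence


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC via pairwise comparison (1 = anomalous, 0 = normal)."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one anomalous and one normal sample")
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0      # anomalous sample ranked above the normal one
            elif p == q:
                wins += 0.5      # ties count as half
    return wins / (len(positives) * len(negatives))
```

On inputs without ties this agrees with `roc_auc_score`; for large test sets the scikit-learn function is the practical choice.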
141
+
142
+ ## Research Frontiers
143
+
144
+ ```markdown
145
+ ### Active Directions (2024-2025)
146
+ 1. **Zero/few-shot AD** — Detect anomalies with no or only a few normal training images
147
+ 2. **Multi-class unified** — One model for all product categories
148
+ 3. **Foundation model AD** — CLIP/SAM/LLM-based detection
149
+ 4. **Logical anomalies** — Structural/contextual defects
150
+ 5. **Continual learning** — Adapt to new defect types
151
+ 6. **3D anomaly detection** — Point cloud and multi-modal
152
+ 7. **Real-time deployment** — Edge device optimization
153
+ ```
154
+
155
+ ## Use Cases
156
+
157
+ 1. **Manufacturing QC**: Automated visual inspection pipelines
158
+ 2. **Research benchmarking**: Compare new methods on standard datasets
159
+ 3. **Survey writing**: Comprehensive method taxonomy and comparison
160
+ 4. **Course teaching**: Industrial AI and computer vision curricula
161
+ 5. **Defect analysis**: Understanding failure modes and patterns
162
+
163
+ ## References
164
+
165
+ - [awesome-industrial-anomaly-detection](https://github.com/M-3LAB/awesome-industrial-anomaly-detection)
166
+ - [Anomalib Library](https://github.com/openvinotoolkit/anomalib)
167
+ - [MVTec AD Dataset](https://www.mvtec.com/company/research/datasets/mvtec-ad)