@bgicli/bgicli 2.1.1 → 2.2.0

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (1266) hide show
  1. package/data/skills/aav-vector-design-agent/SKILL.md +198 -0
  2. package/data/skills/adaptyv/SKILL.md +112 -0
  3. package/data/skills/adhd-daily-planner/SKILL.md +271 -0
  4. package/data/skills/aeon/SKILL.md +372 -0
  5. package/data/skills/agent-browser/SKILL.md +159 -0
  6. package/data/skills/agentd-drug-discovery/SKILL.md +52 -0
  7. package/data/skills/ai-analyzer/SKILL.md +218 -0
  8. package/data/skills/alphafold/SKILL.md +183 -0
  9. package/data/skills/alphafold-database/SKILL.md +500 -0
  10. package/data/skills/anndata/SKILL.md +394 -0
  11. package/data/skills/antibody-design-agent/SKILL.md +64 -0
  12. package/data/skills/arboreto/SKILL.md +237 -0
  13. package/data/skills/armored-cart-design-agent/SKILL.md +225 -0
  14. package/data/skills/arxiv-search/SKILL.md +224 -0
  15. package/data/skills/autonomous-oncology-agent/SKILL.md +77 -0
  16. package/data/skills/bayesian-optimizer/SKILL.md +60 -0
  17. package/data/skills/benchling-integration/SKILL.md +473 -0
  18. package/data/skills/bgpt-paper-search/SKILL.md +81 -0
  19. package/data/skills/bindcraft/SKILL.md +198 -0
  20. package/data/skills/binder-design/SKILL.md +182 -0
  21. package/data/skills/binding-characterization/SKILL.md +234 -0
  22. package/data/skills/bindingdb-database/SKILL.md +332 -0
  23. package/data/skills/bio-admet-prediction/SKILL.md +224 -0
  24. package/data/skills/bio-alignment-files-bam-statistics/SKILL.md +340 -0
  25. package/data/skills/bio-alignment-filtering/SKILL.md +322 -0
  26. package/data/skills/bio-alignment-indexing/SKILL.md +249 -0
  27. package/data/skills/bio-alignment-io/SKILL.md +301 -0
  28. package/data/skills/bio-alignment-msa-parsing/SKILL.md +366 -0
  29. package/data/skills/bio-alignment-msa-statistics/SKILL.md +375 -0
  30. package/data/skills/bio-alignment-pairwise/SKILL.md +277 -0
  31. package/data/skills/bio-alignment-sorting/SKILL.md +296 -0
  32. package/data/skills/bio-alignment-validation/SKILL.md +374 -0
  33. package/data/skills/bio-atac-seq-atac-peak-calling/SKILL.md +221 -0
  34. package/data/skills/bio-atac-seq-atac-qc/SKILL.md +292 -0
  35. package/data/skills/bio-atac-seq-differential-accessibility/SKILL.md +268 -0
  36. package/data/skills/bio-atac-seq-footprinting/SKILL.md +256 -0
  37. package/data/skills/bio-atac-seq-motif-deviation/SKILL.md +319 -0
  38. package/data/skills/bio-atac-seq-nucleosome-positioning/SKILL.md +321 -0
  39. package/data/skills/bio-basecalling/SKILL.md +368 -0
  40. package/data/skills/bio-batch-downloads/SKILL.md +384 -0
  41. package/data/skills/bio-batch-processing/SKILL.md +303 -0
  42. package/data/skills/bio-bedgraph-handling/SKILL.md +336 -0
  43. package/data/skills/bio-blast-searches/SKILL.md +354 -0
  44. package/data/skills/bio-causal-genomics-colocalization-analysis/SKILL.md +264 -0
  45. package/data/skills/bio-causal-genomics-fine-mapping/SKILL.md +267 -0
  46. package/data/skills/bio-causal-genomics-mediation-analysis/SKILL.md +264 -0
  47. package/data/skills/bio-causal-genomics-mendelian-randomization/SKILL.md +221 -0
  48. package/data/skills/bio-causal-genomics-pleiotropy-detection/SKILL.md +292 -0
  49. package/data/skills/bio-cfdna-preprocessing/SKILL.md +200 -0
  50. package/data/skills/bio-chipseq-differential-binding/SKILL.md +262 -0
  51. package/data/skills/bio-chipseq-motif-analysis/SKILL.md +387 -0
  52. package/data/skills/bio-chipseq-peak-annotation/SKILL.md +239 -0
  53. package/data/skills/bio-chipseq-peak-calling/SKILL.md +277 -0
  54. package/data/skills/bio-chipseq-qc/SKILL.md +391 -0
  55. package/data/skills/bio-chipseq-super-enhancers/SKILL.md +288 -0
  56. package/data/skills/bio-chipseq-visualization/SKILL.md +289 -0
  57. package/data/skills/bio-clinical-databases-clinvar-lookup/SKILL.md +188 -0
  58. package/data/skills/bio-clinical-databases-dbsnp-queries/SKILL.md +171 -0
  59. package/data/skills/bio-clinical-databases-gnomad-frequencies/SKILL.md +205 -0
  60. package/data/skills/bio-clinical-databases-hla-typing/SKILL.md +248 -0
  61. package/data/skills/bio-clinical-databases-myvariant-queries/SKILL.md +174 -0
  62. package/data/skills/bio-clinical-databases-pharmacogenomics/SKILL.md +232 -0
  63. package/data/skills/bio-clinical-databases-polygenic-risk/SKILL.md +276 -0
  64. package/data/skills/bio-clinical-databases-somatic-signatures/SKILL.md +261 -0
  65. package/data/skills/bio-clinical-databases-tumor-mutational-burden/SKILL.md +301 -0
  66. package/data/skills/bio-clinical-databases-variant-prioritization/SKILL.md +225 -0
  67. package/data/skills/bio-clip-seq-binding-site-annotation/SKILL.md +66 -0
  68. package/data/skills/bio-clip-seq-clip-alignment/SKILL.md +70 -0
  69. package/data/skills/bio-clip-seq-clip-motif-analysis/SKILL.md +62 -0
  70. package/data/skills/bio-clip-seq-clip-peak-calling/SKILL.md +282 -0
  71. package/data/skills/bio-clip-seq-clip-preprocessing/SKILL.md +142 -0
  72. package/data/skills/bio-codon-usage/SKILL.md +353 -0
  73. package/data/skills/bio-comparative-genomics-ancestral-reconstruction/SKILL.md +312 -0
  74. package/data/skills/bio-comparative-genomics-hgt-detection/SKILL.md +341 -0
  75. package/data/skills/bio-comparative-genomics-ortholog-inference/SKILL.md +308 -0
  76. package/data/skills/bio-comparative-genomics-positive-selection/SKILL.md +354 -0
  77. package/data/skills/bio-comparative-genomics-synteny-analysis/SKILL.md +315 -0
  78. package/data/skills/bio-compressed-files/SKILL.md +263 -0
  79. package/data/skills/bio-consensus-sequences/SKILL.md +340 -0
  80. package/data/skills/bio-copy-number-cnv-annotation/SKILL.md +307 -0
  81. package/data/skills/bio-copy-number-cnv-visualization/SKILL.md +294 -0
  82. package/data/skills/bio-copy-number-cnvkit-analysis/SKILL.md +290 -0
  83. package/data/skills/bio-copy-number-gatk-cnv/SKILL.md +270 -0
  84. package/data/skills/bio-crispr-screens-base-editing-analysis/SKILL.md +110 -0
  85. package/data/skills/bio-crispr-screens-batch-correction/SKILL.md +316 -0
  86. package/data/skills/bio-crispr-screens-crispresso-editing/SKILL.md +205 -0
  87. package/data/skills/bio-crispr-screens-hit-calling/SKILL.md +264 -0
  88. package/data/skills/bio-crispr-screens-jacks-analysis/SKILL.md +313 -0
  89. package/data/skills/bio-crispr-screens-library-design/SKILL.md +417 -0
  90. package/data/skills/bio-crispr-screens-mageck-analysis/SKILL.md +222 -0
  91. package/data/skills/bio-crispr-screens-screen-qc/SKILL.md +243 -0
  92. package/data/skills/bio-ctdna-mutation-detection/SKILL.md +234 -0
  93. package/data/skills/bio-data-visualization-circos-plots/SKILL.md +405 -0
  94. package/data/skills/bio-data-visualization-color-palettes/SKILL.md +244 -0
  95. package/data/skills/bio-data-visualization-genome-browser-tracks/SKILL.md +328 -0
  96. package/data/skills/bio-data-visualization-genome-tracks/SKILL.md +249 -0
  97. package/data/skills/bio-data-visualization-ggplot2-fundamentals/SKILL.md +313 -0
  98. package/data/skills/bio-data-visualization-heatmaps-clustering/SKILL.md +227 -0
  99. package/data/skills/bio-data-visualization-interactive-visualization/SKILL.md +210 -0
  100. package/data/skills/bio-data-visualization-multipanel-figures/SKILL.md +274 -0
  101. package/data/skills/bio-data-visualization-specialized-omics-plots/SKILL.md +251 -0
  102. package/data/skills/bio-data-visualization-upset-plots/SKILL.md +228 -0
  103. package/data/skills/bio-data-visualization-volcano-customization/SKILL.md +233 -0
  104. package/data/skills/bio-de-deseq2-basics/SKILL.md +376 -0
  105. package/data/skills/bio-de-edger-basics/SKILL.md +418 -0
  106. package/data/skills/bio-de-results/SKILL.md +378 -0
  107. package/data/skills/bio-de-visualization/SKILL.md +408 -0
  108. package/data/skills/bio-differential-expression-batch-correction/SKILL.md +253 -0
  109. package/data/skills/bio-differential-expression-timeseries-de/SKILL.md +370 -0
  110. package/data/skills/bio-differential-splicing/SKILL.md +177 -0
  111. package/data/skills/bio-duplicate-handling/SKILL.md +292 -0
  112. package/data/skills/bio-entrez-fetch/SKILL.md +334 -0
  113. package/data/skills/bio-entrez-link/SKILL.md +325 -0
  114. package/data/skills/bio-entrez-search/SKILL.md +311 -0
  115. package/data/skills/bio-epidemiological-genomics-amr-surveillance/SKILL.md +233 -0
  116. package/data/skills/bio-epidemiological-genomics-pathogen-typing/SKILL.md +202 -0
  117. package/data/skills/bio-epidemiological-genomics-phylodynamics/SKILL.md +207 -0
  118. package/data/skills/bio-epidemiological-genomics-transmission-inference/SKILL.md +237 -0
  119. package/data/skills/bio-epidemiological-genomics-variant-surveillance/SKILL.md +237 -0
  120. package/data/skills/bio-epitranscriptomics-m6a-differential/SKILL.md +88 -0
  121. package/data/skills/bio-epitranscriptomics-m6a-peak-calling/SKILL.md +89 -0
  122. package/data/skills/bio-epitranscriptomics-m6anet-analysis/SKILL.md +101 -0
  123. package/data/skills/bio-epitranscriptomics-merip-preprocessing/SKILL.md +81 -0
  124. package/data/skills/bio-epitranscriptomics-modification-visualization/SKILL.md +98 -0
  125. package/data/skills/bio-experimental-design-batch-design/SKILL.md +110 -0
  126. package/data/skills/bio-experimental-design-multiple-testing/SKILL.md +98 -0
  127. package/data/skills/bio-experimental-design-power-analysis/SKILL.md +84 -0
  128. package/data/skills/bio-experimental-design-sample-size/SKILL.md +93 -0
  129. package/data/skills/bio-expression-matrix-counts-ingest/SKILL.md +220 -0
  130. package/data/skills/bio-expression-matrix-gene-id-mapping/SKILL.md +256 -0
  131. package/data/skills/bio-expression-matrix-metadata-joins/SKILL.md +271 -0
  132. package/data/skills/bio-expression-matrix-sparse-handling/SKILL.md +247 -0
  133. package/data/skills/bio-fastq-quality/SKILL.md +279 -0
  134. package/data/skills/bio-filter-sequences/SKILL.md +265 -0
  135. package/data/skills/bio-flow-cytometry-bead-normalization/SKILL.md +315 -0
  136. package/data/skills/bio-flow-cytometry-clustering-phenotyping/SKILL.md +237 -0
  137. package/data/skills/bio-flow-cytometry-compensation-transformation/SKILL.md +196 -0
  138. package/data/skills/bio-flow-cytometry-cytometry-qc/SKILL.md +382 -0
  139. package/data/skills/bio-flow-cytometry-differential-analysis/SKILL.md +217 -0
  140. package/data/skills/bio-flow-cytometry-doublet-detection/SKILL.md +288 -0
  141. package/data/skills/bio-flow-cytometry-fcs-handling/SKILL.md +221 -0
  142. package/data/skills/bio-flow-cytometry-gating-analysis/SKILL.md +193 -0
  143. package/data/skills/bio-format-conversion/SKILL.md +193 -0
  144. package/data/skills/bio-fragment-analysis/SKILL.md +214 -0
  145. package/data/skills/bio-gatk-variant-calling/SKILL.md +422 -0
  146. package/data/skills/bio-genome-assembly-assembly-polishing/SKILL.md +333 -0
  147. package/data/skills/bio-genome-assembly-assembly-qc/SKILL.md +344 -0
  148. package/data/skills/bio-genome-assembly-contamination-detection/SKILL.md +235 -0
  149. package/data/skills/bio-genome-assembly-hifi-assembly/SKILL.md +178 -0
  150. package/data/skills/bio-genome-assembly-long-read-assembly/SKILL.md +307 -0
  151. package/data/skills/bio-genome-assembly-metagenome-assembly/SKILL.md +227 -0
  152. package/data/skills/bio-genome-assembly-scaffolding/SKILL.md +204 -0
  153. package/data/skills/bio-genome-assembly-short-read-assembly/SKILL.md +319 -0
  154. package/data/skills/bio-genome-engineering-base-editing-design/SKILL.md +277 -0
  155. package/data/skills/bio-genome-engineering-grna-design/SKILL.md +221 -0
  156. package/data/skills/bio-genome-engineering-hdr-template-design/SKILL.md +264 -0
  157. package/data/skills/bio-genome-engineering-off-target-prediction/SKILL.md +232 -0
  158. package/data/skills/bio-genome-engineering-prime-editing-design/SKILL.md +275 -0
  159. package/data/skills/bio-genome-intervals-bed-file-basics/SKILL.md +357 -0
  160. package/data/skills/bio-genome-intervals-bigwig-tracks/SKILL.md +351 -0
  161. package/data/skills/bio-genome-intervals-coverage-analysis/SKILL.md +300 -0
  162. package/data/skills/bio-genome-intervals-gtf-gff-handling/SKILL.md +345 -0
  163. package/data/skills/bio-genome-intervals-interval-arithmetic/SKILL.md +485 -0
  164. package/data/skills/bio-genome-intervals-proximity-operations/SKILL.md +337 -0
  165. package/data/skills/bio-geo-data/SKILL.md +380 -0
  166. package/data/skills/bio-hi-c-analysis-compartment-analysis/SKILL.md +261 -0
  167. package/data/skills/bio-hi-c-analysis-contact-pairs/SKILL.md +278 -0
  168. package/data/skills/bio-hi-c-analysis-hic-data-io/SKILL.md +260 -0
  169. package/data/skills/bio-hi-c-analysis-hic-differential/SKILL.md +328 -0
  170. package/data/skills/bio-hi-c-analysis-hic-visualization/SKILL.md +297 -0
  171. package/data/skills/bio-hi-c-analysis-loop-calling/SKILL.md +284 -0
  172. package/data/skills/bio-hi-c-analysis-matrix-operations/SKILL.md +274 -0
  173. package/data/skills/bio-hi-c-analysis-tad-detection/SKILL.md +239 -0
  174. package/data/skills/bio-imaging-mass-cytometry-cell-segmentation/SKILL.md +241 -0
  175. package/data/skills/bio-imaging-mass-cytometry-data-preprocessing/SKILL.md +279 -0
  176. package/data/skills/bio-imaging-mass-cytometry-interactive-annotation/SKILL.md +304 -0
  177. package/data/skills/bio-imaging-mass-cytometry-phenotyping/SKILL.md +231 -0
  178. package/data/skills/bio-imaging-mass-cytometry-quality-metrics/SKILL.md +316 -0
  179. package/data/skills/bio-imaging-mass-cytometry-spatial-analysis/SKILL.md +246 -0
  180. package/data/skills/bio-immunoinformatics-epitope-prediction/SKILL.md +259 -0
  181. package/data/skills/bio-immunoinformatics-immunogenicity-scoring/SKILL.md +275 -0
  182. package/data/skills/bio-immunoinformatics-mhc-binding-prediction/SKILL.md +260 -0
  183. package/data/skills/bio-immunoinformatics-neoantigen-prediction/SKILL.md +277 -0
  184. package/data/skills/bio-immunoinformatics-tcr-epitope-binding/SKILL.md +257 -0
  185. package/data/skills/bio-isoform-switching/SKILL.md +192 -0
  186. package/data/skills/bio-liquid-biopsy-pipeline/SKILL.md +311 -0
  187. package/data/skills/bio-local-blast/SKILL.md +350 -0
  188. package/data/skills/bio-long-read-sequencing-clair3-variants/SKILL.md +252 -0
  189. package/data/skills/bio-long-read-sequencing-isoseq-analysis/SKILL.md +334 -0
  190. package/data/skills/bio-long-read-sequencing-nanopore-methylation/SKILL.md +110 -0
  191. package/data/skills/bio-longitudinal-monitoring/SKILL.md +271 -0
  192. package/data/skills/bio-longread-alignment/SKILL.md +193 -0
  193. package/data/skills/bio-longread-medaka/SKILL.md +176 -0
  194. package/data/skills/bio-longread-qc/SKILL.md +224 -0
  195. package/data/skills/bio-longread-structural-variants/SKILL.md +201 -0
  196. package/data/skills/bio-machine-learning-atlas-mapping/SKILL.md +139 -0
  197. package/data/skills/bio-machine-learning-biomarker-discovery/SKILL.md +157 -0
  198. package/data/skills/bio-machine-learning-model-validation/SKILL.md +148 -0
  199. package/data/skills/bio-machine-learning-omics-classifiers/SKILL.md +146 -0
  200. package/data/skills/bio-machine-learning-prediction-explanation/SKILL.md +162 -0
  201. package/data/skills/bio-machine-learning-survival-analysis/SKILL.md +176 -0
  202. package/data/skills/bio-metabolomics-lipidomics/SKILL.md +265 -0
  203. package/data/skills/bio-metabolomics-metabolite-annotation/SKILL.md +241 -0
  204. package/data/skills/bio-metabolomics-msdial-preprocessing/SKILL.md +308 -0
  205. package/data/skills/bio-metabolomics-normalization-qc/SKILL.md +283 -0
  206. package/data/skills/bio-metabolomics-pathway-mapping/SKILL.md +237 -0
  207. package/data/skills/bio-metabolomics-statistical-analysis/SKILL.md +276 -0
  208. package/data/skills/bio-metabolomics-targeted-analysis/SKILL.md +314 -0
  209. package/data/skills/bio-metabolomics-xcms-preprocessing/SKILL.md +268 -0
  210. package/data/skills/bio-metagenomics-abundance/SKILL.md +203 -0
  211. package/data/skills/bio-metagenomics-amr-detection/SKILL.md +293 -0
  212. package/data/skills/bio-metagenomics-functional-profiling/SKILL.md +252 -0
  213. package/data/skills/bio-metagenomics-kraken/SKILL.md +204 -0
  214. package/data/skills/bio-metagenomics-metaphlan/SKILL.md +214 -0
  215. package/data/skills/bio-metagenomics-strain-tracking/SKILL.md +292 -0
  216. package/data/skills/bio-metagenomics-visualization/SKILL.md +240 -0
  217. package/data/skills/bio-methylation-based-detection/SKILL.md +223 -0
  218. package/data/skills/bio-methylation-bismark-alignment/SKILL.md +195 -0
  219. package/data/skills/bio-methylation-calling/SKILL.md +200 -0
  220. package/data/skills/bio-methylation-dmr-detection/SKILL.md +211 -0
  221. package/data/skills/bio-methylation-methylkit/SKILL.md +219 -0
  222. package/data/skills/bio-microbiome-amplicon-processing/SKILL.md +137 -0
  223. package/data/skills/bio-microbiome-differential-abundance/SKILL.md +147 -0
  224. package/data/skills/bio-microbiome-diversity-analysis/SKILL.md +188 -0
  225. package/data/skills/bio-microbiome-functional-prediction/SKILL.md +153 -0
  226. package/data/skills/bio-microbiome-qiime2-workflow/SKILL.md +219 -0
  227. package/data/skills/bio-microbiome-taxonomy-assignment/SKILL.md +168 -0
  228. package/data/skills/bio-molecular-descriptors/SKILL.md +200 -0
  229. package/data/skills/bio-molecular-io/SKILL.md +188 -0
  230. package/data/skills/bio-motif-search/SKILL.md +354 -0
  231. package/data/skills/bio-multi-omics-data-harmonization/SKILL.md +228 -0
  232. package/data/skills/bio-multi-omics-mixomics-analysis/SKILL.md +221 -0
  233. package/data/skills/bio-multi-omics-mofa-integration/SKILL.md +225 -0
  234. package/data/skills/bio-multi-omics-similarity-network/SKILL.md +235 -0
  235. package/data/skills/bio-orchestrator/SKILL.md +133 -0
  236. package/data/skills/bio-paired-end-fastq/SKILL.md +334 -0
  237. package/data/skills/bio-pathway-enrichment-visualization/SKILL.md +278 -0
  238. package/data/skills/bio-pathway-go-enrichment/SKILL.md +218 -0
  239. package/data/skills/bio-pathway-gsea/SKILL.md +227 -0
  240. package/data/skills/bio-pathway-kegg-pathways/SKILL.md +234 -0
  241. package/data/skills/bio-pathway-reactome/SKILL.md +215 -0
  242. package/data/skills/bio-pathway-wikipathways/SKILL.md +255 -0
  243. package/data/skills/bio-pdb-geometric-analysis/SKILL.md +475 -0
  244. package/data/skills/bio-pdb-structure-io/SKILL.md +296 -0
  245. package/data/skills/bio-pdb-structure-modification/SKILL.md +448 -0
  246. package/data/skills/bio-pdb-structure-navigation/SKILL.md +335 -0
  247. package/data/skills/bio-phasing-imputation-genotype-imputation/SKILL.md +201 -0
  248. package/data/skills/bio-phasing-imputation-haplotype-phasing/SKILL.md +190 -0
  249. package/data/skills/bio-phasing-imputation-imputation-qc/SKILL.md +265 -0
  250. package/data/skills/bio-phasing-imputation-reference-panels/SKILL.md +203 -0
  251. package/data/skills/bio-phylo-distance-calculations/SKILL.md +307 -0
  252. package/data/skills/bio-phylo-modern-tree-inference/SKILL.md +274 -0
  253. package/data/skills/bio-phylo-tree-io/SKILL.md +252 -0
  254. package/data/skills/bio-phylo-tree-manipulation/SKILL.md +375 -0
  255. package/data/skills/bio-phylo-tree-visualization/SKILL.md +275 -0
  256. package/data/skills/bio-pileup-generation/SKILL.md +314 -0
  257. package/data/skills/bio-population-genetics-association-testing/SKILL.md +293 -0
  258. package/data/skills/bio-population-genetics-linkage-disequilibrium/SKILL.md +260 -0
  259. package/data/skills/bio-population-genetics-plink-basics/SKILL.md +338 -0
  260. package/data/skills/bio-population-genetics-population-structure/SKILL.md +352 -0
  261. package/data/skills/bio-population-genetics-scikit-allel-analysis/SKILL.md +306 -0
  262. package/data/skills/bio-population-genetics-selection-statistics/SKILL.md +251 -0
  263. package/data/skills/bio-primer-design-primer-basics/SKILL.md +289 -0
  264. package/data/skills/bio-primer-design-primer-validation/SKILL.md +344 -0
  265. package/data/skills/bio-primer-design-qpcr-primers/SKILL.md +273 -0
  266. package/data/skills/bio-proteomics-data-import/SKILL.md +122 -0
  267. package/data/skills/bio-proteomics-dia-analysis/SKILL.md +246 -0
  268. package/data/skills/bio-proteomics-differential-abundance/SKILL.md +129 -0
  269. package/data/skills/bio-proteomics-peptide-identification/SKILL.md +122 -0
  270. package/data/skills/bio-proteomics-protein-inference/SKILL.md +174 -0
  271. package/data/skills/bio-proteomics-proteomics-qc/SKILL.md +208 -0
  272. package/data/skills/bio-proteomics-ptm-analysis/SKILL.md +139 -0
  273. package/data/skills/bio-proteomics-quantification/SKILL.md +141 -0
  274. package/data/skills/bio-proteomics-spectral-libraries/SKILL.md +270 -0
  275. package/data/skills/bio-reaction-enumeration/SKILL.md +251 -0
  276. package/data/skills/bio-read-alignment-bowtie2-alignment/SKILL.md +189 -0
  277. package/data/skills/bio-read-alignment-bwa-alignment/SKILL.md +166 -0
  278. package/data/skills/bio-read-alignment-hisat2-alignment/SKILL.md +205 -0
  279. package/data/skills/bio-read-alignment-star-alignment/SKILL.md +204 -0
  280. package/data/skills/bio-read-qc-adapter-trimming/SKILL.md +222 -0
  281. package/data/skills/bio-read-qc-contamination-screening/SKILL.md +252 -0
  282. package/data/skills/bio-read-qc-fastp-workflow/SKILL.md +278 -0
  283. package/data/skills/bio-read-qc-quality-filtering/SKILL.md +231 -0
  284. package/data/skills/bio-read-qc-quality-reports/SKILL.md +204 -0
  285. package/data/skills/bio-read-qc-umi-processing/SKILL.md +391 -0
  286. package/data/skills/bio-read-sequences/SKILL.md +319 -0
  287. package/data/skills/bio-reference-operations/SKILL.md +302 -0
  288. package/data/skills/bio-reporting-automated-qc-reports/SKILL.md +103 -0
  289. package/data/skills/bio-reporting-figure-export/SKILL.md +112 -0
  290. package/data/skills/bio-reporting-jupyter-reports/SKILL.md +98 -0
  291. package/data/skills/bio-reporting-quarto-reports/SKILL.md +295 -0
  292. package/data/skills/bio-reporting-rmarkdown-reports/SKILL.md +276 -0
  293. package/data/skills/bio-research-tools-biomarker-signature-studio/SKILL.md +99 -0
  294. package/data/skills/bio-restriction-enzyme-selection/SKILL.md +342 -0
  295. package/data/skills/bio-restriction-fragment-analysis/SKILL.md +259 -0
  296. package/data/skills/bio-restriction-mapping/SKILL.md +239 -0
  297. package/data/skills/bio-restriction-sites/SKILL.md +222 -0
  298. package/data/skills/bio-reverse-complement/SKILL.md +250 -0
  299. package/data/skills/bio-ribo-seq-orf-detection/SKILL.md +303 -0
  300. package/data/skills/bio-ribo-seq-riboseq-preprocessing/SKILL.md +176 -0
  301. package/data/skills/bio-ribo-seq-ribosome-periodicity/SKILL.md +182 -0
  302. package/data/skills/bio-ribo-seq-ribosome-stalling/SKILL.md +217 -0
  303. package/data/skills/bio-ribo-seq-translation-efficiency/SKILL.md +183 -0
  304. package/data/skills/bio-rna-quantification-alignment-free-quant/SKILL.md +226 -0
  305. package/data/skills/bio-rna-quantification-count-matrix-qc/SKILL.md +310 -0
  306. package/data/skills/bio-rna-quantification-featurecounts-counting/SKILL.md +190 -0
  307. package/data/skills/bio-rna-quantification-tximport-workflow/SKILL.md +240 -0
  308. package/data/skills/bio-rnaseq-qc/SKILL.md +320 -0
  309. package/data/skills/bio-sam-bam-basics/SKILL.md +248 -0
  310. package/data/skills/bio-sashimi-plots/SKILL.md +175 -0
  311. package/data/skills/bio-seq-objects/SKILL.md +240 -0
  312. package/data/skills/bio-sequence-properties/SKILL.md +397 -0
  313. package/data/skills/bio-sequence-similarity/SKILL.md +335 -0
  314. package/data/skills/bio-sequence-slicing/SKILL.md +232 -0
  315. package/data/skills/bio-sequence-statistics/SKILL.md +318 -0
  316. package/data/skills/bio-similarity-searching/SKILL.md +200 -0
  317. package/data/skills/bio-single-cell-batch-integration/SKILL.md +317 -0
  318. package/data/skills/bio-single-cell-cell-annotation/SKILL.md +259 -0
  319. package/data/skills/bio-single-cell-cell-communication/SKILL.md +257 -0
  320. package/data/skills/bio-single-cell-clustering/SKILL.md +330 -0
  321. package/data/skills/bio-single-cell-data-io/SKILL.md +315 -0
  322. package/data/skills/bio-single-cell-doublet-detection/SKILL.md +362 -0
  323. package/data/skills/bio-single-cell-lineage-tracing/SKILL.md +319 -0
  324. package/data/skills/bio-single-cell-markers-annotation/SKILL.md +317 -0
  325. package/data/skills/bio-single-cell-metabolite-communication/SKILL.md +258 -0
  326. package/data/skills/bio-single-cell-multimodal-integration/SKILL.md +242 -0
  327. package/data/skills/bio-single-cell-perturb-seq/SKILL.md +258 -0
  328. package/data/skills/bio-single-cell-preprocessing/SKILL.md +338 -0
  329. package/data/skills/bio-single-cell-scatac-analysis/SKILL.md +326 -0
  330. package/data/skills/bio-single-cell-splicing/SKILL.md +199 -0
  331. package/data/skills/bio-single-cell-trajectory-inference/SKILL.md +225 -0
  332. package/data/skills/bio-small-rna-seq-differential-mirna/SKILL.md +194 -0
  333. package/data/skills/bio-small-rna-seq-mirdeep2-analysis/SKILL.md +180 -0
  334. package/data/skills/bio-small-rna-seq-mirge3-analysis/SKILL.md +178 -0
  335. package/data/skills/bio-small-rna-seq-smrna-preprocessing/SKILL.md +174 -0
  336. package/data/skills/bio-small-rna-seq-target-prediction/SKILL.md +202 -0
  337. package/data/skills/bio-spatial-transcriptomics-image-analysis/SKILL.md +283 -0
  338. package/data/skills/bio-spatial-transcriptomics-spatial-communication/SKILL.md +299 -0
  339. package/data/skills/bio-spatial-transcriptomics-spatial-data-io/SKILL.md +272 -0
  340. package/data/skills/bio-spatial-transcriptomics-spatial-deconvolution/SKILL.md +314 -0
  341. package/data/skills/bio-spatial-transcriptomics-spatial-domains/SKILL.md +254 -0
  342. package/data/skills/bio-spatial-transcriptomics-spatial-multiomics/SKILL.md +181 -0
  343. package/data/skills/bio-spatial-transcriptomics-spatial-neighbors/SKILL.md +198 -0
  344. package/data/skills/bio-spatial-transcriptomics-spatial-preprocessing/SKILL.md +269 -0
  345. package/data/skills/bio-spatial-transcriptomics-spatial-proteomics/SKILL.md +124 -0
  346. package/data/skills/bio-spatial-transcriptomics-spatial-statistics/SKILL.md +237 -0
  347. package/data/skills/bio-spatial-transcriptomics-spatial-visualization/SKILL.md +287 -0
  348. package/data/skills/bio-splicing-pipeline/SKILL.md +253 -0
  349. package/data/skills/bio-splicing-qc/SKILL.md +190 -0
  350. package/data/skills/bio-splicing-quantification/SKILL.md +145 -0
  351. package/data/skills/bio-sra-data/SKILL.md +363 -0
  352. package/data/skills/bio-structural-biology-alphafold-predictions/SKILL.md +258 -0
  353. package/data/skills/bio-structural-biology-modern-structure-prediction/SKILL.md +346 -0
  354. package/data/skills/bio-substructure-search/SKILL.md +206 -0
  355. package/data/skills/bio-systems-biology-context-specific-models/SKILL.md +241 -0
  356. package/data/skills/bio-systems-biology-flux-balance-analysis/SKILL.md +206 -0
  357. package/data/skills/bio-systems-biology-gene-essentiality/SKILL.md +235 -0
  358. package/data/skills/bio-systems-biology-metabolic-reconstruction/SKILL.md +215 -0
  359. package/data/skills/bio-systems-biology-model-curation/SKILL.md +243 -0
  360. package/data/skills/bio-tcr-bcr-analysis-immcantation-analysis/SKILL.md +195 -0
  361. package/data/skills/bio-tcr-bcr-analysis-mixcr-analysis/SKILL.md +167 -0
  362. package/data/skills/bio-tcr-bcr-analysis-repertoire-visualization/SKILL.md +224 -0
  363. package/data/skills/bio-tcr-bcr-analysis-scirpy-analysis/SKILL.md +168 -0
  364. package/data/skills/bio-tcr-bcr-analysis-vdjtools-analysis/SKILL.md +188 -0
  365. package/data/skills/bio-transcription-translation/SKILL.md +237 -0
  366. package/data/skills/bio-tumor-fraction-estimation/SKILL.md +211 -0
  367. package/data/skills/bio-uniprot-access/SKILL.md +239 -0
  368. package/data/skills/bio-variant-annotation/SKILL.md +410 -0
  369. package/data/skills/bio-variant-calling/SKILL.md +266 -0
  370. package/data/skills/bio-variant-calling-clinical-interpretation/SKILL.md +355 -0
  371. package/data/skills/bio-variant-calling-deepvariant/SKILL.md +315 -0
  372. package/data/skills/bio-variant-calling-filtering-best-practices/SKILL.md +403 -0
  373. package/data/skills/bio-variant-calling-joint-calling/SKILL.md +338 -0
  374. package/data/skills/bio-variant-calling-structural-variant-calling/SKILL.md +253 -0
  375. package/data/skills/bio-variant-normalization/SKILL.md +325 -0
  376. package/data/skills/bio-vcf-basics/SKILL.md +342 -0
  377. package/data/skills/bio-vcf-manipulation/SKILL.md +429 -0
  378. package/data/skills/bio-vcf-statistics/SKILL.md +445 -0
  379. package/data/skills/bio-virtual-screening/SKILL.md +263 -0
  380. package/data/skills/bio-workflow-management-cwl-workflows/SKILL.md +433 -0
  381. package/data/skills/bio-workflow-management-nextflow-pipelines/SKILL.md +386 -0
  382. package/data/skills/bio-workflow-management-snakemake-workflows/SKILL.md +383 -0
  383. package/data/skills/bio-workflow-management-wdl-workflows/SKILL.md +500 -0
  384. package/data/skills/bio-workflows-atacseq-pipeline/SKILL.md +362 -0
  385. package/data/skills/bio-workflows-biomarker-pipeline/SKILL.md +272 -0
  386. package/data/skills/bio-workflows-chipseq-pipeline/SKILL.md +282 -0
  387. package/data/skills/bio-workflows-clip-pipeline/SKILL.md +268 -0
  388. package/data/skills/bio-workflows-cnv-pipeline/SKILL.md +324 -0
  389. package/data/skills/bio-workflows-crispr-editing-pipeline/SKILL.md +455 -0
  390. package/data/skills/bio-workflows-crispr-screen-pipeline/SKILL.md +278 -0
  391. package/data/skills/bio-workflows-cytometry-pipeline/SKILL.md +328 -0
  392. package/data/skills/bio-workflows-expression-to-pathways/SKILL.md +329 -0
  393. package/data/skills/bio-workflows-fastq-to-variants/SKILL.md +374 -0
  394. package/data/skills/bio-workflows-genome-assembly-pipeline/SKILL.md +290 -0
  395. package/data/skills/bio-workflows-gwas-pipeline/SKILL.md +323 -0
  396. package/data/skills/bio-workflows-hic-pipeline/SKILL.md +304 -0
  397. package/data/skills/bio-workflows-imc-pipeline/SKILL.md +304 -0
  398. package/data/skills/bio-workflows-longread-sv-pipeline/SKILL.md +281 -0
  399. package/data/skills/bio-workflows-merip-pipeline/SKILL.md +222 -0
  400. package/data/skills/bio-workflows-metabolic-modeling-pipeline/SKILL.md +408 -0
  401. package/data/skills/bio-workflows-metabolomics-pipeline/SKILL.md +297 -0
  402. package/data/skills/bio-workflows-metagenomics-pipeline/SKILL.md +283 -0
  403. package/data/skills/bio-workflows-methylation-pipeline/SKILL.md +274 -0
  404. package/data/skills/bio-workflows-microbiome-pipeline/SKILL.md +221 -0
  405. package/data/skills/bio-workflows-multi-omics-pipeline/SKILL.md +362 -0
  406. package/data/skills/bio-workflows-multiome-pipeline/SKILL.md +298 -0
  407. package/data/skills/bio-workflows-neoantigen-pipeline/SKILL.md +325 -0
  408. package/data/skills/bio-workflows-outbreak-pipeline/SKILL.md +341 -0
  409. package/data/skills/bio-workflows-proteomics-pipeline/SKILL.md +226 -0
  410. package/data/skills/bio-workflows-riboseq-pipeline/SKILL.md +94 -0
  411. package/data/skills/bio-workflows-rnaseq-to-de/SKILL.md +345 -0
  412. package/data/skills/bio-workflows-scrnaseq-pipeline/SKILL.md +354 -0
  413. package/data/skills/bio-workflows-smrna-pipeline/SKILL.md +86 -0
  414. package/data/skills/bio-workflows-somatic-variant-pipeline/SKILL.md +313 -0
  415. package/data/skills/bio-workflows-spatial-pipeline/SKILL.md +267 -0
  416. package/data/skills/bio-workflows-tcr-pipeline/SKILL.md +84 -0
  417. package/data/skills/bio-write-sequences/SKILL.md +205 -0
  418. package/data/skills/bioinformatics-singlecell/SKILL.md +143 -0
  419. package/data/skills/biokernel/SKILL.md +61 -0
  420. package/data/skills/biologist-analyst/SKILL.md +799 -0
  421. package/data/skills/biomaster-workflows/SKILL.md +55 -0
  422. package/data/skills/biomcp-server/SKILL.md +65 -0
  423. package/data/skills/biomedical-data-analysis/SKILL.md +56 -0
  424. package/data/skills/biomedical-search/SKILL.md +214 -0
  425. package/data/skills/biomni/SKILL.md +309 -0
  426. package/data/skills/biomni-general-agent/SKILL.md +43 -0
  427. package/data/skills/biomni-research-agent/SKILL.md +76 -0
  428. package/data/skills/biopython/SKILL.md +437 -0
  429. package/data/skills/biorxiv-database/SKILL.md +477 -0
  430. package/data/skills/bioservices/SKILL.md +355 -0
  431. package/data/skills/boltz/SKILL.md +188 -0
  432. package/data/skills/boltzgen/SKILL.md +287 -0
  433. package/data/skills/bone-marrow-ai-agent/SKILL.md +163 -0
  434. package/data/skills/brainstorming/SKILL.md +96 -0
  435. package/data/skills/brenda-database/SKILL.md +714 -0
  436. package/data/skills/bulk-combat-correction/SKILL.md +54 -0
  437. package/data/skills/bulk-deg-analysis/SKILL.md +61 -0
  438. package/data/skills/bulk-deseq2-analysis/SKILL.md +50 -0
  439. package/data/skills/bulk-stringdb-ppi/SKILL.md +49 -0
  440. package/data/skills/bulk-to-single-deconvolution/SKILL.md +50 -0
  441. package/data/skills/bulk-trajblend-interpolation/SKILL.md +52 -0
  442. package/data/skills/bulk-wgcna-analysis/SKILL.md +56 -0
  443. package/data/skills/cancer-metabolism-agent/SKILL.md +180 -0
  444. package/data/skills/care-coordination/SKILL.md +35 -0
  445. package/data/skills/cart-design-optimizer-agent/SKILL.md +162 -0
  446. package/data/skills/cbioportal-database/SKILL.md +367 -0
  447. package/data/skills/cell-free-expression/SKILL.md +291 -0
  448. package/data/skills/cellagent-annotation/SKILL.md +69 -0
  449. package/data/skills/cellfree-rna-agent/SKILL.md +182 -0
  450. package/data/skills/cellular-senescence-agent/SKILL.md +183 -0
  451. package/data/skills/cellxgene-census/SKILL.md +505 -0
  452. package/data/skills/chai/SKILL.md +272 -0
  453. package/data/skills/chatehr-clinician-assistant/SKILL.md +67 -0
  454. package/data/skills/chematagent-drug-discovery/SKILL.md +68 -0
  455. package/data/skills/chembl-database/SKILL.md +383 -0
  456. package/data/skills/chembl-search/SKILL.md +211 -0
  457. package/data/skills/chemcrow-drug-discovery/SKILL.md +61 -0
  458. package/data/skills/chemical-property-lookup/SKILL.md +42 -0
  459. package/data/skills/chemist-analyst/SKILL.md +1603 -0
  460. package/data/skills/chemistry-agent/SKILL.md +62 -0
  461. package/data/skills/chip-clonal-hematopoiesis-agent/SKILL.md +224 -0
  462. package/data/skills/chromosomal-instability-agent/SKILL.md +187 -0
  463. package/data/skills/citation-management/SKILL.md +1081 -0
  464. package/data/skills/claims-appeals/SKILL.md +35 -0
  465. package/data/skills/claw-ancestry-pca/SKILL.md +145 -0
  466. package/data/skills/claw-metagenomics/SKILL.md +238 -0
  467. package/data/skills/claw-semantic-sim/SKILL.md +151 -0
  468. package/data/skills/clinical-decision-support/SKILL.md +504 -0
  469. package/data/skills/clinical-diagnostic-reasoning/SKILL.md +222 -0
  470. package/data/skills/clinical-nlp-extractor/SKILL.md +59 -0
  471. package/data/skills/clinical-note-summarization/SKILL.md +52 -0
  472. package/data/skills/clinical-reports/SKILL.md +1127 -0
  473. package/data/skills/clinical-trial-protocol-skill/SKILL.md +508 -0
  474. package/data/skills/clinical-trials-search/SKILL.md +211 -0
  475. package/data/skills/clinicaltrials-database/SKILL.md +501 -0
  476. package/data/skills/clinpgx/SKILL.md +96 -0
  477. package/data/skills/clinpgx-database/SKILL.md +632 -0
  478. package/data/skills/clinvar-database/SKILL.md +356 -0
  479. package/data/skills/cnv-caller-agent/SKILL.md +171 -0
  480. package/data/skills/coagulation-thrombosis-agent/SKILL.md +141 -0
  481. package/data/skills/cobrapy/SKILL.md +457 -0
  482. package/data/skills/compbioagent-explorer/SKILL.md +67 -0
  483. package/data/skills/computational-pathology-agent/SKILL.md +72 -0
  484. package/data/skills/convergence-study/SKILL.md +98 -0
  485. package/data/skills/cosmic-database/SKILL.md +330 -0
  486. package/data/skills/crisis-detection-intervention-ai/SKILL.md +569 -0
  487. package/data/skills/crisis-response-protocol/SKILL.md +456 -0
  488. package/data/skills/crispr-guide-design/SKILL.md +72 -0
  489. package/data/skills/crispr-offtarget-predictor/SKILL.md +56 -0
  490. package/data/skills/cryoem-ai-drug-design-agent/SKILL.md +216 -0
  491. package/data/skills/ctdna-dynamics-mrd-agent/SKILL.md +206 -0
  492. package/data/skills/cytokine-storm-analysis-agent/SKILL.md +180 -0
  493. package/data/skills/dask/SKILL.md +454 -0
  494. package/data/skills/data-stats-analysis/SKILL.md +477 -0
  495. package/data/skills/data-transform/SKILL.md +576 -0
  496. package/data/skills/data-visualization-biomedical/SKILL.md +252 -0
  497. package/data/skills/data-visualization-expert/SKILL.md +72 -0
  498. package/data/skills/data-viz-plots/SKILL.md +461 -0
  499. package/data/skills/datacommons-client/SKILL.md +253 -0
  500. package/data/skills/datamol/SKILL.md +700 -0
  501. package/data/skills/deep-research/SKILL.md +111 -0
  502. package/data/skills/deep-research-swarm/SKILL.md +62 -0
  503. package/data/skills/deep-visual-proteomics-agent/SKILL.md +149 -0
  504. package/data/skills/deepchem/SKILL.md +591 -0
  505. package/data/skills/deeptools/SKILL.md +525 -0
  506. package/data/skills/depmap/SKILL.md +300 -0
  507. package/data/skills/diffdock/SKILL.md +477 -0
  508. package/data/skills/differentiation-schemes/SKILL.md +159 -0
  509. package/data/skills/digital-twin-clinical-agent/SKILL.md +228 -0
  510. package/data/skills/dispatching-parallel-agents/SKILL.md +180 -0
  511. package/data/skills/dnanexus-integration/SKILL.md +376 -0
  512. package/data/skills/doc-coauthoring/SKILL.md +375 -0
  513. package/data/skills/docx/SKILL.md +590 -0
  514. package/data/skills/docx-official/SKILL.md +197 -0
  515. package/data/skills/drug-discovery-search/SKILL.md +214 -0
  516. package/data/skills/drug-interaction-checker/SKILL.md +56 -0
  517. package/data/skills/drug-labels-search/SKILL.md +211 -0
  518. package/data/skills/drug-photo/SKILL.md +149 -0
  519. package/data/skills/drugbank-database/SKILL.md +184 -0
  520. package/data/skills/drugbank-search/SKILL.md +211 -0
  521. package/data/skills/ehr-fhir-integration/SKILL.md +60 -0
  522. package/data/skills/emergency-card/SKILL.md +426 -0
  523. package/data/skills/ena-database/SKILL.md +198 -0
  524. package/data/skills/ensembl-database/SKILL.md +305 -0
  525. package/data/skills/epidemiologist-analyst/SKILL.md +1844 -0
  526. package/data/skills/epigenomics-methylgpt-agent/SKILL.md +111 -0
  527. package/data/skills/equity-scorer/SKILL.md +182 -0
  528. package/data/skills/esm/SKILL.md +300 -0
  529. package/data/skills/etetoolkit/SKILL.md +617 -0
  530. package/data/skills/executing-plans/SKILL.md +84 -0
  531. package/data/skills/exosome-ev-analysis-agent/SKILL.md +171 -0
  532. package/data/skills/exploratory-data-analysis/SKILL.md +440 -0
  533. package/data/skills/family-health-analyzer/SKILL.md +137 -0
  534. package/data/skills/fastq-analysis/SKILL.md +191 -0
  535. package/data/skills/fda-database/SKILL.md +512 -0
  536. package/data/skills/fhir-developer-skill/SKILL.md +294 -0
  537. package/data/skills/fhir-development/SKILL.md +35 -0
  538. package/data/skills/find-skills/SKILL.md +133 -0
  539. package/data/skills/finishing-a-development-branch/SKILL.md +200 -0
  540. package/data/skills/fitness-analyzer/SKILL.md +431 -0
  541. package/data/skills/flowio/SKILL.md +602 -0
  542. package/data/skills/foldseek/SKILL.md +179 -0
  543. package/data/skills/galaxy-bridge/SKILL.md +215 -0
  544. package/data/skills/gene-database/SKILL.md +173 -0
  545. package/data/skills/gene-panel-design-agent/SKILL.md +192 -0
  546. package/data/skills/geniml/SKILL.md +312 -0
  547. package/data/skills/genome-compare/SKILL.md +127 -0
  548. package/data/skills/geo-database/SKILL.md +809 -0
  549. package/data/skills/geopandas/SKILL.md +245 -0
  550. package/data/skills/gget/SKILL.md +865 -0
  551. package/data/skills/ginkgo-cloud-lab/SKILL.md +56 -0
  552. package/data/skills/glycoengineering/SKILL.md +338 -0
  553. package/data/skills/gnomad-database/SKILL.md +395 -0
  554. package/data/skills/goal-analyzer/SKILL.md +605 -0
  555. package/data/skills/grief-companion/SKILL.md +250 -0
  556. package/data/skills/gsea-enrichment/SKILL.md +151 -0
  557. package/data/skills/gtars/SKILL.md +279 -0
  558. package/data/skills/gtex-database/SKILL.md +315 -0
  559. package/data/skills/gwas-database/SKILL.md +602 -0
  560. package/data/skills/gwas-lookup/SKILL.md +122 -0
  561. package/data/skills/gwas-prs/SKILL.md +178 -0
  562. package/data/skills/health-trend-analyzer/SKILL.md +451 -0
  563. package/data/skills/hemoglobinopathy-analysis-agent/SKILL.md +167 -0
  564. package/data/skills/hipaa-compliance/SKILL.md +230 -0
  565. package/data/skills/histolab/SKILL.md +672 -0
  566. package/data/skills/hmdb-database/SKILL.md +190 -0
  567. package/data/skills/hrd-analysis-agent/SKILL.md +184 -0
  568. package/data/skills/hrv-alexithymia-expert/SKILL.md +151 -0
  569. package/data/skills/hypogenic/SKILL.md +649 -0
  570. package/data/skills/hypothesis-generation/SKILL.md +286 -0
  571. package/data/skills/imaging-data-commons/SKILL.md +843 -0
  572. package/data/skills/immune-checkpoint-combination-agent/SKILL.md +170 -0
  573. package/data/skills/infographics/SKILL.md +563 -0
  574. package/data/skills/instrument-data-to-allotrope/SKILL.md +280 -0
  575. package/data/skills/interpro-database/SKILL.md +305 -0
  576. package/data/skills/ipsae/SKILL.md +190 -0
  577. package/data/skills/iso-13485-certification/SKILL.md +678 -0
  578. package/data/skills/jaspar-database/SKILL.md +351 -0
  579. package/data/skills/jungian-psychologist/SKILL.md +191 -0
  580. package/data/skills/kegg-database/SKILL.md +371 -0
  581. package/data/skills/knowledge-synthesis/SKILL.md +283 -0
  582. package/data/skills/kragen-knowledge-graph/SKILL.md +68 -0
  583. package/data/skills/lab-results/SKILL.md +35 -0
  584. package/data/skills/labarchive-integration/SKILL.md +262 -0
  585. package/data/skills/labstep/SKILL.md +208 -0
  586. package/data/skills/lamindb/SKILL.md +384 -0
  587. package/data/skills/latchbio-integration/SKILL.md +347 -0
  588. package/data/skills/latex-posters/SKILL.md +1602 -0
  589. package/data/skills/leads-literature-mining/SKILL.md +68 -0
  590. package/data/skills/ligandmpnn/SKILL.md +170 -0
  591. package/data/skills/linear-solvers/SKILL.md +165 -0
  592. package/data/skills/liquid-biopsy-analytics-agent/SKILL.md +171 -0
  593. package/data/skills/lit-synthesizer/SKILL.md +53 -0
  594. package/data/skills/literature-review/SKILL.md +584 -0
  595. package/data/skills/literature-search/SKILL.md +214 -0
  596. package/data/skills/lobster-bioinformatics/SKILL.md +305 -0
  597. package/data/skills/long-read-sequencing-agent/SKILL.md +181 -0
  598. package/data/skills/mage-antibody-generator/SKILL.md +54 -0
  599. package/data/skills/markdown-mermaid-writing/SKILL.md +327 -0
  600. package/data/skills/markitdown/SKILL.md +486 -0
  601. package/data/skills/matchms/SKILL.md +197 -0
  602. package/data/skills/matplotlib/SKILL.md +359 -0
  603. package/data/skills/mcpmed-bioinformatics-server/SKILL.md +42 -0
  604. package/data/skills/medchem/SKILL.md +400 -0
  605. package/data/skills/medea-therapeutic-discovery/SKILL.md +45 -0
  606. package/data/skills/medical-entity-extractor/SKILL.md +144 -0
  607. package/data/skills/medical-imaging-review/SKILL.md +170 -0
  608. package/data/skills/medical-research-toolkit/SKILL.md +273 -0
  609. package/data/skills/medrxiv-search/SKILL.md +211 -0
  610. package/data/skills/mental-health-analyzer/SKILL.md +981 -0
  611. package/data/skills/mesh-generation/SKILL.md +149 -0
  612. package/data/skills/metabolomics-workbench-database/SKILL.md +253 -0
  613. package/data/skills/microbiome-cancer-agent/SKILL.md +180 -0
  614. package/data/skills/modern-drug-rehab-computer/SKILL.md +392 -0
  615. package/data/skills/molecular-dynamics/SKILL.md +457 -0
  616. package/data/skills/molecular-glue-discovery-agent/SKILL.md +224 -0
  617. package/data/skills/molecule-evolution-agent/SKILL.md +62 -0
  618. package/data/skills/molfeat/SKILL.md +505 -0
  619. package/data/skills/monarch-database/SKILL.md +372 -0
  620. package/data/skills/mpn-progression-monitor-agent/SKILL.md +228 -0
  621. package/data/skills/mpn-research-assistant/SKILL.md +197 -0
  622. package/data/skills/mrd-edge-detection-agent/SKILL.md +213 -0
  623. package/data/skills/multi-ancestry-prs-agent/SKILL.md +224 -0
  624. package/data/skills/multi-search-engine/SKILL.md +110 -0
  625. package/data/skills/multimodal-medical-imaging/SKILL.md +59 -0
  626. package/data/skills/multimodal-radpath-fusion-agent/SKILL.md +213 -0
  627. package/data/skills/myeloma-mrd-agent/SKILL.md +184 -0
  628. package/data/skills/networkx/SKILL.md +435 -0
  629. package/data/skills/neurokit2/SKILL.md +350 -0
  630. package/data/skills/neuropixels-analysis/SKILL.md +344 -0
  631. package/data/skills/nextflow-development/SKILL.md +290 -0
  632. package/data/skills/ngs-analysis/SKILL.md +183 -0
  633. package/data/skills/nicheformer-spatial-agent/SKILL.md +197 -0
  634. package/data/skills/nk-cell-therapy-agent/SKILL.md +186 -0
  635. package/data/skills/nonlinear-solvers/SKILL.md +180 -0
  636. package/data/skills/numerical-integration/SKILL.md +166 -0
  637. package/data/skills/numerical-stability/SKILL.md +149 -0
  638. package/data/skills/nutrition-analyzer/SKILL.md +775 -0
  639. package/data/skills/occupational-health-analyzer/SKILL.md +386 -0
  640. package/data/skills/omero-integration/SKILL.md +245 -0
  641. package/data/skills/ontology-explorer/SKILL.md +168 -0
  642. package/data/skills/ontology-mapper/SKILL.md +171 -0
  643. package/data/skills/ontology-validator/SKILL.md +136 -0
  644. package/data/skills/open-notebook/SKILL.md +289 -0
  645. package/data/skills/open-targets-search/SKILL.md +211 -0
  646. package/data/skills/openalex-database/SKILL.md +488 -0
  647. package/data/skills/opentargets-database/SKILL.md +367 -0
  648. package/data/skills/opentrons-integration/SKILL.md +567 -0
  649. package/data/skills/opentrons-protocol-agent/SKILL.md +58 -0
  650. package/data/skills/organoid-drug-response-agent/SKILL.md +189 -0
  651. package/data/skills/pan-cancer-multiomics-agent/SKILL.md +159 -0
  652. package/data/skills/paper-2-web/SKILL.md +495 -0
  653. package/data/skills/parameter-optimization/SKILL.md +141 -0
  654. package/data/skills/patents-search/SKILL.md +211 -0
  655. package/data/skills/pathml/SKILL.md +160 -0
  656. package/data/skills/patiently-ai/SKILL.md +103 -0
  657. package/data/skills/pdb/SKILL.md +217 -0
  658. package/data/skills/pdb-database/SKILL.md +303 -0
  659. package/data/skills/pdf/SKILL.md +314 -0
  660. package/data/skills/pdf-anthropic/SKILL.md +294 -0
  661. package/data/skills/pdf-processing/SKILL.md +149 -0
  662. package/data/skills/pdf-processing-pro/SKILL.md +296 -0
  663. package/data/skills/pdx-model-analysis-agent/SKILL.md +169 -0
  664. package/data/skills/peer-review/SKILL.md +565 -0
  665. package/data/skills/performance-profiling/SKILL.md +255 -0
  666. package/data/skills/perplexity-search/SKILL.md +441 -0
  667. package/data/skills/pharmacogenomics-agent/SKILL.md +143 -0
  668. package/data/skills/pharmgx-reporter/SKILL.md +134 -0
  669. package/data/skills/phylogenetics/SKILL.md +404 -0
  670. package/data/skills/plotly/SKILL.md +265 -0
  671. package/data/skills/polars/SKILL.md +385 -0
  672. package/data/skills/popeve-variant-predictor-agent/SKILL.md +213 -0
  673. package/data/skills/post-processing/SKILL.md +338 -0
  674. package/data/skills/pptx/SKILL.md +232 -0
  675. package/data/skills/pptx-official/SKILL.md +484 -0
  676. package/data/skills/pptx-posters/SKILL.md +414 -0
  677. package/data/skills/precision-oncology-agent/SKILL.md +53 -0
  678. package/data/skills/prior-auth-coworker/SKILL.md +60 -0
  679. package/data/skills/prior-auth-review-skill/SKILL.md +360 -0
  680. package/data/skills/profile-report/SKILL.md +120 -0
  681. package/data/skills/protac-design-agent/SKILL.md +220 -0
  682. package/data/skills/protein-design-workflow/SKILL.md +199 -0
  683. package/data/skills/protein-qc/SKILL.md +300 -0
  684. package/data/skills/protein-structure-prediction/SKILL.md +59 -0
  685. package/data/skills/proteinmpnn/SKILL.md +279 -0
  686. package/data/skills/protocolsio-integration/SKILL.md +415 -0
  687. package/data/skills/prs-net-deep-learning-agent/SKILL.md +232 -0
  688. package/data/skills/psychologist-analyst/SKILL.md +1888 -0
  689. package/data/skills/pubchem-database/SKILL.md +568 -0
  690. package/data/skills/pubmed-database/SKILL.md +454 -0
  691. package/data/skills/pubmed-search/SKILL.md +103 -0
  692. package/data/skills/pydeseq2/SKILL.md +553 -0
  693. package/data/skills/pydicom/SKILL.md +428 -0
  694. package/data/skills/pyhealth/SKILL.md +485 -0
  695. package/data/skills/pylabrobot/SKILL.md +179 -0
  696. package/data/skills/pymc/SKILL.md +566 -0
  697. package/data/skills/pymoo/SKILL.md +565 -0
  698. package/data/skills/pyopenms/SKILL.md +211 -0
  699. package/data/skills/pysam/SKILL.md +259 -0
  700. package/data/skills/pytdc/SKILL.md +454 -0
  701. package/data/skills/pytorch-lightning/SKILL.md +172 -0
  702. package/data/skills/pyzotero/SKILL.md +111 -0
  703. package/data/skills/radgpt-radiology-reporter/SKILL.md +67 -0
  704. package/data/skills/radiomics-pathomics-fusion-agent/SKILL.md +221 -0
  705. package/data/skills/rdkit/SKILL.md +763 -0
  706. package/data/skills/reactome-database/SKILL.md +272 -0
  707. package/data/skills/receiving-code-review/SKILL.md +213 -0
  708. package/data/skills/recovery-community-moderator/SKILL.md +175 -0
  709. package/data/skills/regulatory-drafter/SKILL.md +56 -0
  710. package/data/skills/regulatory-drafting/SKILL.md +35 -0
  711. package/data/skills/rehabilitation-analyzer/SKILL.md +636 -0
  712. package/data/skills/repro-enforcer/SKILL.md +50 -0
  713. package/data/skills/requesting-code-review/SKILL.md +105 -0
  714. package/data/skills/research-grants/SKILL.md +935 -0
  715. package/data/skills/research-literature/SKILL.md +35 -0
  716. package/data/skills/research-lookup/SKILL.md +502 -0
  717. package/data/skills/rfdiffusion/SKILL.md +306 -0
  718. package/data/skills/rna-velocity-agent/SKILL.md +174 -0
  719. package/data/skills/scanpy/SKILL.md +380 -0
  720. package/data/skills/scfoundation-model-agent/SKILL.md +210 -0
  721. package/data/skills/scientific-brainstorming/SKILL.md +185 -0
  722. package/data/skills/scientific-critical-thinking/SKILL.md +566 -0
  723. package/data/skills/scientific-manuscript/SKILL.md +181 -0
  724. package/data/skills/scientific-problem-selection/SKILL.md +269 -0
  725. package/data/skills/scientific-schematics/SKILL.md +619 -0
  726. package/data/skills/scientific-slides/SKILL.md +1154 -0
  727. package/data/skills/scientific-visualization/SKILL.md +773 -0
  728. package/data/skills/scientific-writing/SKILL.md +483 -0
  729. package/data/skills/scikit-bio/SKILL.md +431 -0
  730. package/data/skills/scikit-learn/SKILL.md +515 -0
  731. package/data/skills/scikit-survival/SKILL.md +393 -0
  732. package/data/skills/scrna-orchestrator/SKILL.md +204 -0
  733. package/data/skills/scrna-qc/SKILL.md +43 -0
  734. package/data/skills/scvelo/SKILL.md +321 -0
  735. package/data/skills/scvi-tools/SKILL.md +184 -0
  736. package/data/skills/seaborn/SKILL.md +671 -0
  737. package/data/skills/search-strategy/SKILL.md +247 -0
  738. package/data/skills/seq-wrangler/SKILL.md +58 -0
  739. package/data/skills/shap/SKILL.md +560 -0
  740. package/data/skills/simo-multiomics-integration-agent/SKILL.md +178 -0
  741. package/data/skills/simpy/SKILL.md +423 -0
  742. package/data/skills/simulation-orchestrator/SKILL.md +230 -0
  743. package/data/skills/simulation-validator/SKILL.md +195 -0
  744. package/data/skills/single-annotation/SKILL.md +129 -0
  745. package/data/skills/single-cell-rna-qc/SKILL.md +175 -0
  746. package/data/skills/single-cellphone-db/SKILL.md +68 -0
  747. package/data/skills/single-clustering/SKILL.md +75 -0
  748. package/data/skills/single-downstream-analysis/SKILL.md +150 -0
  749. package/data/skills/single-multiomics/SKILL.md +44 -0
  750. package/data/skills/single-preprocessing/SKILL.md +184 -0
  751. package/data/skills/single-to-spatial-mapping/SKILL.md +48 -0
  752. package/data/skills/single-trajectory/SKILL.md +62 -0
  753. package/data/skills/sleep-analyzer/SKILL.md +773 -0
  754. package/data/skills/slurm-job-script-generator/SKILL.md +135 -0
  755. package/data/skills/solublempnn/SKILL.md +165 -0
  756. package/data/skills/spatial-agent/SKILL.md +56 -0
  757. package/data/skills/spatial-epigenomics-agent/SKILL.md +163 -0
  758. package/data/skills/spatial-transcriptomics-agent/SKILL.md +75 -0
  759. package/data/skills/spatial-transcriptomics-analysis/SKILL.md +72 -0
  760. package/data/skills/spatial-transcriptomics-analysis/STAgent/SKILL.md +75 -0
  761. package/data/skills/spatial-transcriptomics-analysis/SpatialAgent/SKILL.md +56 -0
  762. package/data/skills/spatial-transcriptomics-analysis/bioSkills/image-analysis/SKILL.md +266 -0
  763. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-communication/SKILL.md +287 -0
  764. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-data-io/SKILL.md +243 -0
  765. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-deconvolution/SKILL.md +298 -0
  766. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-domains/SKILL.md +229 -0
  767. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-multiomics/SKILL.md +172 -0
  768. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-neighbors/SKILL.md +189 -0
  769. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-preprocessing/SKILL.md +232 -0
  770. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-proteomics/SKILL.md +127 -0
  771. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-statistics/SKILL.md +225 -0
  772. package/data/skills/spatial-transcriptomics-analysis/bioSkills/spatial-visualization/SKILL.md +270 -0
  773. package/data/skills/spatial-tutorials/SKILL.md +87 -0
  774. package/data/skills/speech-pathology-ai/SKILL.md +184 -0
  775. package/data/skills/statistical-analysis/SKILL.md +626 -0
  776. package/data/skills/statsmodels/SKILL.md +608 -0
  777. package/data/skills/string-database/SKILL.md +528 -0
  778. package/data/skills/struct-predictor/SKILL.md +52 -0
  779. package/data/skills/subagent-driven-development/SKILL.md +242 -0
  780. package/data/skills/systematic-debugging/SKILL.md +296 -0
  781. package/data/skills/tcell-exhaustion-analysis-agent/SKILL.md +139 -0
  782. package/data/skills/tcga-preprocessing/SKILL.md +49 -0
  783. package/data/skills/tcm-constitution-analyzer/SKILL.md +664 -0
  784. package/data/skills/tcr-pmhc-prediction-agent/SKILL.md +226 -0
  785. package/data/skills/tcr-repertoire-analysis-agent/SKILL.md +218 -0
  786. package/data/skills/test-driven-development/SKILL.md +371 -0
  787. package/data/skills/tiledbvcf/SKILL.md +459 -0
  788. package/data/skills/time-resolved-cryoem-agent/SKILL.md +223 -0
  789. package/data/skills/time-stepping/SKILL.md +140 -0
  790. package/data/skills/timesfm-forecasting/SKILL.md +785 -0
  791. package/data/skills/tme-immune-profiling-agent/SKILL.md +220 -0
  792. package/data/skills/tooluniverse-adverse-event-detection/SKILL.md +1115 -0
  793. package/data/skills/tooluniverse-antibody-engineering/SKILL.md +1581 -0
  794. package/data/skills/tooluniverse-binder-discovery/SKILL.md +1459 -0
  795. package/data/skills/tooluniverse-cancer-variant-interpretation/SKILL.md +971 -0
  796. package/data/skills/tooluniverse-chemical-compound-retrieval/SKILL.md +322 -0
  797. package/data/skills/tooluniverse-chemical-safety/SKILL.md +733 -0
  798. package/data/skills/tooluniverse-clinical-guidelines/SKILL.md +399 -0
  799. package/data/skills/tooluniverse-clinical-trial-design/SKILL.md +1195 -0
  800. package/data/skills/tooluniverse-clinical-trial-matching/SKILL.md +1333 -0
  801. package/data/skills/tooluniverse-crispr-screen-analysis/SKILL.md +900 -0
  802. package/data/skills/tooluniverse-disease-research/SKILL.md +630 -0
  803. package/data/skills/tooluniverse-drug-drug-interaction/SKILL.md +73 -0
  804. package/data/skills/tooluniverse-drug-repurposing/SKILL.md +595 -0
  805. package/data/skills/tooluniverse-drug-research/SKILL.md +1642 -0
  806. package/data/skills/tooluniverse-drug-target-validation/SKILL.md +1206 -0
  807. package/data/skills/tooluniverse-epigenomics/SKILL.md +1489 -0
  808. package/data/skills/tooluniverse-expression-data-retrieval/SKILL.md +389 -0
  809. package/data/skills/tooluniverse-gene-enrichment/SKILL.md +402 -0
  810. package/data/skills/tooluniverse-gwas-drug-discovery/SKILL.md +576 -0
  811. package/data/skills/tooluniverse-gwas-finemapping/SKILL.md +309 -0
  812. package/data/skills/tooluniverse-gwas-snp-interpretation/SKILL.md +223 -0
  813. package/data/skills/tooluniverse-gwas-study-explorer/SKILL.md +342 -0
  814. package/data/skills/tooluniverse-gwas-trait-to-gene/SKILL.md +236 -0
  815. package/data/skills/tooluniverse-image-analysis/SKILL.md +439 -0
  816. package/data/skills/tooluniverse-immune-repertoire-analysis/SKILL.md +949 -0
  817. package/data/skills/tooluniverse-immunotherapy-response-prediction/SKILL.md +865 -0
  818. package/data/skills/tooluniverse-infectious-disease/SKILL.md +749 -0
  819. package/data/skills/tooluniverse-literature-deep-research/SKILL.md +1050 -0
  820. package/data/skills/tooluniverse-metabolomics/SKILL.md +298 -0
  821. package/data/skills/tooluniverse-metabolomics-analysis/SKILL.md +764 -0
  822. package/data/skills/tooluniverse-multi-omics-integration/SKILL.md +703 -0
  823. package/data/skills/tooluniverse-multiomic-disease-characterization/SKILL.md +1138 -0
  824. package/data/skills/tooluniverse-network-pharmacology/SKILL.md +1312 -0
  825. package/data/skills/tooluniverse-pharmacovigilance/SKILL.md +807 -0
  826. package/data/skills/tooluniverse-phylogenetics/SKILL.md +461 -0
  827. package/data/skills/tooluniverse-polygenic-risk-score/SKILL.md +397 -0
  828. package/data/skills/tooluniverse-precision-medicine-stratification/SKILL.md +1143 -0
  829. package/data/skills/tooluniverse-precision-oncology/SKILL.md +1091 -0
  830. package/data/skills/tooluniverse-protein-interactions/SKILL.md +446 -0
  831. package/data/skills/tooluniverse-protein-structure-retrieval/SKILL.md +416 -0
  832. package/data/skills/tooluniverse-protein-therapeutic-design/SKILL.md +637 -0
  833. package/data/skills/tooluniverse-proteomics-analysis/SKILL.md +843 -0
  834. package/data/skills/tooluniverse-rare-disease-diagnosis/SKILL.md +1257 -0
  835. package/data/skills/tooluniverse-rnaseq-deseq2/SKILL.md +536 -0
  836. package/data/skills/tooluniverse-sequence-retrieval/SKILL.md +419 -0
  837. package/data/skills/tooluniverse-single-cell/SKILL.md +719 -0
  838. package/data/skills/tooluniverse-spatial-omics-analysis/SKILL.md +1102 -0
  839. package/data/skills/tooluniverse-spatial-transcriptomics/SKILL.md +788 -0
  840. package/data/skills/tooluniverse-statistical-modeling/SKILL.md +557 -0
  841. package/data/skills/tooluniverse-structural-variant-analysis/SKILL.md +1356 -0
  842. package/data/skills/tooluniverse-systems-biology/SKILL.md +374 -0
  843. package/data/skills/tooluniverse-target-research/SKILL.md +1510 -0
  844. package/data/skills/tooluniverse-variant-analysis/SKILL.md +448 -0
  845. package/data/skills/tooluniverse-variant-interpretation/SKILL.md +1118 -0
  846. package/data/skills/torch-geometric/SKILL.md +674 -0
  847. package/data/skills/torch_geometric/SKILL.md +670 -0
  848. package/data/skills/torchdrug/SKILL.md +444 -0
  849. package/data/skills/tpd-ternary-complex-agent/SKILL.md +226 -0
  850. package/data/skills/transformers/SKILL.md +157 -0
  851. package/data/skills/travel-health-analyzer/SKILL.md +421 -0
  852. package/data/skills/treatment-plans/SKILL.md +1576 -0
  853. package/data/skills/trial-eligibility-agent/SKILL.md +54 -0
  854. package/data/skills/trialgpt-matching/SKILL.md +66 -0
  855. package/data/skills/tumor-clonal-evolution-agent/SKILL.md +134 -0
  856. package/data/skills/tumor-heterogeneity-agent/SKILL.md +216 -0
  857. package/data/skills/tumor-mutational-burden-agent/SKILL.md +188 -0
  858. package/data/skills/ukb-navigator/SKILL.md +113 -0
  859. package/data/skills/umap-learn/SKILL.md +473 -0
  860. package/data/skills/uniprot-database/SKILL.md +189 -0
  861. package/data/skills/universal-single-cell-annotator/SKILL.md +72 -0
  862. package/data/skills/using-git-worktrees/SKILL.md +218 -0
  863. package/data/skills/using-superpowers/SKILL.md +95 -0
  864. package/data/skills/usmle/SKILL.md +62 -0
  865. package/data/skills/uspto-database/SKILL.md +597 -0
  866. package/data/skills/vaex/SKILL.md +180 -0
  867. package/data/skills/varcadd-pathogenicity/SKILL.md +68 -0
  868. package/data/skills/variant-interpretation-acmg/SKILL.md +58 -0
  869. package/data/skills/variant-interpretation-acmg/bioSkills/clinical-interpretation/SKILL.md +334 -0
  870. package/data/skills/variant-interpretation-acmg/bioSkills/consensus-sequences/SKILL.md +343 -0
  871. package/data/skills/variant-interpretation-acmg/bioSkills/deepvariant/SKILL.md +279 -0
  872. package/data/skills/variant-interpretation-acmg/bioSkills/filtering-best-practices/SKILL.md +362 -0
  873. package/data/skills/variant-interpretation-acmg/bioSkills/gatk-variant-calling/SKILL.md +398 -0
  874. package/data/skills/variant-interpretation-acmg/bioSkills/joint-calling/SKILL.md +343 -0
  875. package/data/skills/variant-interpretation-acmg/bioSkills/structural-variant-calling/SKILL.md +256 -0
  876. package/data/skills/variant-interpretation-acmg/bioSkills/variant-annotation/SKILL.md +387 -0
  877. package/data/skills/variant-interpretation-acmg/bioSkills/variant-calling/SKILL.md +258 -0
  878. package/data/skills/variant-interpretation-acmg/bioSkills/variant-normalization/SKILL.md +304 -0
  879. package/data/skills/variant-interpretation-acmg/bioSkills/vcf-basics/SKILL.md +329 -0
  880. package/data/skills/variant-interpretation-acmg/bioSkills/vcf-manipulation/SKILL.md +398 -0
  881. package/data/skills/variant-interpretation-acmg/bioSkills/vcf-statistics/SKILL.md +424 -0
  882. package/data/skills/variant-interpretation-acmg/varCADD/SKILL.md +68 -0
  883. package/data/skills/vcf-annotator/SKILL.md +55 -0
  884. package/data/skills/verification-before-completion/SKILL.md +139 -0
  885. package/data/skills/virtual-lab-agent/SKILL.md +240 -0
  886. package/data/skills/wearable-analysis-agent/SKILL.md +70 -0
  887. package/data/skills/weightloss-analyzer/SKILL.md +320 -0
  888. package/data/skills/wellally-tech/SKILL.md +685 -0
  889. package/data/skills/wikipedia-search/SKILL.md +481 -0
  890. package/data/skills/writing-plans/SKILL.md +116 -0
  891. package/data/skills/writing-skills/SKILL.md +655 -0
  892. package/data/skills/xlsx/SKILL.md +292 -0
  893. package/data/skills/xlsx-official/SKILL.md +289 -0
  894. package/data/skills/zarr-python/SKILL.md +777 -0
  895. package/data/skills/zinc-database/SKILL.md +398 -0
  896. package/data/tools/__init__.py +8 -0
  897. package/data/tools/hpc.py +71 -0
  898. package/data/tools/hpc_client/__init__.py +8 -0
  899. package/data/tools/hpc_client/builders/__init__.py +12 -0
  900. package/data/tools/hpc_client/builders/alphafold.py +36 -0
  901. package/data/tools/hpc_client/builders/boltz.py +33 -0
  902. package/data/tools/hpc_client/builders/chai.py +30 -0
  903. package/data/tools/hpc_client/builders/immunebuilder.py +31 -0
  904. package/data/tools/hpc_client/builders/rfantibody.py +58 -0
  905. package/data/tools/hpc_client/builders/thermompnn.py +16 -0
  906. package/data/tools/hpc_client/hpc_api.py +41 -0
  907. package/data/tools/hpc_client/hpc_tools.py +218 -0
  908. package/data/tools/hpc_dynamic.py +71 -0
  909. package/data/tools/integrations/__init__.py +14 -0
  910. package/data/tools/integrations/adaptyv.py +107 -0
  911. package/data/tools/integrations/addgene.py +52 -0
  912. package/data/tools/integrations/api_internal.py +33 -0
  913. package/data/tools/molecular_biology.py +688 -0
  914. package/data/tools/pharmacology.py +67 -0
  915. package/data/workflows/bulk-omics-clustering/SKILL.md +501 -0
  916. package/data/workflows/bulk-omics-clustering/references/best_practices.md +395 -0
  917. package/data/workflows/bulk-omics-clustering/references/clustering_methods_comparison.md +288 -0
  918. package/data/workflows/bulk-omics-clustering/references/common-patterns.md +1136 -0
  919. package/data/workflows/bulk-omics-clustering/references/decision-guide.md +819 -0
  920. package/data/workflows/bulk-omics-clustering/references/distance_metrics_guide.md +388 -0
  921. package/data/workflows/bulk-omics-clustering/references/parameter_guide.md +396 -0
  922. package/data/workflows/bulk-omics-clustering/references/r-quick-start.md +105 -0
  923. package/data/workflows/bulk-omics-clustering/references/validation_metrics_guide.md +315 -0
  924. package/data/workflows/bulk-omics-clustering/scripts/characterize_clusters.py +255 -0
  925. package/data/workflows/bulk-omics-clustering/scripts/cluster_validation.py +449 -0
  926. package/data/workflows/bulk-omics-clustering/scripts/density_clustering.py +321 -0
  927. package/data/workflows/bulk-omics-clustering/scripts/dimensionality_reduction.py +328 -0
  928. package/data/workflows/bulk-omics-clustering/scripts/distance_metrics.py +251 -0
  929. package/data/workflows/bulk-omics-clustering/scripts/export_results.py +456 -0
  930. package/data/workflows/bulk-omics-clustering/scripts/hierarchical_clustering.R +229 -0
  931. package/data/workflows/bulk-omics-clustering/scripts/hierarchical_clustering.py +269 -0
  932. package/data/workflows/bulk-omics-clustering/scripts/kmeans_clustering.py +346 -0
  933. package/data/workflows/bulk-omics-clustering/scripts/load_example_data.R +171 -0
  934. package/data/workflows/bulk-omics-clustering/scripts/load_example_data.py +171 -0
  935. package/data/workflows/bulk-omics-clustering/scripts/model_based_clustering.py +370 -0
  936. package/data/workflows/bulk-omics-clustering/scripts/optimal_clusters.py +381 -0
  937. package/data/workflows/bulk-omics-clustering/scripts/plot_cluster_heatmap.R +141 -0
  938. package/data/workflows/bulk-omics-clustering/scripts/plot_clustering_results.py +452 -0
  939. package/data/workflows/bulk-omics-clustering/scripts/prepare_data.py +250 -0
  940. package/data/workflows/bulk-omics-clustering/scripts/stability_analysis.py +434 -0
  941. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/SKILL.md +505 -0
  942. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/references/comprehensive-reference.md +440 -0
  943. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/references/decision-guide.md +327 -0
  944. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/references/troubleshooting.md +456 -0
  945. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/references/usage-guide.md +75 -0
  946. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/scripts/basic_workflow.R +149 -0
  947. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/scripts/batch_correction.R +44 -0
  948. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/scripts/export_results.R +190 -0
  949. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/scripts/extract_results.R +242 -0
  950. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/scripts/load_example_data.R +250 -0
  951. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/scripts/multi_condition.R +50 -0
  952. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/scripts/qc_plots.R +410 -0
  953. package/data/workflows/bulk-rnaseq-counts-to-de-deseq2/scripts/transformations.R +218 -0
  954. package/data/workflows/chip-atlas-diff-analysis/SKILL.md +222 -0
  955. package/data/workflows/chip-atlas-diff-analysis/references/chipatlas_diff_api_format.md +106 -0
  956. package/data/workflows/chip-atlas-diff-analysis/references/diff_analysis_methods.md +89 -0
  957. package/data/workflows/chip-atlas-diff-analysis/references/output_format.md +78 -0
  958. package/data/workflows/chip-atlas-diff-analysis/scripts/__init__.py +1 -0
  959. package/data/workflows/chip-atlas-diff-analysis/scripts/annotate_genes.py +144 -0
  960. package/data/workflows/chip-atlas-diff-analysis/scripts/export_all.py +498 -0
  961. package/data/workflows/chip-atlas-diff-analysis/scripts/filter_regions.py +176 -0
  962. package/data/workflows/chip-atlas-diff-analysis/scripts/generate_all_plots.py +321 -0
  963. package/data/workflows/chip-atlas-diff-analysis/scripts/load_example_data.py +149 -0
  964. package/data/workflows/chip-atlas-diff-analysis/scripts/load_user_data.py +211 -0
  965. package/data/workflows/chip-atlas-diff-analysis/scripts/parse_bed_results.py +240 -0
  966. package/data/workflows/chip-atlas-diff-analysis/scripts/qc_checks.py +621 -0
  967. package/data/workflows/chip-atlas-diff-analysis/scripts/query_chipatlas_api.py +329 -0
  968. package/data/workflows/chip-atlas-diff-analysis/scripts/run_diff_workflow.py +256 -0
  969. package/data/workflows/chip-atlas-peak-enrichment/SKILL.md +212 -0
  970. package/data/workflows/chip-atlas-peak-enrichment/references/chipatlas_metadata_format.md +115 -0
  971. package/data/workflows/chip-atlas-peak-enrichment/references/enrichment_statistics.md +145 -0
  972. package/data/workflows/chip-atlas-peak-enrichment/references/peak_thresholds.md +63 -0
  973. package/data/workflows/chip-atlas-peak-enrichment/references/promoter_definitions.md +69 -0
  974. package/data/workflows/chip-atlas-peak-enrichment/scripts/__init__.py +1 -0
  975. package/data/workflows/chip-atlas-peak-enrichment/scripts/convert_genes_to_regions.py +271 -0
  976. package/data/workflows/chip-atlas-peak-enrichment/scripts/export_all.py +456 -0
  977. package/data/workflows/chip-atlas-peak-enrichment/scripts/filter_experiments.py +116 -0
  978. package/data/workflows/chip-atlas-peak-enrichment/scripts/generate_all_plots.py +280 -0
  979. package/data/workflows/chip-atlas-peak-enrichment/scripts/load_example_data.py +96 -0
  980. package/data/workflows/chip-atlas-peak-enrichment/scripts/load_user_data.py +183 -0
  981. package/data/workflows/chip-atlas-peak-enrichment/scripts/query_chipatlas_api.py +349 -0
  982. package/data/workflows/chip-atlas-peak-enrichment/scripts/run_enrichment_workflow.py +271 -0
  983. package/data/workflows/chip-atlas-target-genes/SKILL.md +230 -0
  984. package/data/workflows/chip-atlas-target-genes/references/macs2_binding_scores.md +89 -0
  985. package/data/workflows/chip-atlas-target-genes/references/string_scores.md +58 -0
  986. package/data/workflows/chip-atlas-target-genes/references/target_genes_data_format.md +73 -0
  987. package/data/workflows/chip-atlas-target-genes/scripts/__init__.py +0 -0
  988. package/data/workflows/chip-atlas-target-genes/scripts/download_target_genes.py +200 -0
  989. package/data/workflows/chip-atlas-target-genes/scripts/export_all.py +340 -0
  990. package/data/workflows/chip-atlas-target-genes/scripts/filter_targets.py +205 -0
  991. package/data/workflows/chip-atlas-target-genes/scripts/generate_all_plots.py +330 -0
  992. package/data/workflows/chip-atlas-target-genes/scripts/load_example_query.py +61 -0
  993. package/data/workflows/chip-atlas-target-genes/scripts/load_user_query.py +47 -0
  994. package/data/workflows/chip-atlas-target-genes/scripts/run_target_genes_workflow.py +141 -0
  995. package/data/workflows/clinicaltrials-landscape/SKILL.md +257 -0
  996. package/data/workflows/clinicaltrials-landscape/references/api-parameters.md +181 -0
  997. package/data/workflows/clinicaltrials-landscape/references/mechanisms.md +141 -0
  998. package/data/workflows/clinicaltrials-landscape/references/output-schema.md +184 -0
  999. package/data/workflows/clinicaltrials-landscape/scripts/__init__.py +1 -0
  1000. package/data/workflows/clinicaltrials-landscape/scripts/classify_mechanisms.py +359 -0
  1001. package/data/workflows/clinicaltrials-landscape/scripts/compile_trials.py +579 -0
  1002. package/data/workflows/clinicaltrials-landscape/scripts/disease_config.py +161 -0
  1003. package/data/workflows/clinicaltrials-landscape/scripts/export_all.py +242 -0
  1004. package/data/workflows/clinicaltrials-landscape/scripts/generate_landscape_plots.py +761 -0
  1005. package/data/workflows/clinicaltrials-landscape/scripts/generate_pdf_report.py +1465 -0
  1006. package/data/workflows/clinicaltrials-landscape/scripts/generate_report.py +1813 -0
  1007. package/data/workflows/clinicaltrials-landscape/scripts/query_clinicaltrials.py +307 -0
  1008. package/data/workflows/coexpression-network/SKILL.md +344 -0
  1009. package/data/workflows/coexpression-network/references/parameter-tuning-guide.md +591 -0
  1010. package/data/workflows/coexpression-network/references/troubleshooting.md +483 -0
  1011. package/data/workflows/coexpression-network/references/wgcna-best-practices.md +563 -0
  1012. package/data/workflows/coexpression-network/references/wgcna-reference.md +538 -0
  1013. package/data/workflows/coexpression-network/scripts/build_network.R +43 -0
  1014. package/data/workflows/coexpression-network/scripts/correlate_modules_traits.R +92 -0
  1015. package/data/workflows/coexpression-network/scripts/export_wgcna_results.R +117 -0
  1016. package/data/workflows/coexpression-network/scripts/identify_hub_genes.R +63 -0
  1017. package/data/workflows/coexpression-network/scripts/load_example_data.R +214 -0
  1018. package/data/workflows/coexpression-network/scripts/module_enrichment.R +159 -0
  1019. package/data/workflows/coexpression-network/scripts/pick_soft_power.R +70 -0
  1020. package/data/workflows/coexpression-network/scripts/plot_all_wgcna.R +104 -0
  1021. package/data/workflows/coexpression-network/scripts/plot_eigengene_heatmap.R +65 -0
  1022. package/data/workflows/coexpression-network/scripts/plot_hub_genes.R +70 -0
  1023. package/data/workflows/coexpression-network/scripts/plot_module_dendrogram.R +50 -0
  1024. package/data/workflows/coexpression-network/scripts/plotting_helpers.R +87 -0
  1025. package/data/workflows/coexpression-network/scripts/prepare_wgcna_data.R +73 -0
  1026. package/data/workflows/coexpression-network/scripts/wgcna_workflow.R +93 -0
  1027. package/data/workflows/experimental-design-statistics/SKILL.md +408 -0
  1028. package/data/workflows/experimental-design-statistics/references/batch_effect_mitigation.md +756 -0
  1029. package/data/workflows/experimental-design-statistics/references/cv_tissue_database.csv +30 -0
  1030. package/data/workflows/experimental-design-statistics/references/experimental_design_best_practices.md +515 -0
  1031. package/data/workflows/experimental-design-statistics/references/multiple_testing_guide.md +730 -0
  1032. package/data/workflows/experimental-design-statistics/references/power_analysis_guidelines.md +635 -0
  1033. package/data/workflows/experimental-design-statistics/references/qc_guidelines.md +310 -0
  1034. package/data/workflows/experimental-design-statistics/references/software_requirements.md +328 -0
  1035. package/data/workflows/experimental-design-statistics/references/troubleshooting_guide.md +510 -0
  1036. package/data/workflows/experimental-design-statistics/scripts/batch_assignment.R +302 -0
  1037. package/data/workflows/experimental-design-statistics/scripts/batch_validation.R +342 -0
  1038. package/data/workflows/experimental-design-statistics/scripts/export_design.R +352 -0
  1039. package/data/workflows/experimental-design-statistics/scripts/load_example_data.R +204 -0
  1040. package/data/workflows/experimental-design-statistics/scripts/multiple_testing.R +417 -0
  1041. package/data/workflows/experimental-design-statistics/scripts/plot_power_curves.R +317 -0
  1042. package/data/workflows/experimental-design-statistics/scripts/power_atacseq.R +229 -0
  1043. package/data/workflows/experimental-design-statistics/scripts/power_pilot_based.R +289 -0
  1044. package/data/workflows/experimental-design-statistics/scripts/power_rnaseq.R +247 -0
  1045. package/data/workflows/experimental-design-statistics/scripts/sample_size_de.R +327 -0
  1046. package/data/workflows/experimental-design-statistics/scripts/sample_size_scrna.R +304 -0
  1047. package/data/workflows/functional-enrichment-from-degs/SKILL.md +387 -0
  1048. package/data/workflows/functional-enrichment-from-degs/references/database_guide.md +354 -0
  1049. package/data/workflows/functional-enrichment-from-degs/references/decision-guide.md +546 -0
  1050. package/data/workflows/functional-enrichment-from-degs/references/gsea_ora_comparison.md +213 -0
  1051. package/data/workflows/functional-enrichment-from-degs/references/gsea_ora_validation_framework.md +483 -0
  1052. package/data/workflows/functional-enrichment-from-degs/references/interpretation_guidelines.md +374 -0
  1053. package/data/workflows/functional-enrichment-from-degs/references/method-reference.md +742 -0
  1054. package/data/workflows/functional-enrichment-from-degs/scripts/export_results.R +190 -0
  1055. package/data/workflows/functional-enrichment-from-degs/scripts/generate_plots.R +240 -0
  1056. package/data/workflows/functional-enrichment-from-degs/scripts/get_msigdb_genesets.R +75 -0
  1057. package/data/workflows/functional-enrichment-from-degs/scripts/load_de_results.R +60 -0
  1058. package/data/workflows/functional-enrichment-from-degs/scripts/load_example_data.R +212 -0
  1059. package/data/workflows/functional-enrichment-from-degs/scripts/prepare_gene_lists.R +92 -0
  1060. package/data/workflows/functional-enrichment-from-degs/scripts/run_gsea.R +44 -0
  1061. package/data/workflows/functional-enrichment-from-degs/scripts/run_ora.R +53 -0
  1062. package/data/workflows/genetic-variant-annotation/SKILL.md +440 -0
  1063. package/data/workflows/genetic-variant-annotation/references/auto_installation_implementation.md +274 -0
  1064. package/data/workflows/genetic-variant-annotation/references/consequence_terms.md +392 -0
  1065. package/data/workflows/genetic-variant-annotation/references/filtering_strategies.md +808 -0
  1066. package/data/workflows/genetic-variant-annotation/references/installation_guide.md +557 -0
  1067. package/data/workflows/genetic-variant-annotation/references/pathogenicity_interpretation.md +473 -0
  1068. package/data/workflows/genetic-variant-annotation/references/qc_guidelines.md +524 -0
  1069. package/data/workflows/genetic-variant-annotation/references/snpeff_best_practices.md +481 -0
  1070. package/data/workflows/genetic-variant-annotation/references/tool_selection_guide.md +433 -0
  1071. package/data/workflows/genetic-variant-annotation/references/troubleshooting_guide.md +678 -0
  1072. package/data/workflows/genetic-variant-annotation/references/vep_best_practices.md +450 -0
  1073. package/data/workflows/genetic-variant-annotation/scripts/annotate_genes.py +243 -0
  1074. package/data/workflows/genetic-variant-annotation/scripts/export_results.py +450 -0
  1075. package/data/workflows/genetic-variant-annotation/scripts/filter_variants.py +365 -0
  1076. package/data/workflows/genetic-variant-annotation/scripts/install_tools.py +246 -0
  1077. package/data/workflows/genetic-variant-annotation/scripts/load_example_data.py +166 -0
  1078. package/data/workflows/genetic-variant-annotation/scripts/parse_snpeff_output.py +283 -0
  1079. package/data/workflows/genetic-variant-annotation/scripts/parse_vep_output.py +257 -0
  1080. package/data/workflows/genetic-variant-annotation/scripts/plot_variant_distribution.py +372 -0
  1081. package/data/workflows/genetic-variant-annotation/scripts/prioritize_variants.py +287 -0
  1082. package/data/workflows/genetic-variant-annotation/scripts/run_snpeff.py +418 -0
  1083. package/data/workflows/genetic-variant-annotation/scripts/run_vep.py +358 -0
  1084. package/data/workflows/genetic-variant-annotation/scripts/select_tool.py +203 -0
  1085. package/data/workflows/genetic-variant-annotation/scripts/test_complete_workflow.py +312 -0
  1086. package/data/workflows/genetic-variant-annotation/scripts/test_pickle_load.py +118 -0
  1087. package/data/workflows/genetic-variant-annotation/scripts/validate_vcf.py +351 -0
  1088. package/data/workflows/genetic-variant-annotation/scripts/verify_changes.py +212 -0
  1089. package/data/workflows/grn-pyscenic/SKILL.md +331 -0
  1090. package/data/workflows/grn-pyscenic/references/cli_interface.md +222 -0
  1091. package/data/workflows/grn-pyscenic/references/database_downloads.md +245 -0
  1092. package/data/workflows/grn-pyscenic/scripts/export_all.py +192 -0
  1093. package/data/workflows/grn-pyscenic/scripts/generate_report.py +512 -0
  1094. package/data/workflows/grn-pyscenic/scripts/integrate_with_adata.py +54 -0
  1095. package/data/workflows/grn-pyscenic/scripts/load_example_data.py +200 -0
  1096. package/data/workflows/grn-pyscenic/scripts/load_expression_data.py +61 -0
  1097. package/data/workflows/grn-pyscenic/scripts/plot_regulon_visualizations.py +263 -0
  1098. package/data/workflows/grn-pyscenic/scripts/run_grn_workflow.py +184 -0
  1099. package/data/workflows/gwas-to-function-twas/SKILL.md +394 -0
  1100. package/data/workflows/gwas-to-function-twas/references/fusion_best_practices.md +120 -0
  1101. package/data/workflows/gwas-to-function-twas/references/installation-guide.md +414 -0
  1102. package/data/workflows/gwas-to-function-twas/references/ldsc_qc_guidelines.md +287 -0
  1103. package/data/workflows/gwas-to-function-twas/references/spredixxcan_best_practices.md +166 -0
  1104. package/data/workflows/gwas-to-function-twas/references/therapeutic_interpretation_guide.md +717 -0
  1105. package/data/workflows/gwas-to-function-twas/references/tissue_reference_guide.md +182 -0
  1106. package/data/workflows/gwas-to-function-twas/references/troubleshooting_guide.md +317 -0
  1107. package/data/workflows/gwas-to-function-twas/references/twas_hub_validation_guide.md +88 -0
  1108. package/data/workflows/gwas-to-function-twas/scripts/colocalization_analysis.py +187 -0
  1109. package/data/workflows/gwas-to-function-twas/scripts/druggability_scoring.py +199 -0
  1110. package/data/workflows/gwas-to-function-twas/scripts/export_results.py +220 -0
  1111. package/data/workflows/gwas-to-function-twas/scripts/integrate_variant_annotation.py +194 -0
  1112. package/data/workflows/gwas-to-function-twas/scripts/interpret_therapeutic_direction.py +418 -0
  1113. package/data/workflows/gwas-to-function-twas/scripts/mendelian_randomization.py +749 -0
  1114. package/data/workflows/gwas-to-function-twas/scripts/multilayer_direction_analysis.py +471 -0
  1115. package/data/workflows/gwas-to-function-twas/scripts/plot_twas_results.py +252 -0
  1116. package/data/workflows/gwas-to-function-twas/scripts/run_fusion.py +155 -0
  1117. package/data/workflows/gwas-to-function-twas/scripts/run_smultixcan.py +102 -0
  1118. package/data/workflows/gwas-to-function-twas/scripts/run_spredixxcan.py +138 -0
  1119. package/data/workflows/gwas-to-function-twas/scripts/select_reference_panel.py +253 -0
  1120. package/data/workflows/gwas-to-function-twas/scripts/validate_gwas_sumstats.py +214 -0
  1121. package/data/workflows/gwas-to-function-twas/scripts/validate_with_twas_hub.py +439 -0
  1122. package/data/workflows/lasso-biomarker-panel/SKILL.md +322 -0
  1123. package/data/workflows/lasso-biomarker-panel/references/decision-guide.md +64 -0
  1124. package/data/workflows/lasso-biomarker-panel/references/lasso-reference.md +110 -0
  1125. package/data/workflows/lasso-biomarker-panel/references/validation-guide.md +105 -0
  1126. package/data/workflows/lasso-biomarker-panel/scripts/biological_interpretation.R +1560 -0
  1127. package/data/workflows/lasso-biomarker-panel/scripts/biomarker_plots.R +350 -0
  1128. package/data/workflows/lasso-biomarker-panel/scripts/export_results.R +1492 -0
  1129. package/data/workflows/lasso-biomarker-panel/scripts/lasso_workflow.R +328 -0
  1130. package/data/workflows/lasso-biomarker-panel/scripts/load_example_data.R +1903 -0
  1131. package/data/workflows/lasso-biomarker-panel/scripts/plotting_helpers.R +78 -0
  1132. package/data/workflows/lasso-biomarker-panel/scripts/prepare_features.R +225 -0
  1133. package/data/workflows/lasso-biomarker-panel/scripts/query_cellxgene.py +107 -0
  1134. package/data/workflows/lasso-biomarker-panel/scripts/validate_external.R +174 -0
  1135. package/data/workflows/literature-preclinical/SKILL.md +276 -0
  1136. package/data/workflows/literature-preclinical/assets/eval/simple_test.py +386 -0
  1137. package/data/workflows/literature-preclinical/references/experiment-extraction-guide.md +147 -0
  1138. package/data/workflows/literature-preclinical/references/full-text-enrichment-guide.md +121 -0
  1139. package/data/workflows/literature-preclinical/references/preclinical-search-guide.md +117 -0
  1140. package/data/workflows/literature-preclinical/scripts/extract_experiments.py +401 -0
  1141. package/data/workflows/literature-preclinical/scripts/generate_plots.R +303 -0
  1142. package/data/workflows/literature-preclinical/scripts/narrative_synthesis.py +653 -0
  1143. package/data/workflows/literature-preclinical/scripts/preclinical_search.py +332 -0
  1144. package/data/workflows/literature-preclinical/scripts/preclinical_synthesis.py +237 -0
  1145. package/data/workflows/literature-preclinical/scripts/report_generation.py +326 -0
  1146. package/data/workflows/mendelian-randomization-twosamplemr/SKILL.md +210 -0
  1147. package/data/workflows/mendelian-randomization-twosamplemr/references/interpretation-guide.md +239 -0
  1148. package/data/workflows/mendelian-randomization-twosamplemr/references/method-reference.md +190 -0
  1149. package/data/workflows/mendelian-randomization-twosamplemr/scripts/export_results.R +123 -0
  1150. package/data/workflows/mendelian-randomization-twosamplemr/scripts/generate_report.R +411 -0
  1151. package/data/workflows/mendelian-randomization-twosamplemr/scripts/load_data.R +281 -0
  1152. package/data/workflows/mendelian-randomization-twosamplemr/scripts/mr_plots.R +163 -0
  1153. package/data/workflows/mendelian-randomization-twosamplemr/scripts/run_mr_analysis.R +322 -0
  1154. package/data/workflows/pcr-primer-design/SKILL.md +397 -0
  1155. package/data/workflows/pcr-primer-design/references/code_examples.md +594 -0
  1156. package/data/workflows/pcr-primer-design/references/miqe_guidelines.md +453 -0
  1157. package/data/workflows/pcr-primer-design/references/parameter_ranges.md +356 -0
  1158. package/data/workflows/pcr-primer-design/references/primer_design_best_practices.md +451 -0
  1159. package/data/workflows/pcr-primer-design/references/troubleshooting_guide.md +477 -0
  1160. package/data/workflows/pcr-primer-design/scripts/__init__.py +2 -0
  1161. package/data/workflows/pcr-primer-design/scripts/calculate_tm.py +306 -0
  1162. package/data/workflows/pcr-primer-design/scripts/check_dimers.py +298 -0
  1163. package/data/workflows/pcr-primer-design/scripts/check_secondary_structures.py +343 -0
  1164. package/data/workflows/pcr-primer-design/scripts/design_qpcr_primers.py +233 -0
  1165. package/data/workflows/pcr-primer-design/scripts/design_standard_primers.py +197 -0
  1166. package/data/workflows/pcr-primer-design/scripts/design_taqman_probes.py +226 -0
  1167. package/data/workflows/pcr-primer-design/scripts/export_results.py +382 -0
  1168. package/data/workflows/pcr-primer-design/scripts/generate_reports.py +379 -0
  1169. package/data/workflows/pcr-primer-design/scripts/validate_specificity.py +311 -0
  1170. package/data/workflows/pcr-primer-design/scripts/visualize_primers.py +379 -0
  1171. package/data/workflows/polygenic-risk-score-prs-catalog/SKILL.md +195 -0
  1172. package/data/workflows/polygenic-risk-score-prs-catalog/references/interpretation-guide.md +80 -0
  1173. package/data/workflows/polygenic-risk-score-prs-catalog/references/pgs-catalog-guide.md +109 -0
  1174. package/data/workflows/polygenic-risk-score-prs-catalog/scripts/export_results.R +186 -0
  1175. package/data/workflows/polygenic-risk-score-prs-catalog/scripts/generate_plots.R +283 -0
  1176. package/data/workflows/polygenic-risk-score-prs-catalog/scripts/load_pgs_weights.R +228 -0
  1177. package/data/workflows/polygenic-risk-score-prs-catalog/scripts/load_reference_data.R +191 -0
  1178. package/data/workflows/polygenic-risk-score-prs-catalog/scripts/score_traits.R +216 -0
  1179. package/data/workflows/pooled-crispr-screens/SKILL.md +362 -0
  1180. package/data/workflows/pooled-crispr-screens/references/crispr_screen_best_practices.md +349 -0
  1181. package/data/workflows/pooled-crispr-screens/references/qc_guidelines.md +722 -0
  1182. package/data/workflows/pooled-crispr-screens/references/statistical_methods.md +644 -0
  1183. package/data/workflows/pooled-crispr-screens/references/troubleshooting_guide.md +684 -0
  1184. package/data/workflows/pooled-crispr-screens/references/umi_optimization.md +297 -0
  1185. package/data/workflows/pooled-crispr-screens/scripts/concatenate_libraries.py +132 -0
  1186. package/data/workflows/pooled-crispr-screens/scripts/detect_perturbed_cells.py +255 -0
  1187. package/data/workflows/pooled-crispr-screens/scripts/differential_expression.py +202 -0
  1188. package/data/workflows/pooled-crispr-screens/scripts/differential_expression_glmgampoi.py +320 -0
  1189. package/data/workflows/pooled-crispr-screens/scripts/export_results.py +261 -0
  1190. package/data/workflows/pooled-crispr-screens/scripts/expression_filtering.py +159 -0
  1191. package/data/workflows/pooled-crispr-screens/scripts/gene_name_corrections.py +188 -0
  1192. package/data/workflows/pooled-crispr-screens/scripts/generate_report.py +485 -0
  1193. package/data/workflows/pooled-crispr-screens/scripts/load_10x_libraries.py +69 -0
  1194. package/data/workflows/pooled-crispr-screens/scripts/load_example_data.py +257 -0
  1195. package/data/workflows/pooled-crispr-screens/scripts/map_sgrna_to_cells.py +119 -0
  1196. package/data/workflows/pooled-crispr-screens/scripts/normalize_and_scale.py +140 -0
  1197. package/data/workflows/pooled-crispr-screens/scripts/qc_filtering.py +185 -0
  1198. package/data/workflows/pooled-crispr-screens/scripts/run_glmgampoi.R +181 -0
  1199. package/data/workflows/pooled-crispr-screens/scripts/screen_all_perturbations.py +306 -0
  1200. package/data/workflows/pooled-crispr-screens/scripts/validate_perturbations.py +314 -0
  1201. package/data/workflows/pooled-crispr-screens/scripts/visualize_perturbations.py +314 -0
  1202. package/data/workflows/scrnaseq-scanpy-core-analysis/SKILL.md +425 -0
  1203. package/data/workflows/scrnaseq-scanpy-core-analysis/references/ambient_rna_correction.md +422 -0
  1204. package/data/workflows/scrnaseq-scanpy-core-analysis/references/common-patterns.md +533 -0
  1205. package/data/workflows/scrnaseq-scanpy-core-analysis/references/integration_methods.md +820 -0
  1206. package/data/workflows/scrnaseq-scanpy-core-analysis/references/marker_gene_database.md +471 -0
  1207. package/data/workflows/scrnaseq-scanpy-core-analysis/references/pseudobulk_de_guide.md +408 -0
  1208. package/data/workflows/scrnaseq-scanpy-core-analysis/references/qc_guidelines.md +535 -0
  1209. package/data/workflows/scrnaseq-scanpy-core-analysis/references/scanpy_best_practices.md +496 -0
  1210. package/data/workflows/scrnaseq-scanpy-core-analysis/references/troubleshooting_guide.md +668 -0
  1211. package/data/workflows/scrnaseq-scanpy-core-analysis/references/workflow-details.md +727 -0
  1212. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/annotate_celltypes.py +431 -0
  1213. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/cluster_cells.py +293 -0
  1214. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/export_results.py +423 -0
  1215. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/filter_cells.py +531 -0
  1216. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/find_markers.py +391 -0
  1217. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/find_variable_genes.py +222 -0
  1218. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/integrate_scvi.py +665 -0
  1219. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/integration_diagnostics.py +678 -0
  1220. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/load_example_data.py +68 -0
  1221. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/normalize_data.py +325 -0
  1222. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/plot_dimreduction.py +389 -0
  1223. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/plot_qc.py +320 -0
  1224. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/pseudobulk_de.py +553 -0
  1225. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/qc_metrics.py +477 -0
  1226. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/remove_ambient_rna.py +347 -0
  1227. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/run_umap.py +188 -0
  1228. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/scale_and_pca.py +365 -0
  1229. package/data/workflows/scrnaseq-scanpy-core-analysis/scripts/setup_and_import.py +334 -0
  1230. package/data/workflows/scrnaseq-seurat-core-analysis/SKILL.md +585 -0
  1231. package/data/workflows/scrnaseq-seurat-core-analysis/references/ambient_rna_correction.md +422 -0
  1232. package/data/workflows/scrnaseq-seurat-core-analysis/references/common-patterns.md +667 -0
  1233. package/data/workflows/scrnaseq-seurat-core-analysis/references/decision-guide.md +456 -0
  1234. package/data/workflows/scrnaseq-seurat-core-analysis/references/integration_methods.md +864 -0
  1235. package/data/workflows/scrnaseq-seurat-core-analysis/references/marker_gene_database.md +471 -0
  1236. package/data/workflows/scrnaseq-seurat-core-analysis/references/pseudobulk_de_guide.md +408 -0
  1237. package/data/workflows/scrnaseq-seurat-core-analysis/references/qc_guidelines.md +452 -0
  1238. package/data/workflows/scrnaseq-seurat-core-analysis/references/seurat_best_practices.md +417 -0
  1239. package/data/workflows/scrnaseq-seurat-core-analysis/references/troubleshooting_guide.md +566 -0
  1240. package/data/workflows/scrnaseq-seurat-core-analysis/references/workflow-details.md +801 -0
  1241. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/annotate_celltypes.R +306 -0
  1242. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/cluster_cells.R +223 -0
  1243. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/export_results.R +292 -0
  1244. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/filter_cells.R +576 -0
  1245. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/find_markers.R +325 -0
  1246. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/find_variable_features.R +106 -0
  1247. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/integrate_batches.R +504 -0
  1248. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/integration_diagnostics.R +596 -0
  1249. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/load_example_data.R +89 -0
  1250. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/normalize_data.R +184 -0
  1251. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/plot_dimreduction.R +273 -0
  1252. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/plot_qc.R +250 -0
  1253. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/pseudobulk_de.R +324 -0
  1254. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/qc_metrics.R +358 -0
  1255. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/remove_ambient_rna.R +281 -0
  1256. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/run_umap.R +116 -0
  1257. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/scale_and_pca.R +243 -0
  1258. package/data/workflows/scrnaseq-seurat-core-analysis/scripts/setup_and_import.R +193 -0
  1259. package/data/workflows/spatial-transcriptomics/SKILL.md +256 -0
  1260. package/data/workflows/spatial-transcriptomics/references/spatial-analysis-guide.md +216 -0
  1261. package/data/workflows/spatial-transcriptomics/scripts/export_results.py +214 -0
  1262. package/data/workflows/spatial-transcriptomics/scripts/generate_all_plots.py +397 -0
  1263. package/data/workflows/spatial-transcriptomics/scripts/load_example_data.py +175 -0
  1264. package/data/workflows/spatial-transcriptomics/scripts/spatial_workflow.py +206 -0
  1265. package/dist/bgi.js +28 -1
  1266. package/package.json +2 -1
@@ -0,0 +1,1459 @@
1
+ ---
2
+ name: tooluniverse-binder-discovery
3
+ description: Discover novel small molecule binders for protein targets using structure-based and ligand-based approaches. Creates actionable reports with candidate compounds, ADMET profiles, and synthesis feasibility. Use when users ask to find small molecules for a target, identify novel binders, perform virtual screening, or need hit-to-lead compound identification.
4
+ ---
5
+
6
+ # Small Molecule Binder Discovery Strategy
7
+
8
+ Systematic discovery of novel small molecule binders using 60+ ToolUniverse tools across druggability assessment, known ligand mining, similarity expansion, ADMET filtering, and synthesis feasibility.
9
+
10
+ **KEY PRINCIPLES**:
11
+ 1. **Report-first approach** - Create report file FIRST, then populate progressively
12
+ 2. **Target validation FIRST** - Confirm druggability before compound searching
13
+ 3. **Multi-strategy approach** - Combine structure-based and ligand-based methods
14
+ 4. **ADMET-aware filtering** - Eliminate poor compounds early
15
+ 5. **Evidence grading** - Grade candidates by supporting evidence
16
+ 6. **Actionable output** - Provide prioritized candidates with rationale
17
+ 7. **English-first queries** - Always use English terms in tool calls, even if the user writes in another language. Only try original-language terms as a fallback. Respond in the user's language
18
+
19
+ ---
20
+
21
+ ## Critical Workflow Requirements
22
+
23
+ ### 1. Report-First Approach (MANDATORY)
24
+
25
+ **DO NOT** show search process or tool outputs to the user. Instead:
26
+
27
+ 1. **Create the report file FIRST** - Before any data collection:
28
+ - File name: `[TARGET]_binder_discovery_report.md`
29
+ - Initialize with all section headers from the template
30
+ - Add placeholder text: `[Researching...]` in each section
31
+
32
+ 2. **Progressively update the report** - As you gather data:
33
+ - Update each section with findings immediately
34
+ - The user sees the report growing, not the search process
35
+
36
+ 3. **Output separate data files**:
37
+ - `[TARGET]_candidate_compounds.csv` - Prioritized compounds with SMILES, scores
38
+ - `[TARGET]_bibliography.json` - Literature references (optional)
39
+
40
+ ### 2. Citation Requirements (MANDATORY)
41
+
42
+ Every piece of information MUST include its source:
43
+
44
+ ```markdown
45
+ ### 3.2 Known Inhibitors
46
+ | Compound | ChEMBL ID | IC50 (nM) | Selectivity | Source |
47
+ |----------|-----------|-----------|-------------|--------|
48
+ | Imatinib | CHEMBL941 | 38 | ABL-selective | ChEMBL |
49
+ | Dasatinib | CHEMBL1421 | 0.5 | Multi-kinase | ChEMBL |
50
+
51
+ *Source: ChEMBL via `ChEMBL_get_target_activities` (CHEMBL1862)*
52
+ ```
53
+
54
+ ---
55
+
56
+ ## Workflow Overview
57
+
58
+ ```
59
+ Phase 0: Tool Verification (check parameter names)
60
+
61
+ Phase 1: Target Validation
62
+ ├─ 1.1 Resolve identifiers (UniProt, Ensembl, ChEMBL target ID)
63
+ ├─ 1.2 Assess druggability/tractability
64
+ │ └─ 1.2.5 Check therapeutic antibodies (Thera-SAbDab) [NEW]
65
+ ├─ 1.3 Identify binding sites
66
+ └─ 1.4 Predict structure (NvidiaNIM_alphafold2/esmfold)
67
+
68
+ Phase 2: Known Ligand Mining
69
+ ├─ Extract ChEMBL bioactivity data
70
+ ├─ Get GtoPdb interactions
71
+ ├─ Identify chemical probes
72
+ ├─ BindingDB affinity data (NEW - Ki/IC50/Kd)
73
+ ├─ PubChem BioAssay HTS data (NEW - screening hits)
74
+ └─ Analyze SAR from known actives
75
+
76
+ Phase 3: Structure Analysis
77
+ ├─ Get PDB structures with ligands
78
+ ├─ Check EMDB for cryo-EM structures (NEW - for membrane targets)
79
+ ├─ Analyze binding pocket
80
+ └─ Identify key interactions
81
+
82
+ Phase 3.5: Docking Validation (NvidiaNIM_diffdock/boltz2) [NEW]
83
+ ├─ Dock reference inhibitor
84
+ └─ Validate binding pocket geometry
85
+
86
+ Phase 4: Compound Expansion
87
+ ├─ 4.1-4.3 Similarity/substructure search
88
+ └─ 4.4 De novo generation (NvidiaNIM_genmol/molmim) [NEW]
89
+
90
+ Phase 5: ADMET Filtering
91
+ ├─ Predict physicochemical properties
92
+ ├─ Predict ADMET endpoints
93
+ └─ Flag liabilities
94
+
95
+ Phase 6: Candidate Docking & Prioritization
96
+ ├─ Dock all candidates (NvidiaNIM_diffdock/boltz2) [UPDATED]
97
+ ├─ Score by docking + ADMET + novelty
98
+ ├─ Assess synthesis feasibility
99
+ └─ Generate final ranked list
100
+
101
+ Phase 7: Report Synthesis
102
+ ```
103
+
104
+ ---
105
+
106
+ ## Phase 0: Tool Verification
107
+
108
+ **CRITICAL**: Verify tool parameters before calling unfamiliar tools.
109
+
110
+ ```python
111
+ # Check tool params to prevent silent failures
112
+ tool_info = tu.tools.get_tool_info(tool_name="ChEMBL_get_target_activities")
113
+ ```
114
+
115
+ ### Known Parameter Corrections
116
+
117
+ | Tool | WRONG Parameter | CORRECT Parameter |
118
+ |------|-----------------|-------------------|
119
+ | `OpenTargets_get_target_tractability_by_ensemblID` | `ensembl_id` | `ensemblId` |
120
+ | `ChEMBL_get_target_activities` | `chembl_target_id` | `target_chembl_id` |
121
+ | `ChEMBL_search_similar_molecules` | `smiles` | `molecule` (accepts SMILES, ChEMBL ID, or name) |
122
+ | `alphafold_get_prediction` | `uniprot` | `accession` |
123
+
124
+ ---
125
+
126
+ ## Phase 1: Target Validation
127
+
128
+ ### 1.1 Identifier Resolution Chain
129
+
130
+ ```
131
+ 1. UniProt_search(query=target_name, organism="human")
132
+ └─ Extract: UniProt accession, gene name, protein name
133
+
134
+ 2. MyGene_query_genes(q=gene_symbol, species="human")
135
+ └─ Extract: Ensembl gene ID, NCBI gene ID
136
+
137
+ 3. ChEMBL_search_targets(query=target_name, organism="Homo sapiens")
138
+ └─ Extract: ChEMBL target ID, target type
139
+
140
+ 4. GtoPdb_get_targets(query=target_name)
141
+ └─ Extract: GtoPdb target ID (if GPCR/ion channel/enzyme)
142
+ ```
143
+
144
+ **Store all IDs for downstream queries**:
145
+ ```
146
+ ids = {
147
+ 'uniprot': 'P00533',
148
+ 'ensembl': 'ENSG00000146648',
149
+ 'chembl_target': 'CHEMBL203',
150
+ 'gene_symbol': 'EGFR',
151
+ 'gtopdb': '1797' # if available
152
+ }
153
+ ```
154
+
155
+ ### 1.2 Druggability Assessment
156
+
157
+ **Multi-Source Triangulation**:
158
+
159
+ ```
160
+ 1. OpenTargets_get_target_tractability_by_ensemblID(ensemblId)
161
+ └─ Extract: Small molecule tractability score, bucket
162
+
163
+ 2. DGIdb_get_gene_druggability(genes=[gene_symbol])
164
+ └─ Extract: Druggability categories, known drug count
165
+
166
+ 3. OpenTargets_get_target_classes_by_ensemblID(ensemblId)
167
+ └─ Extract: Target class (kinase, GPCR, etc.)
168
+
169
+ 4. GPCRdb_get_protein(protein=entry_name) # NEW - for GPCRs
170
+ └─ Extract: GPCR family, receptor state, ligand binding data
171
+ ```
172
+
173
+ ### 1.2a GPCRdb Integration (NEW - for GPCR Targets)
174
+
175
+ ~35% of all approved drugs target GPCRs. For GPCR targets, use specialized data:
176
+
177
+ ```python
178
+ def check_if_gpcr_and_enrich(tu, target_name, uniprot_id):
179
+ """Check if target is GPCR and get specialized data."""
180
+
181
+ # Build GPCRdb entry name (e.g., "adrb2_human")
182
+ entry_name = f"{target_name.lower()}_human"
183
+
184
+ # Check if it's a GPCR
185
+ gpcr_info = tu.tools.GPCRdb_get_protein(
186
+ operation="get_protein",
187
+ protein=entry_name
188
+ )
189
+
190
+ if gpcr_info.get('status') == 'success':
191
+ # It's a GPCR - get specialized data
192
+
193
+ # Get known structures (active/inactive states)
194
+ structures = tu.tools.GPCRdb_get_structures(
195
+ operation="get_structures",
196
+ protein=entry_name
197
+ )
198
+
199
+ # Get known ligands
200
+ ligands = tu.tools.GPCRdb_get_ligands(
201
+ operation="get_ligands",
202
+ protein=entry_name
203
+ )
204
+
205
+ # Get mutation data (important for SAR)
206
+ mutations = tu.tools.GPCRdb_get_mutations(
207
+ operation="get_mutations",
208
+ protein=entry_name
209
+ )
210
+
211
+ return {
212
+ 'is_gpcr': True,
213
+ 'gpcr_family': gpcr_info['data'].get('family'),
214
+ 'gpcr_class': gpcr_info['data'].get('receptor_class'),
215
+ 'structures': structures['data'].get('structures', []),
216
+ 'ligands': ligands['data'].get('ligands', []),
217
+ 'mutation_data': mutations['data'].get('mutations', [])
218
+ }
219
+
220
+ return {'is_gpcr': False}
221
+ ```
222
+
223
+ **GPCRdb Advantages**:
224
+ - GPCR-specific sequence alignments (Ballesteros-Weinstein numbering)
225
+ - Active vs. inactive state structures
226
+ - Curated ligand binding data
227
+ - Experimental mutation effects on ligand binding
228
+
229
+ **Druggability Scorecard**:
230
+
231
+ | Factor | Assessment | Score |
232
+ |--------|------------|-------|
233
+ | Known small molecule drugs | Yes (3+) | ★★★ |
234
+ | Tractability bucket | 1-3 | ★★☆-★★★ |
235
+ | Target class | Enzyme/GPCR/Ion channel | ★★★ |
236
+ | Binding site known | Yes (X-ray) | ★★★ |
237
+ | GPCRdb ligands available | Yes (10+) | ★★★ (GPCR only) |
238
+ | Therapeutic antibodies exist | Check Thera-SAbDab | See 1.2.5 |
239
+
240
+ **Decision Point**: If druggability score < ★★☆, warn user about challenges.
241
+
242
+ ### 1.2.5 Therapeutic Antibody Landscape (NEW)
243
+
244
+ Check if therapeutic antibodies already target this protein - important for:
245
+ - Understanding competitive landscape
246
+ - Validating target tractability (if antibodies work, target is validated)
247
+ - Identifying potential combination approaches
248
+
249
+ ```python
250
+ def check_therapeutic_antibodies(tu, target_name):
251
+ """
252
+ Check Thera-SAbDab for therapeutic antibodies against target.
253
+ """
254
+ # Search by target name
255
+ results = tu.tools.TheraSAbDab_search_by_target(
256
+ target=target_name
257
+ )
258
+
259
+ if results.get('status') == 'success':
260
+ antibodies = results['data'].get('therapeutics', [])
261
+
262
+ # Categorize by clinical stage
263
+ by_phase = {'Approved': [], 'Phase 3': [], 'Phase 2': [], 'Phase 1': [], 'Preclinical': []}
264
+ for ab in antibodies:
265
+ phase = ab.get('phase', 'Unknown')
266
+ for key in by_phase.keys():
267
+ if key.lower() in phase.lower():
268
+ by_phase[key].append(ab)
269
+ break
270
+
271
+ return {
272
+ 'total_antibodies': len(antibodies),
273
+ 'by_phase': by_phase,
274
+ 'antibodies': antibodies[:10], # Top 10
275
+ 'competitive_alert': len(by_phase.get('Approved', [])) > 0
276
+ }
277
+ return None
278
+
279
+ def get_antibody_landscape(tu, target_name, uniprot_id=None):
280
+ """
281
+ Comprehensive antibody competitive landscape.
282
+ """
283
+ # Thera-SAbDab search
284
+ therasabdab = check_therapeutic_antibodies(tu, target_name)
285
+
286
+ # Also search by common synonyms
287
+ synonyms = [target_name]
288
+ if target_name != uniprot_id:
289
+ synonyms.append(uniprot_id)
290
+
291
+ all_antibodies = []
292
+ for synonym in synonyms:
293
+ results = tu.tools.TheraSAbDab_search_therapeutics(query=synonym)
294
+ if results.get('status') == 'success':
295
+ all_antibodies.extend(results['data'].get('therapeutics', []))
296
+
297
+ # Deduplicate
298
+ seen = set()
299
+ unique = []
300
+ for ab in all_antibodies:
301
+ inn = ab.get('inn_name')
302
+ if inn and inn not in seen:
303
+ seen.add(inn)
304
+ unique.append(ab)
305
+
306
+ return {
307
+ 'antibodies': unique,
308
+ 'count': len(unique),
309
+ 'has_approved': any(ab.get('phase', '').lower() == 'approved' for ab in unique),
310
+ 'source': 'Thera-SAbDab'
311
+ }
312
+ ```
313
+
314
+ **Report Output**:
315
+ ```markdown
316
+ ### 1.2.5 Therapeutic Antibody Landscape (NEW)
317
+
318
+ **Thera-SAbDab Search Results**:
319
+
320
+ | Antibody (INN) | Target | Format | Phase | PDB |
321
+ |----------------|--------|--------|-------|-----|
322
+ | Pembrolizumab | PD-1 | IgG4 | Approved | 5DK3 |
323
+ | Nivolumab | PD-1 | IgG4 | Approved | 5WT9 |
324
+ | Cemiplimab | PD-1 | IgG4 | Approved | 7WVM |
325
+
326
+ **Competitive Landscape**: ⚠️ 3 approved antibodies target this protein
327
+ **Strategic Implication**: Small molecule approach offers differentiation (oral dosing, CNS penetration, cost)
328
+
329
+ *Source: Thera-SAbDab via `TheraSAbDab_search_by_target`*
330
+ ```
331
+
332
+ **Why Include Antibody Landscape**:
333
+ - **Validation**: Approved antibodies = validated target
334
+ - **Competition**: Understand what's already in market/clinic
335
+ - **Strategy**: Identify gaps (no oral, no CNS-penetrant)
336
+ - **Synergy**: Potential combination opportunities
337
+
338
+ ### 1.3 Binding Site Analysis
339
+
340
+ ```
341
+ 1. ChEMBL_search_binding_sites(target_chembl_id)
342
+ └─ Extract: Binding site names, types
343
+
344
+ 2. get_binding_affinity_by_pdb_id(pdb_id) # For each PDB with ligand
345
+ └─ Extract: Kd, Ki, IC50 values for co-crystallized ligands
346
+
347
+ 3. InterPro_get_protein_domains(uniprot_accession)
348
+ └─ Extract: Domain architecture, active sites
349
+ ```
350
+
351
+ **Output for Report**:
352
+ ```markdown
353
+ ### 1.3 Binding Site Assessment
354
+
355
+ **Known Binding Sites**:
356
+ | Site | Type | Evidence | Key Residues | Source |
357
+ |------|------|----------|--------------|--------|
358
+ | ATP pocket | Orthosteric | X-ray (23 PDBs) | K745, E762, M793 | PDB/ChEMBL |
359
+ | Allosteric pocket | Allosteric | X-ray (3 PDBs) | T790, C797 | PDB |
360
+
361
+ **Binding Site Druggability**: ★★★ (well-defined pocket, multiple co-crystal structures)
362
+
363
+ *Source: ChEMBL via `ChEMBL_search_binding_sites`, PDB structures*
364
+ ```
365
+
366
+ ### 1.4 Structure Prediction (NVIDIA NIM)
367
+
368
+ When no experimental structure is available, or for custom domain predictions.
369
+
370
+ **Requires**: `NVIDIA_API_KEY` environment variable
371
+
372
+ **Option A: AlphaFold2 (High accuracy, async)**
373
+ ```
374
+ NvidiaNIM_alphafold2(
375
+ sequence=kinase_domain_sequence,
376
+ algorithm="mmseqs2",
377
+ relax_prediction=False
378
+ )
379
+ └─ Returns: PDB structure with pLDDT confidence scores
380
+ └─ Use when: Accuracy is critical, time is available (~5-15 min)
381
+ ```
382
+
383
+ **Option B: ESMFold (Fast, synchronous)**
384
+ ```
385
+ NvidiaNIM_esmfold(sequence=kinase_domain_sequence)
386
+ └─ Returns: PDB structure (max 1024 AA)
387
+ └─ Use when: Quick assessment needed (~30 sec)
388
+ ```
389
+
390
+ **Report pLDDT Confidence**:
391
+ ```markdown
392
+ ### 1.4 Structure Prediction Quality
393
+
394
+ **Method**: AlphaFold2 via NVIDIA NIM
395
+ **Mean pLDDT**: 90.94 (very high confidence)
396
+
397
+ | Confidence Level | Range | Fraction | Interpretation |
398
+ |------------------|-------|----------|----------------|
399
+ | Very High | ≥90 | 74.3% | Highly reliable |
400
+ | Confident | 70-90 | 16.0% | Reliable |
401
+ | Low | 50-70 | 9.0% | Use caution |
402
+ | Very Low | <50 | 0.7% | Unreliable |
403
+
404
+ **Key Binding Residue Confidence**:
405
+ | Residue | Function | pLDDT |
406
+ |---------|----------|-------|
407
+ | K745 | ATP binding | 90.0 |
408
+ | T790 | Gatekeeper | 92.3 |
409
+ | M793 | Hinge region | 95.3 |
410
+ | D855 | DFG motif | 89.5 |
411
+
412
+ *Source: NVIDIA NIM via `NvidiaNIM_alphafold2`*
413
+ ```
414
+
415
+ ---
416
+
417
+ ## Phase 2: Known Ligand Mining
418
+
419
+ ### 2.1 ChEMBL Bioactivity Data
420
+
421
+ ```
422
+ 1. ChEMBL_get_target_activities(target_chembl_id, limit=500)
423
+ └─ Filter: standard_type in ["IC50", "Ki", "Kd", "EC50"]
424
+ └─ Filter: standard_value < 10000 nM
425
+ └─ Extract: ChEMBL molecule IDs, SMILES, potency values
426
+
427
+ 2. ChEMBL_get_molecule(molecule_chembl_id) # For top actives
428
+ └─ Extract: Full molecular data, max_phase, oral flag
429
+ ```
430
+
431
+ **Activity Summary Table**:
432
+ ```markdown
433
+ ### 2.1 Known Active Compounds (ChEMBL)
434
+
435
+ **Total Bioactivity Points**: 2,847 (IC50: 1,234 | Ki: 892 | Kd: 456 | EC50: 265)
436
+ **Compounds with IC50 < 100 nM**: 156
437
+ **Approved Drugs for This Target**: 5
438
+
439
+ | Compound | ChEMBL ID | IC50 (nM) | Max Phase | SMILES (truncated) |
440
+ |----------|-----------|-----------|-----------|-------------------|
441
+ | Erlotinib | CHEMBL553 | 2 | 4 | COc1cc2ncnc(Nc3ccc... |
442
+ | Gefitinib | CHEMBL939 | 5 | 4 | COc1cc2ncnc(Nc3ccc... |
443
+ | [Novel] | CHEMBL123 | 12 | 0 | c1ccc(NC(=O)c2ccc... |
444
+
445
+ *Source: ChEMBL via `ChEMBL_get_target_activities` (CHEMBL203)*
446
+ ```
447
+
448
+ ### 2.2 GtoPdb Interactions
449
+
450
+ ```
451
+ GtoPdb_get_target_interactions(target_id)
452
+ └─ Extract: Ligands with pKi/pIC50, selectivity data
453
+ ```
454
+
455
+ ### 2.3 Chemical Probes
456
+
457
+ ```
458
+ OpenTargets_get_chemical_probes_by_target_ensemblID(ensemblId)
459
+ └─ Extract: Validated chemical probes with ratings
460
+ ```
461
+
462
+ **Output for Report**:
463
+ ```markdown
464
+ ### 2.3 Chemical Probes
465
+
466
+ | Probe | Target | Rating | Use | Caveat | Source |
467
+ |-------|--------|--------|-----|--------|--------|
468
+ | Probe-X | EGFR | ★★★★ | In vivo | None | Chemical Probes Portal |
469
+ | Probe-Y | EGFR | ★★★☆ | In vitro | Off-target kinase activity | Open Targets |
470
+
471
+ **Recommended Probe for Target Validation**: Probe-X (highest rating, validated in vivo)
472
+ ```
473
+
474
+ ### 2.4 SAR Analysis from Actives
475
+
476
+ Identify common scaffolds and SAR trends:
477
+
478
+ ```markdown
479
+ ### 2.4 Structure-Activity Relationships
480
+
481
+ **Core Scaffolds Identified**:
482
+ 1. **4-Anilinoquinazoline** (34 compounds, IC50 range: 2-500 nM)
483
+ - N1 position: Aryl preferred
484
+ - C6/C7: Methoxy groups improve potency
485
+
486
+ 2. **Pyrimidine-amine** (12 compounds, IC50 range: 15-800 nM)
487
+ - Less potent than quinazolines
488
+ - Better selectivity profile
489
+
490
+ **Key SAR Insights**:
491
+ - Halogen at meta position of aniline increases potency 3-5x
492
+ - C7 ethoxy group critical for binding (H-bond to M793)
493
+ ```
494
+
495
+ ### 2.5 BindingDB Affinity Data (NEW)
496
+
497
+ BindingDB provides experimental binding affinity data complementary to ChEMBL:
498
+
499
+ ```python
500
+ def get_bindingdb_ligands(tu, uniprot_id, affinity_cutoff=10000):
501
+ """
502
+ Get ligands from BindingDB with measured affinities.
503
+
504
+ BindingDB advantages:
505
+ - May have compounds not in ChEMBL
506
+ - Different affinity types (Ki, IC50, Kd)
507
+ - Direct literature links
508
+ """
509
+
510
+ result = tu.tools.BindingDB_get_ligands_by_uniprot(
511
+ uniprot=uniprot_id,
512
+ affinity_cutoff=affinity_cutoff # nM
513
+ )
514
+
515
+ if result:
516
+ ligands = []
517
+ for entry in result:
518
+ ligands.append({
519
+ 'smiles': entry.get('smile'),
520
+ 'affinity_type': entry.get('affinity_type'),
521
+ 'affinity_nM': entry.get('affinity'),
522
+ 'pmid': entry.get('pmid'),
523
+ 'monomer_id': entry.get('monomerid')
524
+ })
525
+
526
+ # Sort by potency
527
+ ligands.sort(key=lambda x: float(x['affinity_nM']) if x['affinity_nM'] else 1e6)
528
+ return ligands[:50] # Top 50
529
+
530
+ return []
531
+
532
+ def find_compound_polypharmacology(tu, smiles, similarity_cutoff=0.85):
533
+ """Find off-target interactions for selectivity analysis."""
534
+
535
+ targets = tu.tools.BindingDB_get_targets_by_compound(
536
+ smiles=smiles,
537
+ similarity_cutoff=similarity_cutoff
538
+ )
539
+
540
+ return targets # Other proteins this compound may bind
541
+ ```
542
+
543
+ **BindingDB Output for Report**:
544
+ ```markdown
545
+ ### 2.5 Additional Ligands (BindingDB) (NEW)
546
+
547
+ **Total Unique Ligands**: 89 (non-overlapping with ChEMBL)
548
+ **Most Potent**: 0.3 nM Ki
549
+
550
+ | SMILES | Affinity Type | Value (nM) | PMID | BindingDB ID |
551
+ |--------|---------------|------------|------|--------------|
552
+ | CC(C)Cc1ccc... | Ki | 0.3 | 15737014 | 12345 |
553
+ | COc1cc2ncnc... | IC50 | 2.1 | 16460808 | 12346 |
554
+
555
+ **Novel Scaffolds from BindingDB**: 3 scaffolds not seen in ChEMBL data
556
+
557
+ *Source: BindingDB via `BindingDB_get_ligands_by_uniprot`*
558
+ ```
559
+
560
+ ### 2.6 PubChem BioAssay Screening Data (NEW)
561
+
562
+ PubChem BioAssay provides HTS screening results and dose-response data:
563
+
564
+ ```python
565
+ def get_pubchem_assays_for_target(tu, gene_symbol):
566
+ """
567
+ Get bioassays and active compounds from PubChem.
568
+
569
+ Advantages:
570
+ - HTS data not in ChEMBL
571
+ - NIH-funded screening programs (MLPCN)
572
+ - Dose-response curves for IC50 calculation
573
+ """
574
+
575
+ # Search assays targeting this gene
576
+ assays = tu.tools.PubChem_search_assays_by_target_gene(
577
+ gene_symbol=gene_symbol
578
+ )
579
+
580
+ results = {
581
+ 'assays': [],
582
+ 'total_active_compounds': 0
583
+ }
584
+
585
+ if assays.get('data', {}).get('aids'):
586
+ for aid in assays['data']['aids'][:10]: # Top 10 assays
587
+ # Get assay summary
588
+ summary = tu.tools.PubChem_get_assay_summary(aid=aid)
589
+
590
+ # Get active compounds
591
+ actives = tu.tools.PubChem_get_assay_active_compounds(aid=aid)
592
+ active_cids = actives.get('data', {}).get('cids', [])
593
+
594
+ results['assays'].append({
595
+ 'aid': aid,
596
+ 'summary': summary.get('data', {}),
597
+ 'active_count': len(active_cids)
598
+ })
599
+ results['total_active_compounds'] += len(active_cids)
600
+
601
+ return results
602
+
603
+ def get_dose_response_data(tu, aid):
604
+ """Get dose-response curves for IC50/EC50 determination."""
605
+
606
+ dr_data = tu.tools.PubChem_get_assay_dose_response(aid=aid)
607
+ return dr_data
608
+
609
+ def get_compound_bioactivity_profile(tu, cid):
610
+ """Get all bioactivity data for a compound."""
611
+
612
+ profile = tu.tools.PubChem_get_compound_bioactivity(cid=cid)
613
+ return profile
614
+ ```
615
+
616
+ **PubChem BioAssay Output for Report**:
617
+ ```markdown
618
+ ### 2.6 PubChem HTS Screening Data (NEW)
619
+
620
+ **Assays Found**: 45
621
+ **Total Active Compounds Across Assays**: ~1,200
622
+
623
+ | AID | Assay Type | Active Compounds | Target | Description |
624
+ |-----|------------|------------------|--------|-------------|
625
+ | 504526 | HTS | 234 | EGFR | qHTS inhibition screen |
626
+ | 1053104 | Dose-response | 12 | EGFR kinase | Confirmatory IC50 |
627
+ | 651564 | Cellular | 8 | EGFR | Cell proliferation assay |
628
+
629
+ **Novel Actives** (not in ChEMBL/BindingDB):
630
+ - CID 12345678: Active in AID 504526, IC50 = 45 nM
631
+ - CID 23456789: Active in AID 1053104, IC50 = 120 nM
632
+
633
+ *Source: PubChem via `PubChem_search_assays_by_target_gene`, `PubChem_get_assay_active_compounds`*
634
+ ```
635
+
636
+ **Why Use Both BindingDB and PubChem**:
637
+ | Source | Strengths | Best For |
638
+ |--------|-----------|----------|
639
+ | **ChEMBL** | Curated, standardized, SAR data | Primary ligand source |
640
+ | **BindingDB** | Direct affinity measurements | Ki/Kd values, PMIDs |
641
+ | **PubChem BioAssay** | HTS data, NIH screens | Novel scaffolds, broad coverage |
642
+
643
+ ---
644
+
645
+ ## Phase 3: Structure Analysis
646
+
647
+ ### 3.1 PDB Structure Retrieval
648
+
649
+ ```
650
+ 1. PDB_search_similar_structures(query=uniprot_accession, type="sequence")
651
+ └─ Extract: PDB IDs with ligands
652
+
653
+ 2. get_protein_metadata_by_pdb_id(pdb_id)
654
+ └─ Extract: Resolution, method, ligand codes
655
+
656
+ 3. alphafold_get_prediction(accession=uniprot_accession)
657
+ └─ Extract: Predicted structure (if no experimental)
658
+ ```
659
+
660
+ ### 3.1b EMDB Cryo-EM Structures (NEW)
661
+
662
+ **Prioritize EMDB for**: Membrane proteins (GPCRs, ion channels), large complexes, targets with multiple conformational states.
663
+
664
+ ```python
665
+ def get_cryoem_structures(tu, target_name, uniprot_accession):
666
+ """Get cryo-EM structures for membrane targets."""
667
+
668
+ # Search EMDB
669
+ emdb_results = tu.tools.emdb_search(
670
+ query=f"{target_name} membrane receptor"
671
+ )
672
+
673
+ structures = []
674
+ for entry in emdb_results[:5]:
675
+ details = tu.tools.emdb_get_entry(entry_id=entry['emdb_id'])
676
+
677
+ # Get associated PDB model (essential for docking)
678
+ pdb_models = details.get('pdb_ids', [])
679
+
680
+ structures.append({
681
+ 'emdb_id': entry['emdb_id'],
682
+ 'resolution': entry.get('resolution', 'N/A'),
683
+ 'title': entry.get('title', 'N/A'),
684
+ 'conformational_state': details.get('state', 'Unknown'),
685
+ 'pdb_models': pdb_models
686
+ })
687
+
688
+ return structures
689
+ ```
690
+
691
+ **When to use cryo-EM over X-ray**:
692
+ | Target Type | Prefer cryo-EM? | Reason |
693
+ |-------------|-----------------|--------|
694
+ | GPCR | Yes | Native membrane conformation |
695
+ | Ion channel | Yes | Multiple functional states |
696
+ | Receptor-ligand complex | Yes | Physiological state |
697
+ | Kinase | Usually X-ray | Higher resolution typically |
698
+
699
+ **Structure Summary**:
700
+ ```markdown
701
+ ### 3.1 Available Structures
702
+
703
+ | PDB ID | Resolution | Method | Ligand | Affinity | State |
704
+ |--------|------------|--------|--------|----------|-------|
705
+ | 1M17 | 2.6 Å | X-ray | Erlotinib | Ki=0.4 nM | Active |
706
+ | 4HJO | 2.1 Å | X-ray | Lapatinib | Ki=3 nM | Inactive |
707
+ | AF-P00533 | - | Predicted | None | - | - |
708
+
709
+ ### 3.1b Cryo-EM Structures (EMDB)
710
+
711
+ | EMDB ID | Resolution | PDB Model | Conformation | Ligand |
712
+ |---------|------------|-----------|--------------|--------|
713
+ | EMD-12345 | 3.2 Å | 7ABC | Active | Agonist |
714
+ | EMD-23456 | 3.5 Å | 8DEF | Inactive | Antagonist |
715
+
716
+ **Best Structure for Docking**: 1M17 (high resolution, relevant ligand)
717
+ *Source: RCSB PDB, EMDB, AlphaFold DB*
718
+ ```
719
+
720
+ ### 3.2 Binding Pocket Analysis
721
+
722
+ ```
723
+ get_binding_affinity_by_pdb_id(pdb_id)
724
+ └─ Extract: Binding affinities for co-crystallized ligands
725
+ ```
726
+
727
+ **Output for Report**:
728
+ ```markdown
729
+ ### 3.2 Binding Pocket Characterization
730
+
731
+ **Pocket Volume**: ~850 ų (well-defined)
732
+ **Key Interaction Residues**:
733
+ - **Hinge region**: M793 (backbone H-bond donor/acceptor)
734
+ - **Gatekeeper**: T790 (small residue, allows access)
735
+ - **DFG motif**: D855 (active conformation)
736
+ - **Selectivity pocket**: L788, G796 (unique to EGFR)
737
+
738
+ **Druggability Assessment**: High (enclosed pocket, conserved interactions)
739
+ ```
740
+
741
+ ---
742
+
743
+ ## Phase 3.5: Docking Validation (NVIDIA NIM)
744
+
745
+ Validate structure and score compounds using molecular docking.
746
+
747
+ **Requires**: `NVIDIA_API_KEY` environment variable
748
+
749
+ ### 3.5.1 Reference Compound Docking
750
+
751
+ Dock a known inhibitor to validate the structure captures the binding pocket correctly.
752
+
753
+ **Option A: DiffDock (Blind docking, PDB + SDF input)**
754
+ ```
755
+ NvidiaNIM_diffdock(
756
+ protein=pdb_content, # PDB text content
757
+ ligand=reference_sdf, # SDF/MOL2 content
758
+ num_poses=10
759
+ )
760
+ └─ Returns: Docking poses with confidence scores
761
+ └─ Use: When you have PDB structure and ligand SDF file
762
+ ```
763
+
764
+ **Option B: Boltz2 (From sequence + SMILES)**
765
+ ```
766
+ NvidiaNIM_boltz2(
767
+ polymers=[{"molecule_type": "protein", "sequence": kinase_sequence}],
768
+ ligands=[{"smiles": "COc1cc2ncnc(Nc3ccc(C#C)cc3)c2cc1OCCOC"}],
769
+ sampling_steps=50,
770
+ diffusion_samples=1
771
+ )
772
+ └─ Returns: Protein-ligand complex structure
773
+ └─ Use: When starting from SMILES, no SDF needed
774
+ ```
775
+
776
+ ### 3.5.2 Docking Score Interpretation
777
+
778
+ | Score vs Reference | Priority | Symbol |
779
+ |--------------------|----------|--------|
780
+ | Higher than reference | Top priority | ★★★★ |
781
+ | Within 5% of reference | High priority | ★★★ |
782
+ | Within 20% of reference | Moderate priority | ★★☆ |
783
+ | >20% lower | Low priority | ★☆☆ |
784
+
785
+ **Report Format**:
786
+ ```markdown
787
+ ### 3.5 Docking Validation Results
788
+
789
+ **Reference Compound**: Erlotinib
790
+ **Method**: DiffDock via NVIDIA NIM
791
+
792
+ | Metric | Value | Interpretation |
793
+ |--------|-------|----------------|
794
+ | Best Pose Confidence | 0.906 | Excellent |
795
+ | Steric Clashes | None | Clean binding pose |
796
+
797
+ **Validation Status**: ✓ Structure captures binding pocket correctly
798
+
799
+ *Source: NVIDIA NIM via `NvidiaNIM_diffdock`*
800
+ ```
801
+
802
+ ---
803
+
804
+ ## Phase 4: Compound Expansion
805
+
806
+ ### 4.1 Similarity Search
807
+
808
+ Starting from top actives, expand chemical space:
809
+
810
+ ```
811
+ 1. ChEMBL_search_similar_molecules(molecule=top_active_smiles, similarity=70)
812
+ └─ Extract: Similar compounds not yet tested on target
813
+
814
+ 2. PubChem_search_compounds_by_similarity(smiles, threshold=0.7)
815
+ └─ Extract: PubChem CIDs with similar structures
816
+ ```
817
+
818
+ **Strategy**:
819
+ - Use 3-5 diverse actives as seeds
820
+ - Similarity threshold: 70-85% (balance novelty vs. activity)
821
+ - Prioritize compounds NOT in ChEMBL bioactivity for target
822
+
823
+ ### 4.2 Substructure Search
824
+
825
+ ```
826
+ 1. ChEMBL_search_substructure(smiles=core_scaffold)
827
+ └─ Extract: Compounds containing scaffold
828
+
829
+ 2. PubChem_search_compounds_by_substructure(smiles=core_scaffold)
830
+ └─ Extract: Additional scaffold-containing compounds
831
+ ```
832
+
833
+ ### 4.3 Cross-Database Mining
834
+
835
+ ```
836
+ 1. STITCH_get_chemical_protein_interactions(identifier=target_gene)
837
+ └─ Extract: Additional chemical-protein links
838
+
839
+ 2. DGIdb_get_drug_gene_interactions(genes=[gene_symbol])
840
+ └─ Extract: Approved/investigational drugs
841
+ ```
842
+
843
+ **Output for Report**:
844
+ ```markdown
845
+ ### 4. Compound Expansion Results
846
+
847
+ **Starting Seeds**: 5 diverse actives (IC50 < 100 nM)
848
+ **Similarity Expansion**: 847 compounds (70% threshold)
849
+ **Substructure Search**: 234 scaffold matches
850
+ **Cross-Database**: 45 additional hits
851
+
852
+ **After Deduplication**: 923 unique candidate compounds
853
+
854
+ | Source | Compounds | Already Tested | Novel Candidates |
855
+ |--------|-----------|----------------|------------------|
856
+ | ChEMBL similarity | 456 | 234 | 222 |
857
+ | PubChem similarity | 391 | 156 | 235 |
858
+ | ChEMBL substructure | 178 | 89 | 89 |
859
+ | STITCH | 45 | 23 | 22 |
860
+ | **Total Unique** | **923** | **355** | **568** |
861
+ ```
862
+
863
+ ### 4.4 De Novo Molecule Generation (NVIDIA NIM)
864
+
865
+ When database mining yields insufficient candidates, generate novel molecules.
866
+
867
+ **Requires**: `NVIDIA_API_KEY` environment variable
868
+
869
+ **Option A: GenMol (Scaffold Hopping with Masked Regions)**
870
+ ```
871
+ NvidiaNIM_genmol(
872
+ smiles="COc1cc2ncnc(Nc3ccc([*{3-8}])c([*{1-3}])c3)c2cc1OCCCN1CCOCC1",
873
+ num_molecules=100,
874
+ temperature=2.0,
875
+ scoring="QED"
876
+ )
877
+ └─ Input: SMILES with [*{min-max}] masked regions
878
+ └─ Output: Generated molecules with QED/LogP scores
879
+ └─ Use: Explore specific positions while keeping scaffold
880
+ ```
881
+
882
+ **Mask Design Strategy**:
883
+ | Position | Mask | Purpose |
884
+ |----------|------|---------|
885
+ | Aniline substituent | `[*{1-3}]` | Small groups (halogen, methyl) |
886
+ | Solubilizing group | `[*{5-10}]` | Morpholine, piperazine variants |
887
+ | Linker region | `[*{3-6}]` | Spacer variations |
888
+
889
+ **Example Masked SMILES for EGFR**:
890
+ ```
891
+ # Keep quinazoline core, vary aniline and tail
892
+ COc1cc2ncnc(Nc3ccc([*{1-3}])c([*{1-3}])c3)c2cc1[*{5-12}]
893
+ ```
894
+
895
+ **Option B: MolMIM (Controlled Generation from Reference)**
896
+ ```
897
+ NvidiaNIM_molmim(
898
+ smi="COc1cc2ncnc(Nc3ccc(Cl)cc3)c2cc1OCCN1CCOCC1",
899
+ num_molecules=50,
900
+ algorithm="CMA-ES"
901
+ )
902
+ └─ Input: Reference SMILES (known active)
903
+ └─ Output: Optimized analogs with property scores
904
+ └─ Use: Generate close analogs of top actives
905
+ ```
906
+
907
+ **Generation Workflow**:
908
+ 1. Identify top 3-5 actives from Phase 2
909
+ 2. Design masked SMILES for GenMol OR use as reference for MolMIM
910
+ 3. Generate 50-100 molecules per seed
911
+ 4. Pass generated molecules to Phase 5 (ADMET filtering)
912
+ 5. Dock survivors in Phase 6 for final ranking
913
+
914
+ **Report Format**:
915
+ ```markdown
916
+ ### 4.4 De Novo Generation Results
917
+
918
+ **Method**: GenMol via NVIDIA NIM
919
+ **Seed Scaffold**: 4-anilinoquinazoline (from erlotinib)
920
+ **Masked Positions**: Aniline (3,4), solubilizing tail
921
+
922
+ | Metric | Value |
923
+ |--------|-------|
924
+ | Molecules Generated | 100 |
925
+ | Passing Lipinski | 95 (95%) |
926
+ | Mean QED Score | 0.72 |
927
+ | Unique Scaffolds | 12 |
928
+
929
+ **Top Generated Compounds**:
930
+ | ID | SMILES | QED | LogP | Novelty |
931
+ |----|--------|-----|------|---------|
932
+ | GEN-001 | COc1cc2ncnc(Nc3ccc(Cl)c(Cl)c3)c2cc1OCCCN1CCOCC1 | 0.81 | 4.2 | Novel substitution |
933
+ | GEN-002 | COc1cc2ncnc(Nc3ccc(C#N)c(F)c3)c2cc1OCCCN1CCOCC1 | 0.78 | 3.8 | Novel substitution |
934
+
935
+ *Source: NVIDIA NIM via `NvidiaNIM_genmol`*
936
+ ```
937
+
938
+ ---
939
+
940
+ ## Phase 5: ADMET Filtering
941
+
942
+ ### 5.1 Physicochemical Properties
943
+
944
+ ```
945
+ ADMETAI_predict_physicochemical_properties(smiles=[compound_list])
946
+ └─ Filter: Lipinski violations ≤ 1
947
+ └─ Filter: QED > 0.3
948
+ └─ Filter: MW 200-600
949
+ ```
950
+
951
+ ### 5.2 ADMET Endpoints
952
+
953
+ ```
954
+ 1. ADMETAI_predict_bioavailability(smiles=[compound_list])
955
+ └─ Filter: Oral bioavailability > 0.3
956
+
957
+ 2. ADMETAI_predict_toxicity(smiles=[compound_list])
958
+ └─ Filter: AMES < 0.5, hERG < 0.5, DILI < 0.5
959
+
960
+ 3. ADMETAI_predict_CYP_interactions(smiles=[compound_list])
961
+ └─ Flag: CYP3A4 inhibitors (drug interaction risk)
962
+ ```
963
+
964
+ ### 5.3 Structural Alerts
965
+
966
+ ```
967
+ ChEMBL_search_compound_structural_alerts(smiles=compound_smiles)
968
+ └─ Flag: PAINS, reactive groups, toxicophores
969
+ ```
970
+
971
+ **ADMET Filter Summary**:
972
+ ```markdown
973
+ ### 5. ADMET Filtering Results
974
+
975
+ | Filter Stage | Input | Passed | Failed | Pass Rate |
976
+ |--------------|-------|--------|--------|-----------|
977
+ | Physicochemical (Lipinski) | 568 | 456 | 112 | 80% |
978
+ | Drug-likeness (QED > 0.3) | 456 | 398 | 58 | 87% |
979
+ | Bioavailability (> 0.3) | 398 | 312 | 86 | 78% |
980
+ | Toxicity filters | 312 | 267 | 45 | 86% |
981
+ | Structural alerts | 267 | 234 | 33 | 88% |
982
+ | **Final Candidates** | **568** | **234** | **334** | **41%** |
983
+
984
+ **Common Failure Reasons**:
985
+ 1. High molecular weight (>600): 67 compounds
986
+ 2. Low predicted bioavailability: 86 compounds
987
+ 3. hERG liability: 28 compounds
988
+ 4. PAINS alerts: 18 compounds
989
+ ```
990
+
991
+ ---
992
+
993
+ ## Phase 6: Candidate Prioritization
994
+
995
+ ### 6.1 Scoring Framework
996
+
997
+ Score each candidate on multiple dimensions:
998
+
999
+ | Dimension | Weight | Scoring Criteria |
1000
+ |-----------|--------|------------------|
1001
+ | **Structural Similarity** | 25% | Tanimoto to actives (0.7-1.0 → 1-5) |
1002
+ | **Novelty** | 20% | Not in ChEMBL bioactivity = +2; Novel scaffold = +3 |
1003
+ | **ADMET Score** | 25% | Composite of property predictions |
1004
+ | **Synthesis Feasibility** | 15% | SA score (1-10), commercial availability |
1005
+ | **Scaffold Diversity** | 15% | Cluster representative bonus |
1006
+
1007
+ ### 6.2 Synthesis Feasibility
1008
+
1009
+ ```markdown
1010
+ ### 6.2 Synthesis Feasibility Assessment
1011
+
1012
+ | Candidate | SA Score | Commercial | Estimated Steps | Flag |
1013
+ |-----------|----------|------------|-----------------|------|
1014
+ | Compound-1 | 2.3 | Yes (Enamine) | 0 | ★★★ |
1015
+ | Compound-2 | 3.5 | Building block | 2-3 | ★★☆ |
1016
+ | Compound-3 | 5.8 | No | 6-8 | ★☆☆ |
1017
+
1018
+ **SA Score Interpretation**:
1019
+ - 1-3: Easy synthesis
1020
+ - 3-5: Moderate complexity
1021
+ - 5-10: Challenging synthesis
1022
+ ```
1023
+
1024
+ ### 6.3 Final Prioritized List
1025
+
1026
+ ```markdown
1027
+ ### 6.3 Top 20 Candidate Compounds
1028
+
1029
+ | Rank | ID | SMILES | Sim. Score | ADMET | Novelty | Overall | Rationale |
1030
+ |------|-----|--------|------------|-------|---------|---------|-----------|
1031
+ | 1 | CPD-001 | Cc1ccc... | 0.82 | 4.5 | Novel scaffold | 4.2 | High similarity, clean ADMET |
1032
+ | 2 | CPD-002 | COc1cc... | 0.78 | 4.3 | Not tested | 4.0 | Quinazoline analog |
1033
+ | 3 | CPD-003 | Nc1ccc... | 0.75 | 4.1 | Novel core | 3.9 | New chemotype |
1034
+ | ... | ... | ... | ... | ... | ... | ... | ... |
1035
+
1036
+ **Scaffold Diversity**: 7 distinct scaffolds in top 20
1037
+ **Commercial Availability**: 12/20 available for purchase
1038
+ **Estimated Hit Rate**: 15-25% (based on similarity to actives)
1039
+ ```
1040
+
1041
+ ---
1042
+
1043
+ ## Phase 6.5: Literature Evidence (NEW)
1044
+
1045
+ ### 6.5.1 Literature Search for Validation
1046
+
1047
+ Search literature to validate candidate compounds and understand target context.
1048
+
1049
+ ```python
1050
+ def search_binder_literature(tu, target_name, compound_scaffolds):
1051
+ """Search literature for compound and target evidence."""
1052
+
1053
+ # PubMed: Published SAR studies
1054
+ sar_papers = tu.tools.PubMed_search_articles(
1055
+ query=f"{target_name} inhibitor SAR structure-activity",
1056
+ limit=30
1057
+ )
1058
+
1059
+ # BioRxiv: Latest unpublished findings
1060
+ preprints = tu.tools.BioRxiv_search_preprints(
1061
+ query=f"{target_name} small molecule discovery",
1062
+ limit=15
1063
+ )
1064
+
1065
+ # MedRxiv: Clinical data on inhibitors
1066
+ clinical = tu.tools.MedRxiv_search_preprints(
1067
+ query=f"{target_name} inhibitor clinical trial",
1068
+ limit=10
1069
+ )
1070
+
1071
+ # Citation analysis for key papers
1072
+ key_papers = sar_papers[:10]
1073
+ for paper in key_papers:
1074
+ citation = tu.tools.openalex_search_works(
1075
+ query=paper['title'],
1076
+ limit=1
1077
+ )
1078
+ paper['citations'] = citation[0].get('cited_by_count', 0) if citation else 0
1079
+
1080
+ return {
1081
+ 'published_sar': sar_papers,
1082
+ 'preprints': preprints,
1083
+ 'clinical_preprints': clinical,
1084
+ 'high_impact_papers': sorted(key_papers, key=lambda x: x.get('citations', 0), reverse=True)
1085
+ }
1086
+ ```
1087
+
1088
+ ### 6.5.2 Output for Report
1089
+
1090
+ ```markdown
1091
+ ## 6.5 Literature Evidence
1092
+
1093
+ ### Published SAR Studies
1094
+
1095
+ | PMID | Title | Year | Key Insight |
1096
+ |------|-------|------|-------------|
1097
+ | 34567890 | Discovery of novel EGFR inhibitors... | 2024 | C7 substitution critical |
1098
+ | 33456789 | Structure-activity relationship of... | 2023 | Fluorine improves potency |
1099
+
1100
+ ### Recent Preprints (⚠️ Not Peer-Reviewed)
1101
+
1102
+ | Source | Title | Posted | Relevance |
1103
+ |--------|-------|--------|-----------|
1104
+ | BioRxiv | Novel scaffolds for EGFR... | 2024-02 | New chemotype discovery |
1105
+ | MedRxiv | Clinical activity of... | 2024-01 | Phase 2 results |
1106
+
1107
+ ### High-Impact References
1108
+
1109
+ | PMID | Citations | Title |
1110
+ |------|-----------|-------|
1111
+ | 32123456 | 523 | Landmark EGFR inhibitor study... |
1112
+ | 31234567 | 312 | Comprehensive SAR analysis... |
1113
+
1114
+ *Source: PubMed, BioRxiv, MedRxiv, OpenAlex*
1115
+ ```
1116
+
1117
+ ---
1118
+
1119
+ ## Report Template
1120
+
1121
+ **File**: `[TARGET]_binder_discovery_report.md`
1122
+
1123
+ ```markdown
1124
+ # Small Molecule Binder Discovery: [TARGET]
1125
+
1126
+ **Generated**: [Date] | **Query**: [Original query] | **Status**: In Progress
1127
+
1128
+ ---
1129
+
1130
+ ## Executive Summary
1131
+ [Researching...]
1132
+
1133
+ ---
1134
+
1135
+ ## 1. Target Validation
1136
+ ### 1.1 Target Identifiers
1137
+ [Researching...]
1138
+ ### 1.2 Druggability Assessment
1139
+ [Researching...]
1140
+ ### 1.3 Binding Site Analysis
1141
+ [Researching...]
1142
+
1143
+ ---
1144
+
1145
+ ## 2. Known Ligand Landscape
1146
+ ### 2.1 ChEMBL Bioactivity Summary
1147
+ [Researching...]
1148
+ ### 2.2 Approved Drugs & Clinical Compounds
1149
+ [Researching...]
1150
+ ### 2.3 Chemical Probes
1151
+ [Researching...]
1152
+ ### 2.4 SAR Insights
1153
+ [Researching...]
1154
+
1155
+ ---
1156
+
1157
+ ## 3. Structural Information
1158
+ ### 3.1 Available Structures
1159
+ [Researching...]
1160
+ ### 3.2 Binding Pocket Analysis
1161
+ [Researching...]
1162
+ ### 3.3 Key Interactions
1163
+ [Researching...]
1164
+
1165
+ ---
1166
+
1167
+ ## 4. Compound Expansion
1168
+ ### 4.1 Similarity Search Results
1169
+ [Researching...]
1170
+ ### 4.2 Substructure Search Results
1171
+ [Researching...]
1172
+ ### 4.3 Cross-Database Mining
1173
+ [Researching...]
1174
+
1175
+ ---
1176
+
1177
+ ## 5. ADMET Filtering
1178
+ ### 5.1 Physicochemical Filters
1179
+ [Researching...]
1180
+ ### 5.2 ADMET Predictions
1181
+ [Researching...]
1182
+ ### 5.3 Structural Alerts
1183
+ [Researching...]
1184
+ ### 5.4 Filter Summary
1185
+ [Researching...]
1186
+
1187
+ ---
1188
+
1189
+ ## 6. Candidate Prioritization
1190
+ ### 6.1 Scoring Methodology
1191
+ [Researching...]
1192
+ ### 6.2 Synthesis Feasibility
1193
+ [Researching...]
1194
+ ### 6.3 Top 20 Candidates
1195
+ [Researching...]
1196
+
1197
+ ---
1198
+
1199
+ ## 7. Recommendations
1200
+ ### 7.1 Immediate Actions
1201
+ [Researching...]
1202
+ ### 7.2 Experimental Validation Plan
1203
+ [Researching...]
1204
+ ### 7.3 Backup Strategies
1205
+ [Researching...]
1206
+
1207
+ ---
1208
+
1209
+ ## 8. Data Gaps & Limitations
1210
+ [Researching...]
1211
+
1212
+ ---
1213
+
1214
+ ## 9. Data Sources
1215
+ [Will be populated as research progresses...]
1216
+
1217
+ ---
1218
+
1219
+ ## 10. Methods Summary
1220
+
1221
+ | Step | Tool | Purpose |
1222
+ |------|------|---------|
1223
+ | Sequence retrieval | UniProt_search | Get protein sequence |
1224
+ | Structure prediction | NvidiaNIM_alphafold2 / NvidiaNIM_esmfold | 3D structure with pLDDT |
1225
+ | Docking validation | NvidiaNIM_diffdock / NvidiaNIM_boltz2 | Validate binding pocket |
1226
+ | Known ligands | ChEMBL_get_target_activities | Bioactivity data |
1227
+ | Similarity search | ChEMBL_search_similar_molecules | Expand chemical space |
1228
+ | De novo generation | NvidiaNIM_genmol / NvidiaNIM_molmim | Novel molecule design |
1229
+ | ADMET filtering | ADMETAI_predict_* | Drug-likeness assessment |
1230
+ | Candidate docking | NvidiaNIM_diffdock / NvidiaNIM_boltz2 | Final scoring |
1231
+ ```
1232
+
1233
+ ---
1234
+
1235
+ ## Evidence Grading
1236
+
1237
+ | Tier | Symbol | Description | Example |
1238
+ |------|--------|-------------|---------|
1239
+ | **T0** | ★★★★ | Docking score > reference inhibitor | Better than erlotinib |
1240
+ | **T1** | ★★★ | Experimental IC50/Ki < 100 nM | ChEMBL bioactivity |
1241
+ | **T2** | ★★☆ | Docking within 5% of reference OR IC50 100-1000 nM | High priority |
1242
+ | **T3** | ★☆☆ | Structural similarity > 80% to T1 | Predicted active |
1243
+ | **T4** | ☆☆☆ | Similarity 70-80%, scaffold match | Lower confidence |
1244
+ | **T5** | ○○○ | Generated molecule, ADMET-passed, no docking | Speculative |
1245
+
1246
+ **Docking-Enhanced Grading**: When NVIDIA NIM docking is available, compounds gain evidence:
1247
+ - Docking > reference → upgrade to T0 (★★★★)
1248
+ - Docking within 5% → upgrade to T2 (★★☆)
1249
+ - Docking within 20% → maintain current tier
1250
+ - Docking >20% worse → downgrade one tier
1251
+
1252
+ Apply to all candidate compounds:
1253
+ ```markdown
1254
+ | Compound | Evidence | Docking vs Ref | Rationale |
1255
+ |----------|----------|----------------|-----------|
1256
+ | CPD-001 | ★★★★ | +8.3% | 85% similar, docking > erlotinib |
1257
+ | CPD-002 | ★★★ | -2.1% | IC50=45nM, validated by docking |
1258
+ | CPD-003 | ★★☆ | -4.5% | 78% similar, good docking |
1259
+ | GEN-001 | ★☆☆ | -15% | Generated, ADMET-passed |
1260
+ ```
1261
+
1262
+ ---
1263
+
1264
+ ## Mandatory Completeness Checklist
1265
+
1266
+ ### Phase 1: Target Validation
1267
+ - [ ] UniProt accession resolved
1268
+ - [ ] ChEMBL target ID obtained
1269
+ - [ ] Druggability assessed (≥2 sources)
1270
+ - [ ] Target class identified
1271
+ - [ ] Binding site information (or "No structural data")
1272
+
1273
+ ### Phase 2: Known Ligands
1274
+ - [ ] ChEMBL activities queried (≥100 or all available)
1275
+ - [ ] Activity statistics (count, potency range)
1276
+ - [ ] Top 10 actives listed with IC50
1277
+ - [ ] Chemical probes identified (or "None available")
1278
+ - [ ] SAR insights summarized
1279
+
1280
+ ### Phase 3: Structure
1281
+ - [ ] PDB structures listed (or "No experimental structure")
1282
+ - [ ] Best structure for docking identified
1283
+ - [ ] Binding pocket described (or "Predicted from AlphaFold")
1284
+
1285
+ ### Phase 4: Expansion
1286
+ - [ ] ≥3 seed compounds used
1287
+ - [ ] Similarity search completed (≥100 results or exhausted)
1288
+ - [ ] Substructure search completed
1289
+ - [ ] Deduplicated candidate count reported
1290
+
1291
+ ### Phase 5: ADMET
1292
+ - [ ] Physicochemical filters applied
1293
+ - [ ] Toxicity predictions run
1294
+ - [ ] Structural alerts checked
1295
+ - [ ] Filter funnel table included
1296
+
1297
+ ### Phase 6: Prioritization
1298
+ - [ ] ≥20 candidates ranked (or all if fewer)
1299
+ - [ ] Scoring methodology explained
1300
+ - [ ] Synthesis feasibility assessed
1301
+ - [ ] Scaffold diversity noted
1302
+
1303
+ ### Phase 7: Recommendations
1304
+ - [ ] ≥3 immediate actions listed
1305
+ - [ ] Experimental validation plan outlined
1306
+ - [ ] Data gaps aggregated
1307
+
1308
+ ---
1309
+
1310
+ ## Tool Reference by Phase
1311
+
1312
+ ### Phase 1: Target Validation
1313
+ | Tool | Purpose |
1314
+ |------|---------|
1315
+ | `UniProt_search` | Resolve UniProt accession |
1316
+ | `MyGene_query_genes` | Get Ensembl/NCBI IDs |
1317
+ | `ChEMBL_search_targets` | Get ChEMBL target ID |
1318
+ | `OpenTargets_get_target_tractability_by_ensemblID` | Tractability assessment |
1319
+ | `DGIdb_get_gene_druggability` | Druggability categories |
1320
+ | `ChEMBL_search_binding_sites` | Binding site info |
1321
+ | `InterPro_get_protein_domains` | Domain architecture |
1322
+
1323
+ ### Phase 2: Known Ligands
1324
+ | Tool | Purpose |
1325
+ |------|---------|
1326
+ | `ChEMBL_get_target_activities` | Bioactivity data |
1327
+ | `ChEMBL_get_molecule` | Molecule details |
1328
+ | `GtoPdb_get_target_interactions` | Pharmacology data |
1329
+ | `OpenTargets_get_chemical_probes_by_target_ensemblID` | Chemical probes |
1330
+ | `OpenTargets_get_associated_drugs_by_target_ensemblID` | Known drugs |
1331
+
1332
+ ### Phase 1.4: Structure Prediction (NVIDIA NIM)
1333
+ | Tool | Purpose |
1334
+ |------|---------|
1335
+ | `NvidiaNIM_alphafold2` | High-accuracy structure prediction with pLDDT |
1336
+ | `NvidiaNIM_esmfold` | Fast structure prediction (max 1024 AA) |
1337
+ | `NvidiaNIM_msa_search` | MSA generation for AlphaFold |
1338
+
1339
+ ### Phase 3: Structure
1340
+ | Tool | Purpose |
1341
+ |------|---------|
1342
+ | `PDB_search_similar_structures` | Find PDB structures |
1343
+ | `get_protein_metadata_by_pdb_id` | Structure metadata |
1344
+ | `get_binding_affinity_by_pdb_id` | Ligand affinities |
1345
+ | `alphafold_get_prediction` | Predicted structure (AlphaFold DB) |
1346
+ | `get_ligand_smiles_by_chem_comp_id` | Ligand structures |
1347
+ | `emdb_search` | Search cryo-EM structures (NEW) |
1348
+ | `emdb_get_entry` | Get EMDB entry details (NEW) |
1349
+
1350
+ ### Phase 3.5: Docking Validation (NVIDIA NIM)
1351
+ | Tool | Purpose |
1352
+ |------|---------|
1353
+ | `NvidiaNIM_diffdock` | Blind molecular docking (PDB + SDF) |
1354
+ | `NvidiaNIM_boltz2` | Protein-ligand complex (sequence + SMILES) |
1355
+
1356
+ ### Phase 4: Expansion
1357
+ | Tool | Purpose |
1358
+ |------|---------|
1359
+ | `ChEMBL_search_similar_molecules` | Similarity search |
1360
+ | `PubChem_search_compounds_by_similarity` | PubChem similarity |
1361
+ | `ChEMBL_search_substructure` | Substructure search |
1362
+ | `PubChem_search_compounds_by_substructure` | PubChem substructure |
1363
+ | `STITCH_get_chemical_protein_interactions` | Cross-database |
1364
+
1365
+ ### Phase 4.4: De Novo Generation (NVIDIA NIM)
1366
+ | Tool | Purpose |
1367
+ |------|---------|
1368
+ | `NvidiaNIM_genmol` | Scaffold hopping with masked regions |
1369
+ | `NvidiaNIM_molmim` | Controlled generation from reference |
1370
+
1371
+ ### Phase 5: ADMET
1372
+ | Tool | Purpose |
1373
+ |------|---------|
1374
+ | `ADMETAI_predict_physicochemical_properties` | Drug-likeness |
1375
+ | `ADMETAI_predict_bioavailability` | Oral absorption |
1376
+ | `ADMETAI_predict_toxicity` | Toxicity flags |
1377
+ | `ADMETAI_predict_CYP_interactions` | CYP liabilities |
1378
+ | `ChEMBL_search_compound_structural_alerts` | PAINS, alerts |
1379
+
1380
+ ### Phase 6: Candidate Docking (NVIDIA NIM)
1381
+ | Tool | Purpose |
1382
+ |------|---------|
1383
+ | `NvidiaNIM_diffdock` | Score all candidates by docking |
1384
+ | `NvidiaNIM_boltz2` | Alternative docking from SMILES |
1385
+
1386
+ ### Phase 6.5: Literature Evidence (NEW)
1387
+ | Tool | Purpose |
1388
+ |------|---------|
1389
+ | `PubMed_search_articles` | Published SAR studies |
1390
+ | `BioRxiv_search_preprints` | Latest biology preprints |
1391
+ | `MedRxiv_search_preprints` | Clinical preprints |
1392
+ | `openalex_search_works` | Citation analysis |
1393
+ | `SemanticScholar_search` | AI-ranked papers |
1394
+
1395
+ ---
1396
+
1397
+ ## Fallback Chains
1398
+
1399
+ | Primary Tool | Fallback 1 | Fallback 2 | Use When |
1400
+ |--------------|------------|------------|----------|
1401
+ | `ChEMBL_get_target_activities` | `GtoPdb_get_target_interactions` | `PubChem_search_assays` | No ChEMBL data |
1402
+ | `ChEMBL_search_similar_molecules` | `PubChem_search_compounds_by_similarity` | `STITCH_get_chemical_protein_interactions` | ChEMBL exhausted |
1403
+ | `PDB_search_similar_structures` | `NvidiaNIM_alphafold2` | `alphafold_get_prediction` | No PDB structure |
1404
+ | `alphafold_get_prediction` | `NvidiaNIM_alphafold2` | `NvidiaNIM_esmfold` | AlphaFold DB unavailable |
1405
+ | `NvidiaNIM_alphafold2` | `NvidiaNIM_esmfold` | `alphafold_get_prediction` | AlphaFold2 NIM error |
1406
+ | `NvidiaNIM_diffdock` | `NvidiaNIM_boltz2` | Skip docking, use similarity | Docking error |
1407
+ | `NvidiaNIM_genmol` | `NvidiaNIM_molmim` | Manual scaffold hopping | Generation error |
1408
+ | `OpenTargets_get_target_tractability` | `DGIdb_get_gene_druggability` | Document "Unknown" | Open Targets error |
1409
+ | `ADMETAI_*` | SwissADME tools | Basic Lipinski | Invalid SMILES |
1410
+ | `PDB_search_similar_structures` | `emdb_search` + PDB | `NvidiaNIM_alphafold2` | Membrane proteins |
1411
+ | `PubMed_search_articles` | `openalex_search_works` | `SemanticScholar_search` | Literature search |
1412
+ | `BioRxiv_search_preprints` | `MedRxiv_search_preprints` | Skip preprints | Preprint sources |
1413
+
1414
+ **NVIDIA NIM API Key Required**: Tools with `NvidiaNIM_` prefix require `NVIDIA_API_KEY` environment variable. Check availability at start:
1415
+ ```python
1416
+ import os
1417
+ nvidia_available = bool(os.environ.get("NVIDIA_API_KEY"))
1418
+ # If not available, fall back to non-NIM alternatives
1419
+ ```
1420
+
1421
+ ---
1422
+
1423
+ ## Common Use Cases
1424
+
1425
+ ### Well-Characterized Target
1426
+ User: "Find novel binders for EGFR"
1427
+ → Rich ChEMBL data; focus on novel scaffolds, selectivity, ADMET
1428
+
1429
+ ### Novel Target
1430
+ User: "Find small molecules for [new target with no known ligands]"
1431
+ → Limited bioactivity; rely on structure-based assessment, similar target ligands
1432
+
1433
+ ### Lead Optimization
1434
+ User: "Find analogs of compound X for target Y"
1435
+ → Deep similarity search around specific compound; focus on SAR
1436
+
1437
+ ### Selectivity Challenge
1438
+ User: "Find selective inhibitors for kinase X vs kinase Y"
1439
+ → Include selectivity analysis; filter by off-target predictions
1440
+
1441
+ ---
1442
+
1443
+ ## When NOT to Use This Skill
1444
+
1445
+ - **Drug research** → Use tooluniverse-drug-research (existing drug profiling)
1446
+ - **Target research only** → Use tooluniverse-target-research
1447
+ - **Single compound ADMET** → Call ADMET tools directly
1448
+ - **Literature search** → Use tooluniverse-literature-deep-research
1449
+ - **Protein structure only** → Use tooluniverse-protein-structure-retrieval
1450
+
1451
+ Use this skill for **discovering new compounds** for a protein target.
1452
+
1453
+ ---
1454
+
1455
+ ## Additional Resources
1456
+
1457
+ - **Checklist**: [CHECKLIST.md](CHECKLIST.md) - Pre-delivery verification
1458
+ - **Examples**: [EXAMPLES.md](EXAMPLES.md) - Detailed workflow examples
1459
+ - **Tool corrections**: [TOOLS_REFERENCE.md](TOOLS_REFERENCE.md) - Parameter corrections