miga-base 1.2.15.2 → 1.2.15.4

Files changed (306)
  1. checksums.yaml +4 -4
  2. data/lib/miga/cli/action/download/gtdb.rb +4 -1
  3. data/lib/miga/cli/action/gtdb_get.rb +4 -0
  4. data/lib/miga/daemon.rb +4 -1
  5. data/lib/miga/lair.rb +6 -4
  6. data/lib/miga/remote_dataset/download.rb +3 -2
  7. data/lib/miga/remote_dataset.rb +25 -7
  8. data/lib/miga/taxonomy.rb +6 -0
  9. data/lib/miga/version.rb +2 -2
  10. metadata +6 -302
  11. data/utils/FastAAI/00.Libraries/01.SCG_HMMs/Archaea_SCG.hmm +0 -41964
  12. data/utils/FastAAI/00.Libraries/01.SCG_HMMs/Bacteria_SCG.hmm +0 -32439
  13. data/utils/FastAAI/00.Libraries/01.SCG_HMMs/Complete_SCG_DB.hmm +0 -62056
  14. data/utils/FastAAI/FastAAI +0 -3659
  15. data/utils/FastAAI/FastAAI-legacy/FastAAI +0 -1336
  16. data/utils/FastAAI/FastAAI-legacy/kAAI_v1.0_virus.py +0 -1296
  17. data/utils/FastAAI/README.md +0 -84
  18. data/utils/enveomics/Docs/recplot2.md +0 -244
  19. data/utils/enveomics/Examples/aai-matrix.bash +0 -66
  20. data/utils/enveomics/Examples/ani-matrix.bash +0 -66
  21. data/utils/enveomics/Examples/essential-phylogeny.bash +0 -105
  22. data/utils/enveomics/Examples/unus-genome-phylogeny.bash +0 -100
  23. data/utils/enveomics/LICENSE.txt +0 -73
  24. data/utils/enveomics/Makefile +0 -52
  25. data/utils/enveomics/Manifest/Tasks/aasubs.json +0 -103
  26. data/utils/enveomics/Manifest/Tasks/blasttab.json +0 -790
  27. data/utils/enveomics/Manifest/Tasks/distances.json +0 -161
  28. data/utils/enveomics/Manifest/Tasks/fasta.json +0 -802
  29. data/utils/enveomics/Manifest/Tasks/fastq.json +0 -291
  30. data/utils/enveomics/Manifest/Tasks/graphics.json +0 -126
  31. data/utils/enveomics/Manifest/Tasks/mapping.json +0 -137
  32. data/utils/enveomics/Manifest/Tasks/ogs.json +0 -382
  33. data/utils/enveomics/Manifest/Tasks/other.json +0 -906
  34. data/utils/enveomics/Manifest/Tasks/remote.json +0 -355
  35. data/utils/enveomics/Manifest/Tasks/sequence-identity.json +0 -650
  36. data/utils/enveomics/Manifest/Tasks/tables.json +0 -308
  37. data/utils/enveomics/Manifest/Tasks/trees.json +0 -68
  38. data/utils/enveomics/Manifest/Tasks/variants.json +0 -111
  39. data/utils/enveomics/Manifest/categories.json +0 -165
  40. data/utils/enveomics/Manifest/examples.json +0 -162
  41. data/utils/enveomics/Manifest/tasks.json +0 -4
  42. data/utils/enveomics/Pipelines/assembly.pbs/CONFIG.mock.bash +0 -69
  43. data/utils/enveomics/Pipelines/assembly.pbs/FastA.N50.pl +0 -1
  44. data/utils/enveomics/Pipelines/assembly.pbs/FastA.filterN.pl +0 -1
  45. data/utils/enveomics/Pipelines/assembly.pbs/FastA.length.pl +0 -1
  46. data/utils/enveomics/Pipelines/assembly.pbs/README.md +0 -189
  47. data/utils/enveomics/Pipelines/assembly.pbs/RUNME-2.bash +0 -112
  48. data/utils/enveomics/Pipelines/assembly.pbs/RUNME-3.bash +0 -23
  49. data/utils/enveomics/Pipelines/assembly.pbs/RUNME-4.bash +0 -44
  50. data/utils/enveomics/Pipelines/assembly.pbs/RUNME.bash +0 -50
  51. data/utils/enveomics/Pipelines/assembly.pbs/kSelector.R +0 -37
  52. data/utils/enveomics/Pipelines/assembly.pbs/newbler.pbs +0 -68
  53. data/utils/enveomics/Pipelines/assembly.pbs/newbler_preparator.pl +0 -49
  54. data/utils/enveomics/Pipelines/assembly.pbs/soap.pbs +0 -80
  55. data/utils/enveomics/Pipelines/assembly.pbs/stats.pbs +0 -57
  56. data/utils/enveomics/Pipelines/assembly.pbs/velvet.pbs +0 -63
  57. data/utils/enveomics/Pipelines/blast.pbs/01.pbs.bash +0 -38
  58. data/utils/enveomics/Pipelines/blast.pbs/02.pbs.bash +0 -73
  59. data/utils/enveomics/Pipelines/blast.pbs/03.pbs.bash +0 -21
  60. data/utils/enveomics/Pipelines/blast.pbs/BlastTab.recover_job.pl +0 -72
  61. data/utils/enveomics/Pipelines/blast.pbs/CONFIG.mock.bash +0 -98
  62. data/utils/enveomics/Pipelines/blast.pbs/FastA.split.pl +0 -1
  63. data/utils/enveomics/Pipelines/blast.pbs/README.md +0 -127
  64. data/utils/enveomics/Pipelines/blast.pbs/RUNME.bash +0 -109
  65. data/utils/enveomics/Pipelines/blast.pbs/TASK.check.bash +0 -128
  66. data/utils/enveomics/Pipelines/blast.pbs/TASK.dry.bash +0 -16
  67. data/utils/enveomics/Pipelines/blast.pbs/TASK.eo.bash +0 -22
  68. data/utils/enveomics/Pipelines/blast.pbs/TASK.pause.bash +0 -26
  69. data/utils/enveomics/Pipelines/blast.pbs/TASK.run.bash +0 -89
  70. data/utils/enveomics/Pipelines/blast.pbs/sentinel.pbs.bash +0 -29
  71. data/utils/enveomics/Pipelines/idba.pbs/README.md +0 -49
  72. data/utils/enveomics/Pipelines/idba.pbs/RUNME.bash +0 -95
  73. data/utils/enveomics/Pipelines/idba.pbs/run.pbs +0 -56
  74. data/utils/enveomics/Pipelines/trim.pbs/README.md +0 -54
  75. data/utils/enveomics/Pipelines/trim.pbs/RUNME.bash +0 -70
  76. data/utils/enveomics/Pipelines/trim.pbs/run.pbs +0 -130
  77. data/utils/enveomics/README.md +0 -42
  78. data/utils/enveomics/Scripts/AAsubs.log2ratio.rb +0 -171
  79. data/utils/enveomics/Scripts/Aln.cat.rb +0 -221
  80. data/utils/enveomics/Scripts/Aln.convert.pl +0 -35
  81. data/utils/enveomics/Scripts/AlphaDiversity.pl +0 -152
  82. data/utils/enveomics/Scripts/BedGraph.tad.rb +0 -93
  83. data/utils/enveomics/Scripts/BedGraph.window.rb +0 -71
  84. data/utils/enveomics/Scripts/BlastPairwise.AAsubs.pl +0 -102
  85. data/utils/enveomics/Scripts/BlastTab.addlen.rb +0 -63
  86. data/utils/enveomics/Scripts/BlastTab.advance.bash +0 -48
  87. data/utils/enveomics/Scripts/BlastTab.best_hit_sorted.pl +0 -55
  88. data/utils/enveomics/Scripts/BlastTab.catsbj.pl +0 -104
  89. data/utils/enveomics/Scripts/BlastTab.cogCat.rb +0 -76
  90. data/utils/enveomics/Scripts/BlastTab.filter.pl +0 -47
  91. data/utils/enveomics/Scripts/BlastTab.kegg_pep2path_rest.pl +0 -194
  92. data/utils/enveomics/Scripts/BlastTab.metaxaPrep.pl +0 -104
  93. data/utils/enveomics/Scripts/BlastTab.pairedHits.rb +0 -157
  94. data/utils/enveomics/Scripts/BlastTab.recplot2.R +0 -48
  95. data/utils/enveomics/Scripts/BlastTab.seqdepth.pl +0 -86
  96. data/utils/enveomics/Scripts/BlastTab.seqdepth_ZIP.pl +0 -119
  97. data/utils/enveomics/Scripts/BlastTab.seqdepth_nomedian.pl +0 -86
  98. data/utils/enveomics/Scripts/BlastTab.subsample.pl +0 -47
  99. data/utils/enveomics/Scripts/BlastTab.sumPerHit.pl +0 -114
  100. data/utils/enveomics/Scripts/BlastTab.taxid2taxrank.pl +0 -90
  101. data/utils/enveomics/Scripts/BlastTab.topHits_sorted.rb +0 -123
  102. data/utils/enveomics/Scripts/Chao1.pl +0 -97
  103. data/utils/enveomics/Scripts/CharTable.classify.rb +0 -234
  104. data/utils/enveomics/Scripts/EBIseq2tax.rb +0 -83
  105. data/utils/enveomics/Scripts/FastA.N50.pl +0 -60
  106. data/utils/enveomics/Scripts/FastA.extract.rb +0 -152
  107. data/utils/enveomics/Scripts/FastA.filter.pl +0 -52
  108. data/utils/enveomics/Scripts/FastA.filterLen.pl +0 -28
  109. data/utils/enveomics/Scripts/FastA.filterN.pl +0 -60
  110. data/utils/enveomics/Scripts/FastA.fragment.rb +0 -100
  111. data/utils/enveomics/Scripts/FastA.gc.pl +0 -42
  112. data/utils/enveomics/Scripts/FastA.interpose.pl +0 -93
  113. data/utils/enveomics/Scripts/FastA.length.pl +0 -38
  114. data/utils/enveomics/Scripts/FastA.mask.rb +0 -89
  115. data/utils/enveomics/Scripts/FastA.per_file.pl +0 -36
  116. data/utils/enveomics/Scripts/FastA.qlen.pl +0 -57
  117. data/utils/enveomics/Scripts/FastA.rename.pl +0 -65
  118. data/utils/enveomics/Scripts/FastA.revcom.pl +0 -23
  119. data/utils/enveomics/Scripts/FastA.sample.rb +0 -98
  120. data/utils/enveomics/Scripts/FastA.slider.pl +0 -85
  121. data/utils/enveomics/Scripts/FastA.split.pl +0 -55
  122. data/utils/enveomics/Scripts/FastA.split.rb +0 -79
  123. data/utils/enveomics/Scripts/FastA.subsample.pl +0 -131
  124. data/utils/enveomics/Scripts/FastA.tag.rb +0 -65
  125. data/utils/enveomics/Scripts/FastA.toFastQ.rb +0 -69
  126. data/utils/enveomics/Scripts/FastA.wrap.rb +0 -48
  127. data/utils/enveomics/Scripts/FastQ.filter.pl +0 -54
  128. data/utils/enveomics/Scripts/FastQ.interpose.pl +0 -90
  129. data/utils/enveomics/Scripts/FastQ.maskQual.rb +0 -89
  130. data/utils/enveomics/Scripts/FastQ.offset.pl +0 -90
  131. data/utils/enveomics/Scripts/FastQ.split.pl +0 -53
  132. data/utils/enveomics/Scripts/FastQ.tag.rb +0 -70
  133. data/utils/enveomics/Scripts/FastQ.test-error.rb +0 -81
  134. data/utils/enveomics/Scripts/FastQ.toFastA.awk +0 -24
  135. data/utils/enveomics/Scripts/GFF.catsbj.pl +0 -127
  136. data/utils/enveomics/Scripts/GenBank.add_fields.rb +0 -84
  137. data/utils/enveomics/Scripts/HMM.essential.rb +0 -351
  138. data/utils/enveomics/Scripts/HMM.haai.rb +0 -168
  139. data/utils/enveomics/Scripts/HMMsearch.extractIds.rb +0 -83
  140. data/utils/enveomics/Scripts/JPlace.distances.rb +0 -88
  141. data/utils/enveomics/Scripts/JPlace.to_iToL.rb +0 -320
  142. data/utils/enveomics/Scripts/M5nr.getSequences.rb +0 -81
  143. data/utils/enveomics/Scripts/MeTaxa.distribution.pl +0 -198
  144. data/utils/enveomics/Scripts/MyTaxa.fragsByTax.pl +0 -35
  145. data/utils/enveomics/Scripts/MyTaxa.seq-taxrank.rb +0 -49
  146. data/utils/enveomics/Scripts/NCBIacc2tax.rb +0 -92
  147. data/utils/enveomics/Scripts/Newick.autoprune.R +0 -27
  148. data/utils/enveomics/Scripts/RAxML-EPA.to_iToL.pl +0 -228
  149. data/utils/enveomics/Scripts/RecPlot2.compareIdentities.R +0 -32
  150. data/utils/enveomics/Scripts/RefSeq.download.bash +0 -48
  151. data/utils/enveomics/Scripts/SRA.download.bash +0 -55
  152. data/utils/enveomics/Scripts/TRIBS.plot-test.R +0 -36
  153. data/utils/enveomics/Scripts/TRIBS.test.R +0 -39
  154. data/utils/enveomics/Scripts/Table.barplot.R +0 -31
  155. data/utils/enveomics/Scripts/Table.df2dist.R +0 -30
  156. data/utils/enveomics/Scripts/Table.filter.pl +0 -61
  157. data/utils/enveomics/Scripts/Table.merge.pl +0 -77
  158. data/utils/enveomics/Scripts/Table.prefScore.R +0 -60
  159. data/utils/enveomics/Scripts/Table.replace.rb +0 -69
  160. data/utils/enveomics/Scripts/Table.round.rb +0 -63
  161. data/utils/enveomics/Scripts/Table.split.pl +0 -57
  162. data/utils/enveomics/Scripts/Taxonomy.silva2ncbi.rb +0 -227
  163. data/utils/enveomics/Scripts/VCF.KaKs.rb +0 -147
  164. data/utils/enveomics/Scripts/VCF.SNPs.rb +0 -88
  165. data/utils/enveomics/Scripts/aai.rb +0 -421
  166. data/utils/enveomics/Scripts/ani.rb +0 -362
  167. data/utils/enveomics/Scripts/anir.rb +0 -137
  168. data/utils/enveomics/Scripts/clust.rand.rb +0 -102
  169. data/utils/enveomics/Scripts/gi2tax.rb +0 -103
  170. data/utils/enveomics/Scripts/in_silico_GA_GI.pl +0 -96
  171. data/utils/enveomics/Scripts/lib/data/dupont_2012_essential.hmm.gz +0 -0
  172. data/utils/enveomics/Scripts/lib/data/lee_2019_essential.hmm.gz +0 -0
  173. data/utils/enveomics/Scripts/lib/enveomics.R +0 -1
  174. data/utils/enveomics/Scripts/lib/enveomics_rb/anir.rb +0 -293
  175. data/utils/enveomics/Scripts/lib/enveomics_rb/bm_set.rb +0 -175
  176. data/utils/enveomics/Scripts/lib/enveomics_rb/enveomics.rb +0 -24
  177. data/utils/enveomics/Scripts/lib/enveomics_rb/errors.rb +0 -17
  178. data/utils/enveomics/Scripts/lib/enveomics_rb/gmm_em.rb +0 -30
  179. data/utils/enveomics/Scripts/lib/enveomics_rb/jplace.rb +0 -253
  180. data/utils/enveomics/Scripts/lib/enveomics_rb/match.rb +0 -88
  181. data/utils/enveomics/Scripts/lib/enveomics_rb/og.rb +0 -182
  182. data/utils/enveomics/Scripts/lib/enveomics_rb/rbm.rb +0 -49
  183. data/utils/enveomics/Scripts/lib/enveomics_rb/remote_data.rb +0 -74
  184. data/utils/enveomics/Scripts/lib/enveomics_rb/seq_range.rb +0 -237
  185. data/utils/enveomics/Scripts/lib/enveomics_rb/stats/rand.rb +0 -31
  186. data/utils/enveomics/Scripts/lib/enveomics_rb/stats/sample.rb +0 -152
  187. data/utils/enveomics/Scripts/lib/enveomics_rb/stats.rb +0 -3
  188. data/utils/enveomics/Scripts/lib/enveomics_rb/utils.rb +0 -74
  189. data/utils/enveomics/Scripts/lib/enveomics_rb/vcf.rb +0 -135
  190. data/utils/enveomics/Scripts/ogs.annotate.rb +0 -88
  191. data/utils/enveomics/Scripts/ogs.core-pan.rb +0 -160
  192. data/utils/enveomics/Scripts/ogs.extract.rb +0 -125
  193. data/utils/enveomics/Scripts/ogs.mcl.rb +0 -186
  194. data/utils/enveomics/Scripts/ogs.rb +0 -104
  195. data/utils/enveomics/Scripts/ogs.stats.rb +0 -131
  196. data/utils/enveomics/Scripts/rbm-legacy.rb +0 -172
  197. data/utils/enveomics/Scripts/rbm.rb +0 -108
  198. data/utils/enveomics/Scripts/sam.filter.rb +0 -148
  199. data/utils/enveomics/Tests/Makefile +0 -10
  200. data/utils/enveomics/Tests/Mgen_M2288.faa +0 -3189
  201. data/utils/enveomics/Tests/Mgen_M2288.fna +0 -8282
  202. data/utils/enveomics/Tests/Mgen_M2321.fna +0 -8288
  203. data/utils/enveomics/Tests/Nequ_Kin4M.faa +0 -2970
  204. data/utils/enveomics/Tests/Xanthomonas_oryzae-PilA.tribs.Rdata +0 -0
  205. data/utils/enveomics/Tests/Xanthomonas_oryzae-PilA.txt +0 -7
  206. data/utils/enveomics/Tests/Xanthomonas_oryzae.aai-mat.tsv +0 -17
  207. data/utils/enveomics/Tests/Xanthomonas_oryzae.aai.tsv +0 -137
  208. data/utils/enveomics/Tests/a_mg.cds-go.blast.tsv +0 -123
  209. data/utils/enveomics/Tests/a_mg.reads-cds.blast.tsv +0 -200
  210. data/utils/enveomics/Tests/a_mg.reads-cds.counts.tsv +0 -55
  211. data/utils/enveomics/Tests/alkB.nwk +0 -1
  212. data/utils/enveomics/Tests/anthrax-cansnp-data.tsv +0 -13
  213. data/utils/enveomics/Tests/anthrax-cansnp-key.tsv +0 -17
  214. data/utils/enveomics/Tests/hiv1.faa +0 -59
  215. data/utils/enveomics/Tests/hiv1.fna +0 -134
  216. data/utils/enveomics/Tests/hiv2.faa +0 -70
  217. data/utils/enveomics/Tests/hiv_mix-hiv1.blast.tsv +0 -233
  218. data/utils/enveomics/Tests/hiv_mix-hiv1.blast.tsv.lim +0 -1
  219. data/utils/enveomics/Tests/hiv_mix-hiv1.blast.tsv.rec +0 -233
  220. data/utils/enveomics/Tests/phyla_counts.tsv +0 -10
  221. data/utils/enveomics/Tests/primate_lentivirus.ogs +0 -11
  222. data/utils/enveomics/Tests/primate_lentivirus.rbm/hiv1-hiv1.rbm +0 -9
  223. data/utils/enveomics/Tests/primate_lentivirus.rbm/hiv1-hiv2.rbm +0 -8
  224. data/utils/enveomics/Tests/primate_lentivirus.rbm/hiv1-siv.rbm +0 -6
  225. data/utils/enveomics/Tests/primate_lentivirus.rbm/hiv2-hiv2.rbm +0 -9
  226. data/utils/enveomics/Tests/primate_lentivirus.rbm/hiv2-siv.rbm +0 -6
  227. data/utils/enveomics/Tests/primate_lentivirus.rbm/siv-siv.rbm +0 -6
  228. data/utils/enveomics/build_enveomics_r.bash +0 -45
  229. data/utils/enveomics/enveomics.R/DESCRIPTION +0 -31
  230. data/utils/enveomics/enveomics.R/NAMESPACE +0 -39
  231. data/utils/enveomics/enveomics.R/R/autoprune.R +0 -155
  232. data/utils/enveomics/enveomics.R/R/barplot.R +0 -184
  233. data/utils/enveomics/enveomics.R/R/cliopts.R +0 -135
  234. data/utils/enveomics/enveomics.R/R/df2dist.R +0 -154
  235. data/utils/enveomics/enveomics.R/R/growthcurve.R +0 -331
  236. data/utils/enveomics/enveomics.R/R/prefscore.R +0 -79
  237. data/utils/enveomics/enveomics.R/R/recplot.R +0 -354
  238. data/utils/enveomics/enveomics.R/R/recplot2.R +0 -1631
  239. data/utils/enveomics/enveomics.R/R/tribs.R +0 -583
  240. data/utils/enveomics/enveomics.R/R/utils.R +0 -80
  241. data/utils/enveomics/enveomics.R/README.md +0 -81
  242. data/utils/enveomics/enveomics.R/data/growth.curves.rda +0 -0
  243. data/utils/enveomics/enveomics.R/data/phyla.counts.rda +0 -0
  244. data/utils/enveomics/enveomics.R/man/cash-enve.GrowthCurve-method.Rd +0 -16
  245. data/utils/enveomics/enveomics.R/man/cash-enve.RecPlot2-method.Rd +0 -16
  246. data/utils/enveomics/enveomics.R/man/cash-enve.RecPlot2.Peak-method.Rd +0 -16
  247. data/utils/enveomics/enveomics.R/man/enve.GrowthCurve-class.Rd +0 -25
  248. data/utils/enveomics/enveomics.R/man/enve.TRIBS-class.Rd +0 -46
  249. data/utils/enveomics/enveomics.R/man/enve.TRIBS.merge.Rd +0 -23
  250. data/utils/enveomics/enveomics.R/man/enve.TRIBStest-class.Rd +0 -47
  251. data/utils/enveomics/enveomics.R/man/enve.__prune.iter.Rd +0 -23
  252. data/utils/enveomics/enveomics.R/man/enve.__prune.reduce.Rd +0 -23
  253. data/utils/enveomics/enveomics.R/man/enve.__tribs.Rd +0 -40
  254. data/utils/enveomics/enveomics.R/man/enve.barplot.Rd +0 -103
  255. data/utils/enveomics/enveomics.R/man/enve.cliopts.Rd +0 -67
  256. data/utils/enveomics/enveomics.R/man/enve.col.alpha.Rd +0 -24
  257. data/utils/enveomics/enveomics.R/man/enve.col2alpha.Rd +0 -19
  258. data/utils/enveomics/enveomics.R/man/enve.df2dist.Rd +0 -45
  259. data/utils/enveomics/enveomics.R/man/enve.df2dist.group.Rd +0 -44
  260. data/utils/enveomics/enveomics.R/man/enve.df2dist.list.Rd +0 -47
  261. data/utils/enveomics/enveomics.R/man/enve.growthcurve.Rd +0 -75
  262. data/utils/enveomics/enveomics.R/man/enve.prefscore.Rd +0 -50
  263. data/utils/enveomics/enveomics.R/man/enve.prune.dist.Rd +0 -44
  264. data/utils/enveomics/enveomics.R/man/enve.recplot.Rd +0 -139
  265. data/utils/enveomics/enveomics.R/man/enve.recplot2-class.Rd +0 -45
  266. data/utils/enveomics/enveomics.R/man/enve.recplot2.ANIr.Rd +0 -24
  267. data/utils/enveomics/enveomics.R/man/enve.recplot2.Rd +0 -77
  268. data/utils/enveomics/enveomics.R/man/enve.recplot2.__counts.Rd +0 -25
  269. data/utils/enveomics/enveomics.R/man/enve.recplot2.__peakHist.Rd +0 -21
  270. data/utils/enveomics/enveomics.R/man/enve.recplot2.__whichClosestPeak.Rd +0 -19
  271. data/utils/enveomics/enveomics.R/man/enve.recplot2.changeCutoff.Rd +0 -19
  272. data/utils/enveomics/enveomics.R/man/enve.recplot2.compareIdentities.Rd +0 -47
  273. data/utils/enveomics/enveomics.R/man/enve.recplot2.coordinates.Rd +0 -29
  274. data/utils/enveomics/enveomics.R/man/enve.recplot2.corePeak.Rd +0 -18
  275. data/utils/enveomics/enveomics.R/man/enve.recplot2.extractWindows.Rd +0 -45
  276. data/utils/enveomics/enveomics.R/man/enve.recplot2.findPeaks.Rd +0 -36
  277. data/utils/enveomics/enveomics.R/man/enve.recplot2.findPeaks.__em_e.Rd +0 -19
  278. data/utils/enveomics/enveomics.R/man/enve.recplot2.findPeaks.__em_m.Rd +0 -19
  279. data/utils/enveomics/enveomics.R/man/enve.recplot2.findPeaks.__emauto_one.Rd +0 -27
  280. data/utils/enveomics/enveomics.R/man/enve.recplot2.findPeaks.__mow_one.Rd +0 -52
  281. data/utils/enveomics/enveomics.R/man/enve.recplot2.findPeaks.__mower.Rd +0 -17
  282. data/utils/enveomics/enveomics.R/man/enve.recplot2.findPeaks.em.Rd +0 -51
  283. data/utils/enveomics/enveomics.R/man/enve.recplot2.findPeaks.emauto.Rd +0 -43
  284. data/utils/enveomics/enveomics.R/man/enve.recplot2.findPeaks.mower.Rd +0 -82
  285. data/utils/enveomics/enveomics.R/man/enve.recplot2.peak-class.Rd +0 -59
  286. data/utils/enveomics/enveomics.R/man/enve.recplot2.seqdepth.Rd +0 -27
  287. data/utils/enveomics/enveomics.R/man/enve.recplot2.windowDepthThreshold.Rd +0 -36
  288. data/utils/enveomics/enveomics.R/man/enve.selvector.Rd +0 -23
  289. data/utils/enveomics/enveomics.R/man/enve.tribs.Rd +0 -68
  290. data/utils/enveomics/enveomics.R/man/enve.tribs.test.Rd +0 -28
  291. data/utils/enveomics/enveomics.R/man/enve.truncate.Rd +0 -27
  292. data/utils/enveomics/enveomics.R/man/growth.curves.Rd +0 -14
  293. data/utils/enveomics/enveomics.R/man/phyla.counts.Rd +0 -13
  294. data/utils/enveomics/enveomics.R/man/plot.enve.GrowthCurve.Rd +0 -78
  295. data/utils/enveomics/enveomics.R/man/plot.enve.TRIBS.Rd +0 -46
  296. data/utils/enveomics/enveomics.R/man/plot.enve.TRIBStest.Rd +0 -45
  297. data/utils/enveomics/enveomics.R/man/plot.enve.recplot2.Rd +0 -125
  298. data/utils/enveomics/enveomics.R/man/summary.enve.GrowthCurve.Rd +0 -19
  299. data/utils/enveomics/enveomics.R/man/summary.enve.TRIBS.Rd +0 -19
  300. data/utils/enveomics/enveomics.R/man/summary.enve.TRIBStest.Rd +0 -19
  301. data/utils/enveomics/globals.mk +0 -8
  302. data/utils/enveomics/manifest.json +0 -9
  303. data/utils/multitrim/Multitrim How-To.pdf +0 -0
  304. data/utils/multitrim/README.md +0 -67
  305. data/utils/multitrim/multitrim.py +0 -1555
  306. data/utils/multitrim/multitrim.yml +0 -13
@@ -1,13 +0,0 @@
-
- \name{phyla.counts}
- \docType{data}
- \alias{phyla.counts}
- \title{Counts of microbial phyla in four sites}
- \description{
- This data set gives the counts of phyla in three different
- sites.
- }
- \usage{phyla.counts}
- \format{A data frame with 9 rows (phyla) and 4 rows (sites).}
- \keyword{datasets}
-
@@ -1,78 +0,0 @@
- % Generated by roxygen2: do not edit by hand
- % Please edit documentation in R/growthcurve.R
- \name{plot.enve.GrowthCurve}
- \alias{plot.enve.GrowthCurve}
- \title{Enveomics: Plot of Growth Curve}
- \usage{
- \method{plot}{enve.GrowthCurve}(
- x,
- col,
- pt.alpha = 0.9,
- ln.alpha = 1,
- ln.lwd = 1,
- ln.lty = 1,
- band.alpha = 0.4,
- band.density = NULL,
- band.angle = 45,
- xp.alpha = 0.5,
- xp.lwd = 1,
- xp.lty = 1,
- pch = 19,
- new = TRUE,
- legend = new,
- add.params = FALSE,
- ...
- )
- }
- \arguments{
- \item{x}{An \code{\link{enve.GrowthCurve}} object to plot.}
-
- \item{col}{Base colors to use for the different samples. Can be recycled.
- By default, grey for one sample or rainbow colors for more than one.}
-
- \item{pt.alpha}{Color alpha for the observed data points, using \code{col}
- as a base.}
-
- \item{ln.alpha}{Color alpha for the fitted growth curve, using \code{col}
- as a base.}
-
- \item{ln.lwd}{Line width for the fitted curve.}
-
- \item{ln.lty}{Line type for the fitted curve.}
-
- \item{band.alpha}{Color alpha for the confidence interval band of the
- fitted growth curve, using \code{col} as a base.}
-
- \item{band.density}{Density of the filling pattern in the interval band.
- If \code{NULL}, a solid color is used.}
-
- \item{band.angle}{Angle of the density filling pattern in the interval
- band. Ignored if \code{band.density} is \code{NULL}.}
-
- \item{xp.alpha}{Color alpha for the line connecting individual experiments,
- using \code{col} as a base.}
-
- \item{xp.lwd}{Width of line for the experiments.}
-
- \item{xp.lty}{Type of line for the experiments.}
-
- \item{pch}{Point character for observed data points.}
-
- \item{new}{Should a new plot be generated? If \code{FALSE}, the existing
- canvas is used.}
-
- \item{legend}{Should the plot include a legend? If \code{FALSE}, no legend
- is added. If \code{TRUE}, a legend is added in the bottom-right corner.
- Otherwise, a legend is added in the position specified as \code{xy.coords}.}
-
- \item{add.params}{Should the legend include the parameters of the fitted
- model?}
-
- \item{...}{Any other graphic parameters.}
- }
- \description{
- Plots an \code{\link{enve.GrowthCurve}} object.
- }
- \author{
- Luis M. Rodriguez-R [aut, cre]
- }
@@ -1,46 +0,0 @@
- % Generated by roxygen2: do not edit by hand
- % Please edit documentation in R/tribs.R
- \name{plot.enve.TRIBS}
- \alias{plot.enve.TRIBS}
- \title{Enveomics: TRIBS Plot}
- \usage{
- \method{plot}{enve.TRIBS}(
- x,
- new = TRUE,
- type = c("boxplot", "points"),
- col = "#00000044",
- pt.cex = 1/2,
- pt.pch = 19,
- pt.col = col,
- ln.col = col,
- ...
- )
- }
- \arguments{
- \item{x}{\code{\link{enve.TRIBS}} object to plot.}
-
- \item{new}{Should a new canvas be drawn?}
-
- \item{type}{Type of plot. The \strong{points} plot shows all the replicates, the
- \strong{boxplot} plot represents the values found by
- \code{\link[grDevices]{boxplot.stats}}.
- as areas, and plots the outliers as points.}
-
- \item{col}{Color of the areas and/or the points.}
-
- \item{pt.cex}{Size of the points.}
-
- \item{pt.pch}{Points character.}
-
- \item{pt.col}{Color of the points.}
-
- \item{ln.col}{Color of the lines.}
-
- \item{...}{Any additional parameters supported by \code{plot}.}
- }
- \description{
- Plot an \code{\link{enve.TRIBS}} object.
- }
- \author{
- Luis M. Rodriguez-R [aut, cre]
- }
@@ -1,45 +0,0 @@
- % Generated by roxygen2: do not edit by hand
- % Please edit documentation in R/tribs.R
- \name{plot.enve.TRIBStest}
- \alias{plot.enve.TRIBStest}
- \title{Enveomics: TRIBS Plot Test}
- \usage{
- \method{plot}{enve.TRIBStest}(
- x,
- type = c("overlap", "difference"),
- col = "#00000044",
- col1 = col,
- col2 = "#44001144",
- ylab = "Probability",
- xlim = range(attr(x, "dist.mids")),
- ylim = c(0, max(c(attr(x, "all.dist"), attr(x, "sel.dist")))),
- ...
- )
- }
- \arguments{
- \item{x}{\code{\link{enve.TRIBStest}} object to plot.}
-
- \item{type}{What to plot. \code{overlap} generates a plot of the two contrasting empirical
- PDFs (to compare against each other), \code{difference} produces a plot of the
- differences between the empirical PDFs (to compare against zero).}
-
- \item{col}{Main color of the plot if type=\code{difference}.}
-
- \item{col1}{First color of the plot if type=\code{overlap}.}
-
- \item{col2}{Second color of the plot if type=\code{overlap}.}
-
- \item{ylab}{Y-axis label.}
-
- \item{xlim}{X-axis limits.}
-
- \item{ylim}{Y-axis limits.}
-
- \item{...}{Any other graphical arguments.}
- }
- \description{
- Plots an \code{\link{enve.TRIBStest}} object.
- }
- \author{
- Luis M. Rodriguez-R [aut, cre]
- }
@@ -1,125 +0,0 @@
- % Generated by roxygen2: do not edit by hand
- % Please edit documentation in R/recplot2.R
- \name{plot.enve.RecPlot2}
- \alias{plot.enve.RecPlot2}
- \title{Enveomics: Recruitment Plot (2)}
- \usage{
- \method{plot}{enve.RecPlot2}(
- x,
- layout = matrix(c(5, 5, 2, 1, 4, 3), nrow = 2),
- panel.fun = list(),
- widths = c(1, 7, 2),
- heights = c(1, 2),
- palette = grey((100:0)/100),
- underlay.group = TRUE,
- peaks.col = "darkred",
- use.peaks,
- id.lim = range(x$id.breaks),
- pos.lim = range(x$pos.breaks),
- pos.units = c("Mbp", "Kbp", "bp"),
- mar = list(`1` = c(5, 4, 1, 1) + 0.1, `2` = c(ifelse(any(layout == 1), 1, 5), 4, 4, 1)
- + 0.1, `3` = c(5, ifelse(any(layout == 1), 1, 4), 1, 2) + 0.1, `4` =
- c(ifelse(any(layout == 1), 1, 5), ifelse(any(layout == 2), 1, 4), 4, 2) + 0.1, `5` =
- c(5, 3, 4, 1) + 0.1, `6` = c(5, 4, 4, 2) + 0.1),
- pos.splines = 0,
- id.splines = 1/2,
- in.lwd = ifelse(is.null(pos.splines) || pos.splines > 0, 1/2, 2),
- out.lwd = ifelse(is.null(pos.splines) || pos.splines > 0, 1/2, 2),
- id.lwd = ifelse(is.null(id.splines) || id.splines > 0, 1/2, 2),
- in.col = "darkblue",
- out.col = "lightblue",
- id.col = "black",
- breaks.col = "#AAAAAA40",
- peaks.opts = list(),
- ...
- )
- }
- \arguments{
- \item{x}{\code{\link{enve.RecPlot2}} object to plot.}
-
- \item{layout}{Matrix indicating the position of the different panels in the layout,
- where:
- \itemize{
- \item 0: Empty space
- \item 1: Counts matrix
- \item 2: position histogram (sequencing depth)
- \item 3: identity histogram
- \item 4: Populations histogram (histogram of sequencing depths)
- \item 5: Color scale for the counts matrix (vertical)
- \item 6: Color scale of the counts matrix (horizontal)
- }
- Only panels indicated here will be plotted. To plot only one panel
- simply set this to the number of the panel you want to plot.}
-
- \item{panel.fun}{List of functions to be executed after drawing each panel. Use the
- indices in \code{layout} (as characters) as keys. Functions for indices
- missing in \code{layout} are ignored. For example, to add a vertical line
- at the 3Mbp mark in both the position histogram and the counts matrix:
- \code{list('1'=function() abline(v=3), '2'=function() abline(v=3))}.
- Note that the X-axis in both panels is in Mbp by default. To change
- this behavior, set \code{pos.units} accordingly.}
-
- \item{widths}{Relative widths of the columns of \code{layout}.}
-
- \item{heights}{Relative heights of the rows of \code{layout}.}
-
- \item{palette}{Colors to be used to represent the counts matrix, sorted from no hits
- to the maximum sequencing depth.}
-
- \item{underlay.group}{If TRUE, it indicates the in-group and out-group areas couloured based
- on \code{in.col} and \code{out.col}. Requires support for semi-transparency.}
-
- \item{peaks.col}{If not \code{NA}, it attempts to represent peaks in the population histogram
- in the specified color. Set to \code{NA} to avoid peak-finding.}
-
- \item{use.peaks}{A list of \code{\link{enve.RecPlot2.Peak}} objects, as returned by
- \code{\link{enve.recplot2.findPeaks}}. If passed, \code{peaks.opts} is ignored.}
-
- \item{id.lim}{Limits of identities to represent.}
-
- \item{pos.lim}{Limits of positions to represent (in bp, regardless of \code{pos.units}).}
-
- \item{pos.units}{Units in which the positions should be represented (powers of 1,000
- base pairs).}
-
- \item{mar}{Margins of the panels as a list, with the character representation of
- the number of the panel as index (see \code{layout}).}
-
- \item{pos.splines}{Smoothing parameter for the splines in the position histogram. Zero
- (0) for no splines. Use \code{NULL} to automatically detect by leave-one-out
- cross-validation.}
-
- \item{id.splines}{Smoothing parameter for the splines in the identity histogram. Zero
- (0) for no splines. Use \code{NULL} to automatically detect by leave-one-out
- cross-validation.}
-
- \item{in.lwd}{Line width for the sequencing depth of in-group matches.}
-
- \item{out.lwd}{Line width for the sequencing depth of out-group matches.}
-
- \item{id.lwd}{Line width for the identity histogram.}
-
- \item{in.col}{Color associated to in-group matches.}
-
- \item{out.col}{Color associated to out-group matches.}
-
- \item{id.col}{Color for the identity histogram.}
-
- \item{breaks.col}{Color of the vertical lines indicating sequence breaks.}
-
- \item{peaks.opts}{Options passed to \code{\link{enve.recplot2.findPeaks}},
- if \code{peaks.col} is not \code{NA}.}
-
- \item{...}{Any other graphic parameters (currently ignored).}
- }
- \value{
- Returns a list of \code{\link{enve.RecPlot2.Peak}} objects (see
- \code{\link{enve.recplot2.findPeaks}}). If \code{peaks.col=NA} or
- \code{layout} doesn't include 4, returns \code{NA}.
- }
- \description{
- Plots an \code{\link{enve.RecPlot2}} object.
- }
- \author{
- Luis M. Rodriguez-R [aut, cre]
- }
@@ -1,19 +0,0 @@
- % Generated by roxygen2: do not edit by hand
- % Please edit documentation in R/growthcurve.R
- \name{summary.enve.GrowthCurve}
- \alias{summary.enve.GrowthCurve}
- \title{Enveomics: Summary of Growth Curve}
- \usage{
- \method{summary}{enve.GrowthCurve}(object, ...)
- }
- \arguments{
- \item{object}{An \code{\link{enve.GrowthCurve}} object.}
-
- \item{...}{No additional parameters are currently supported.}
- }
- \description{
- Summary of an \code{\link{enve.GrowthCurve}} object.
- }
- \author{
- Luis M. Rodriguez-R [aut, cre]
- }
@@ -1,19 +0,0 @@
- % Generated by roxygen2: do not edit by hand
- % Please edit documentation in R/tribs.R
- \name{summary.enve.TRIBS}
- \alias{summary.enve.TRIBS}
- \title{Enveomics: TRIBS Summary}
- \usage{
- \method{summary}{enve.TRIBS}(object, ...)
- }
- \arguments{
- \item{object}{\code{\link{enve.TRIBS}} object.}
-
- \item{...}{No additional parameters are currently supported.}
- }
- \description{
- Summary of an \code{\link{enve.TRIBS}} object.
- }
- \author{
- Luis M. Rodriguez-R [aut, cre]
- }
@@ -1,19 +0,0 @@
- % Generated by roxygen2: do not edit by hand
- % Please edit documentation in R/tribs.R
- \name{summary.enve.TRIBStest}
- \alias{summary.enve.TRIBStest}
- \title{Enveomics: TRIBS Summary Test}
- \usage{
- \method{summary}{enve.TRIBStest}(object, ...)
- }
- \arguments{
- \item{object}{\code{\link{enve.TRIBStest}} object.}
-
- \item{...}{No additional parameters are currently supported.}
- }
- \description{
- Summary of an \code{\link{enve.TRIBStest}} object.
- }
- \author{
- Luis M. Rodriguez-R [aut, cre]
- }
@@ -1,8 +0,0 @@
- # Global variables for the Enve-omics collection
-
- R=R
- prefix=/usr/local
- bindir=$(prefix)/bin
- mandir=$(prefix)/man/man1
- SCRIPTS := $(wildcard Scripts/*.*)
-
@@ -1,9 +0,0 @@
- {
- "_": ["This is not standard JSON, to parse use EnveJSON, available at:",
- "https://github.com/lmrodriguezr/enveomics-gui/."],
- "_include": [
- "Manifest/categories.json",
- "Manifest/examples.json",
- "Manifest/tasks.json"
- ]
- }
Binary file
@@ -1,67 +0,0 @@
- # multitrim
-
- This is the development script for the new MiGA trimming approach.
-
- To install the requirements, create a conda environment using multitrim.yml. Navigate to the directory in which multitrim.yml is place, and enter the following command:
-
- conda env create -f multitrim.yml.
-
- This will create a conda environment with the correct tools and will allow you to run the multitrim python script. Activate it using:
-
- conda activate multitrim
-
- The python script can be run for paired end reads as:
-
- python3 multitrim.py -1 [FORWARD READS] -2 [REVERSE READS] --max -o [OUTPUT DIRECTORY]
-
- For single-end reads, run it as:
-
- python3 multitrim.py -u [SE READS] --max -o [OUTPUT DIRECTORY]
-
- NOTE:
-
- Currently Falco is under development. The post-trim QC may fail. If this happens, the python script will issue an error and the post trim QC HTML will be missing. Please email me if this happens.
-
- # User Manual
-
- https://docs.google.com/presentation/d/1U87oUzMn-t-lJwOv3oVLv2oR9CpEVGYJpXV8dsBU4zM/edit?usp=sharing
-
- <hr />
-
- # Workflow Overview
-
- * A subsample of up to 100K reads is taken from the input(s)
- * The subsamples are run through FaQCs with report only mode on (no trimming) to detect adapters. Possible adapters come from this file: https://github.com/bio-miga/miga/blob/master/utils/adapters.fa
- * The adapters detected (if any) are considered present if FaQCs reports them in a default 0.1% of reads. All adapters which are a part of the detected illumina kit(s) are included, e.g. detecting any one ilumina SE adapter will include ALL illumina SE adapters in the trim. The "families" of adapters can be seen at line breaks in the linked adapters.fa file
- * Detected adapters are supplied to both FaQCs and Fastp in succession, so both tools attempt to trim adapters:
- * First, FaQCs is run on the input reads with the -q 27 parameter, meaning that bases with < 27 quality score count against FaQCs' internal score parameter. This causes trimming to occur when enough <27 qual bases are found, and proceeds from both 5' and 3' ends separately.
- * Second, Fastp is run on the post-FaQCs reads using a sliding window of size 3 and min avg. quality of 20. This is identical in behavior to trimmomatic's sliding window, but fastp is faster.
- * Reads < 50 bp in length are removed.
- * The final post-trim reads are output
- * QC reports are performed on pre/post trim reads all at once.
-
- <hr />
-
- # Brief Summary
-
- * Input reads -> FaQCs trims originals -> fastp trims FaQCs outputs -> output reads
-
- <hr />
-
- # Tools multitrim uses
-
- Read Trimmers
-
- * Tools for trimming:
-
- * FaQCs: https://github.com/LANL-Bioinformatics/FaQCs
-
- * Fastp: https://github.com/OpenGene/fastp
-
- QC
-
- * Falco: https://github.com/smithlabcode/falco/tree/master/src
-
- Sampling
-
- * SeqTK: https://github.com/lh3/seqtk
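
Note on the removed multitrim README: the adapter-selection rule and the final length filter in its Workflow Overview can be illustrated with a small Python sketch. This is not code from multitrim.py. The function names, the data shapes (a dict of per-adapter hit counts and a list of adapter-name sets standing in for the "families" of adapters.fa), and the reading of "in a default 0.1% of reads" as "at least 0.1%" are all assumptions made for illustration.

```python
from typing import Dict, Iterable, List, Set

# Defaults quoted from the README text; the names are hypothetical.
DETECTION_THRESHOLD = 0.001  # 0.1% of the subsampled reads
MIN_READ_LENGTH = 50         # reads shorter than 50 bp are removed


def select_adapters(hit_counts: Dict[str, int],
                    total_reads: int,
                    families: Iterable[Set[str]],
                    threshold: float = DETECTION_THRESHOLD) -> Set[str]:
    """Return every adapter whose family has at least one detected member.

    An adapter counts as detected when its share of reads reaches
    `threshold` (assumed here to mean >=).
    """
    if total_reads <= 0:
        return set()
    detected = {name for name, hits in hit_counts.items()
                if hits / total_reads >= threshold}
    selected: Set[str] = set()
    for family in families:
        # One detected member pulls in the whole family.
        if family & detected:
            selected |= family
    return selected


def drop_short_reads(reads: Iterable[str],
                     min_length: int = MIN_READ_LENGTH) -> List[str]:
    """Keep only reads of at least `min_length` bases."""
    return [read for read in reads if len(read) >= min_length]


if __name__ == "__main__":
    families = [{"SE_1", "SE_2"}, {"PE_1", "PE_2"}]
    # SE_1 appears in 0.15% of reads, PE_1 in only 0.01%.
    print(sorted(select_adapters({"SE_1": 150, "PE_1": 10}, 100_000, families)))
```

With the example counts, only the family containing `SE_1` is selected, which mirrors the README's statement that detecting any one SE adapter includes all SE adapters in the trim.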