bioroebe 0.10.80
Sign up to get free protection for your applications and to get access to all the features.
- checksums.yaml +7 -0
- data/LICENSE.md +428 -0
- data/README.md +9280 -0
- data/bin/advanced_dotplot +7 -0
- data/bin/align_open_reading_frames +12 -0
- data/bin/all_positions_of_this_nucleotide +7 -0
- data/bin/aminoacid_composition +7 -0
- data/bin/aminoacid_frequencies +12 -0
- data/bin/aminoacid_substitution +7 -0
- data/bin/automatically_rename_this_fasta_file +7 -0
- data/bin/base_composition +7 -0
- data/bin/batch_create_windows_executables +7 -0
- data/bin/biomart_console +11 -0
- data/bin/bioroebe +27 -0
- data/bin/bioroebe_controller +10 -0
- data/bin/bioshell +26 -0
- data/bin/blosum_2D_table +12 -0
- data/bin/calculate_n50_value +12 -0
- data/bin/chunked_display +12 -0
- data/bin/codon_frequency +9 -0
- data/bin/codon_to_aminoacid +30 -0
- data/bin/colourize_this_fasta_sequence +14 -0
- data/bin/compact_fasta_file +7 -0
- data/bin/complement +7 -0
- data/bin/complementary_dna_strand +12 -0
- data/bin/complementary_rna_strand +12 -0
- data/bin/compseq +7 -0
- data/bin/consensus_sequence +17 -0
- data/bin/count_AT +12 -0
- data/bin/count_GC +12 -0
- data/bin/create_random_aminoacids +7 -0
- data/bin/decode_this_aminoacid_sequence +20 -0
- data/bin/deduce_aminoacid_sequence +13 -0
- data/bin/deduce_most_likely_aminoacid_sequence +7 -0
- data/bin/display_aminoacid_table +12 -0
- data/bin/display_open_reading_frames +7 -0
- data/bin/dna_to_aminoacid_sequence +7 -0
- data/bin/dna_to_rna +7 -0
- data/bin/downcase_chunked_display +12 -0
- data/bin/download_this_pdb +7 -0
- data/bin/fasta_index +7 -0
- data/bin/fetch_data_from_uniprot +12 -0
- data/bin/filter_away_invalid_nucleotides +12 -0
- data/bin/find_substring +19 -0
- data/bin/genbank_to_fasta +7 -0
- data/bin/hamming_distance +12 -0
- data/bin/input_as_dna +12 -0
- data/bin/is_palindrome +13 -0
- data/bin/leading_five_prime +7 -0
- data/bin/levensthein +7 -0
- data/bin/longest_ORF +14 -0
- data/bin/longest_substring +12 -0
- data/bin/n_stop_codons_in_this_sequence +15 -0
- data/bin/open_reading_frames +14 -0
- data/bin/overwrite_fasta_header +7 -0
- data/bin/palindrome_2D_structure +7 -0
- data/bin/palindrome_generator +7 -0
- data/bin/parse_fasta +7 -0
- data/bin/partner_nucleotide +9 -0
- data/bin/possible_codons_for_this_aminoacid +12 -0
- data/bin/random_dna_sequence +12 -0
- data/bin/random_sequence +12 -0
- data/bin/raw_hamming_distance +12 -0
- data/bin/return_longest_substring_via_LCS_algorithm +7 -0
- data/bin/reverse_complement +7 -0
- data/bin/reverse_sequence +7 -0
- data/bin/ruler +12 -0
- data/bin/scan_this_input_for_startcodons +12 -0
- data/bin/short_aminoacid_letter_from_long_aminoacid_name +7 -0
- data/bin/show_atomic_composition +7 -0
- data/bin/show_codon_usage +12 -0
- data/bin/show_fasta_header +7 -0
- data/bin/show_nucleotide_sequence +7 -0
- data/bin/show_this_codon_table +7 -0
- data/bin/show_this_dna_sequence +7 -0
- data/bin/showorf +14 -0
- data/bin/simplify_fasta +7 -0
- data/bin/sort_aminoacid_based_on_its_hydrophobicity +7 -0
- data/bin/split_this_fasta_file_into_chromosomes +7 -0
- data/bin/strict_filter_away_invalid_aminoacids +7 -0
- data/bin/taxonomy +63 -0
- data/bin/three_to_one +7 -0
- data/bin/to_rna +7 -0
- data/bin/trailing_three_prime +7 -0
- data/bin/trypsin_digest +7 -0
- data/bin/upcase_this_aminoacid_sequence_and_remove_numbers +7 -0
- data/bioroebe.gemspec +97 -0
- data/doc/IUPAC_aminoacids_code.md +36 -0
- data/doc/IUPAC_nucleotide_code.md +19 -0
- data/doc/README.gen +9237 -0
- data/doc/blosum.md +5 -0
- data/doc/compatibility/BIO_PHP.md +37 -0
- data/doc/compatibility/README.md +3 -0
- data/doc/compatibility/emboss.md +56 -0
- data/doc/extensive_usage_example.md +35 -0
- data/doc/german_names_for_the_aminoacids.md +27 -0
- data/doc/instructions_for_the_taxonomy_subproject.md +504 -0
- data/doc/legacy_paths.md +9 -0
- data/doc/pdb_ATOM_entry.md +33 -0
- data/doc/quality_control/README.md +2 -0
- data/doc/quality_control/commandline_applications.md +13 -0
- data/doc/resources.md +23 -0
- data/doc/setup.rb +1655 -0
- data/doc/statistics/statistics.md +41 -0
- data/doc/todo/README.md +5 -0
- data/doc/todo/bioroebe_GUI_todo.md +15 -0
- data/doc/todo/bioroebe_todo.md +2823 -0
- data/doc/using_biomart.md +258 -0
- data/html/test.html +144 -0
- data/lib/bioroebe/aminoacids/aminoacid_substitution.rb +118 -0
- data/lib/bioroebe/aminoacids/aminoacids_mass_table.rb +118 -0
- data/lib/bioroebe/aminoacids/codon_percentage.rb +189 -0
- data/lib/bioroebe/aminoacids/colourize_hydrophilic_and_hydrophobic_aminoacids.rb +110 -0
- data/lib/bioroebe/aminoacids/create_random_aminoacids.rb +221 -0
- data/lib/bioroebe/aminoacids/deduce_aminoacid_sequence.rb +460 -0
- data/lib/bioroebe/aminoacids/display_aminoacid_table.rb +231 -0
- data/lib/bioroebe/aminoacids/show_hydrophobicity.rb +184 -0
- data/lib/bioroebe/annotations/README.md +2 -0
- data/lib/bioroebe/annotations/create_annotation_format.rb +208 -0
- data/lib/bioroebe/autoinclude.rb +7 -0
- data/lib/bioroebe/base/base.rb +35 -0
- data/lib/bioroebe/base/colours.rb +14 -0
- data/lib/bioroebe/base/colours_for_base/colours_for_base.rb +275 -0
- data/lib/bioroebe/base/commandline_application/README.md +7 -0
- data/lib/bioroebe/base/commandline_application/aminoacids.rb +33 -0
- data/lib/bioroebe/base/commandline_application/commandline_application.rb +37 -0
- data/lib/bioroebe/base/commandline_application/commandline_arguments.rb +144 -0
- data/lib/bioroebe/base/commandline_application/directory.rb +33 -0
- data/lib/bioroebe/base/commandline_application/extract.rb +22 -0
- data/lib/bioroebe/base/commandline_application/misc.rb +485 -0
- data/lib/bioroebe/base/commandline_application/opn.rb +47 -0
- data/lib/bioroebe/base/commandline_application/reset.rb +40 -0
- data/lib/bioroebe/base/commandline_application/warnings.rb +36 -0
- data/lib/bioroebe/base/commandline_application/write_what_into.rb +29 -0
- data/lib/bioroebe/base/initialize.rb +18 -0
- data/lib/bioroebe/base/misc.rb +94 -0
- data/lib/bioroebe/base/namespace.rb +16 -0
- data/lib/bioroebe/base/prototype/README.md +12 -0
- data/lib/bioroebe/base/prototype/e_and_ee.rb +24 -0
- data/lib/bioroebe/base/prototype/misc.rb +108 -0
- data/lib/bioroebe/base/prototype/mkdir.rb +20 -0
- data/lib/bioroebe/base/prototype/prototype.rb +21 -0
- data/lib/bioroebe/base/prototype/reset.rb +26 -0
- data/lib/bioroebe/base/reset.rb +11 -0
- data/lib/bioroebe/biomart/LICENSE.md +27 -0
- data/lib/bioroebe/biomart/attribute.rb +77 -0
- data/lib/bioroebe/biomart/biomart.rb +227 -0
- data/lib/bioroebe/biomart/database.rb +128 -0
- data/lib/bioroebe/biomart/dataset.rb +572 -0
- data/lib/bioroebe/biomart/filter.rb +97 -0
- data/lib/bioroebe/biomart/server.rb +152 -0
- data/lib/bioroebe/blosum/blosum.rb +88 -0
- data/lib/bioroebe/calculate/calculate_blosum_score.rb +145 -0
- data/lib/bioroebe/calculate/calculate_gc_content.rb +301 -0
- data/lib/bioroebe/calculate/calculate_levensthein_distance.rb +100 -0
- data/lib/bioroebe/calculate/calculate_melting_temperature.rb +398 -0
- data/lib/bioroebe/calculate/calculate_melting_temperature_for_more_than_thirteen_nucleotides.rb +304 -0
- data/lib/bioroebe/calculate/calculate_the_position_specific_scoring_matrix.rb +166 -0
- data/lib/bioroebe/cell/README.md +1 -0
- data/lib/bioroebe/cell/cell.rb +63 -0
- data/lib/bioroebe/cleave_and_digest/README.md +2 -0
- data/lib/bioroebe/cleave_and_digest/cleave.rb +80 -0
- data/lib/bioroebe/cleave_and_digest/digestion.rb +75 -0
- data/lib/bioroebe/cleave_and_digest/trypsin.rb +192 -0
- data/lib/bioroebe/codon_tables/README.md +9 -0
- data/lib/bioroebe/codon_tables/frequencies/287_Pseudomonas_aeruginosa.yml +101 -0
- data/lib/bioroebe/codon_tables/frequencies/3702_Arabidopsis_thaliana.yml +77 -0
- data/lib/bioroebe/codon_tables/frequencies/4932_Saccharomyces_cerevisiae.yml +103 -0
- data/lib/bioroebe/codon_tables/frequencies/7227_Drosophila_melanogaster.yml +71 -0
- data/lib/bioroebe/codon_tables/frequencies/83333_Escherichia_coli_K12.yml +103 -0
- data/lib/bioroebe/codon_tables/frequencies/9606_Homo_sapiens.yml +123 -0
- data/lib/bioroebe/codon_tables/frequencies/9685_Felis_catus.yml +78 -0
- data/lib/bioroebe/codon_tables/frequencies/README.md +10 -0
- data/lib/bioroebe/codon_tables/frequencies/parse_frequency_table.rb +337 -0
- data/lib/bioroebe/codons/README.md +28 -0
- data/lib/bioroebe/codons/codon_table.rb +416 -0
- data/lib/bioroebe/codons/codon_tables.rb +123 -0
- data/lib/bioroebe/codons/codons.rb +517 -0
- data/lib/bioroebe/codons/convert_this_codon_to_that_aminoacid.rb +102 -0
- data/lib/bioroebe/codons/detect_minimal_codon.rb +180 -0
- data/lib/bioroebe/codons/determine_optimal_codons.rb +74 -0
- data/lib/bioroebe/codons/possible_codons_for_this_aminoacid.rb +380 -0
- data/lib/bioroebe/codons/sanitize_codon_frequency.rb +144 -0
- data/lib/bioroebe/codons/show_codon_tables.rb +130 -0
- data/lib/bioroebe/codons/show_codon_usage.rb +197 -0
- data/lib/bioroebe/codons/show_this_codon_table.rb +573 -0
- data/lib/bioroebe/codons/start_codons.rb +105 -0
- data/lib/bioroebe/colours/colour_schemes/README.md +10 -0
- data/lib/bioroebe/colours/colour_schemes/array_available_colour_schemes.rb +38 -0
- data/lib/bioroebe/colours/colour_schemes/buried.rb +70 -0
- data/lib/bioroebe/colours/colour_schemes/colour_scheme.rb +101 -0
- data/lib/bioroebe/colours/colour_schemes/colour_scheme_demo.rb +262 -0
- data/lib/bioroebe/colours/colour_schemes/helix.rb +65 -0
- data/lib/bioroebe/colours/colour_schemes/hydropathy.rb +70 -0
- data/lib/bioroebe/colours/colour_schemes/nucleotide.rb +47 -0
- data/lib/bioroebe/colours/colour_schemes/score.rb +112 -0
- data/lib/bioroebe/colours/colour_schemes/simple.rb +42 -0
- data/lib/bioroebe/colours/colour_schemes/strand.rb +65 -0
- data/lib/bioroebe/colours/colour_schemes/taylor.rb +58 -0
- data/lib/bioroebe/colours/colour_schemes/turn.rb +65 -0
- data/lib/bioroebe/colours/colour_schemes/zappo.rb +59 -0
- data/lib/bioroebe/colours/colourize_sequence.rb +262 -0
- data/lib/bioroebe/colours/colours.rb +119 -0
- data/lib/bioroebe/colours/misc_colours.rb +80 -0
- data/lib/bioroebe/colours/rev.rb +41 -0
- data/lib/bioroebe/colours/sdir.rb +21 -0
- data/lib/bioroebe/colours/sfancy.rb +21 -0
- data/lib/bioroebe/colours/sfile.rb +21 -0
- data/lib/bioroebe/colours/simp.rb +21 -0
- data/lib/bioroebe/colours/swarn.rb +29 -0
- data/lib/bioroebe/colours/use_colours.rb +27 -0
- data/lib/bioroebe/configuration/configuration.rb +114 -0
- data/lib/bioroebe/configuration/constants.rb +35 -0
- data/lib/bioroebe/constants/GUIs.rb +79 -0
- data/lib/bioroebe/constants/aminoacids_and_proteins.rb +146 -0
- data/lib/bioroebe/constants/base_directory.rb +120 -0
- data/lib/bioroebe/constants/carriage_return.rb +14 -0
- data/lib/bioroebe/constants/codon_tables.rb +77 -0
- data/lib/bioroebe/constants/database_constants.rb +107 -0
- data/lib/bioroebe/constants/files_and_directories.rb +579 -0
- data/lib/bioroebe/constants/misc.rb +189 -0
- data/lib/bioroebe/constants/newline.rb +14 -0
- data/lib/bioroebe/constants/nucleotides.rb +114 -0
- data/lib/bioroebe/constants/regex.rb +28 -0
- data/lib/bioroebe/constants/roebe.rb +38 -0
- data/lib/bioroebe/constants/row_terminator.rb +16 -0
- data/lib/bioroebe/constants/tabulator.rb +14 -0
- data/lib/bioroebe/constants/unicode.rb +12 -0
- data/lib/bioroebe/constants/urls.rb +50 -0
- data/lib/bioroebe/conversions/README.md +3 -0
- data/lib/bioroebe/conversions/convert_aminoacid_to_dna.rb +298 -0
- data/lib/bioroebe/conversions/dna_to_aminoacid_sequence.rb +569 -0
- data/lib/bioroebe/count/README.md +1 -0
- data/lib/bioroebe/count/count_amount_of_aminoacids.rb +352 -0
- data/lib/bioroebe/count/count_amount_of_nucleotides.rb +491 -0
- data/lib/bioroebe/count/count_at.rb +39 -0
- data/lib/bioroebe/count/count_gc.rb +43 -0
- data/lib/bioroebe/css/README.md +5 -0
- data/lib/bioroebe/css/project.css +121 -0
- data/lib/bioroebe/data/README.md +10 -0
- data/lib/bioroebe/data/bam/README.md +1 -0
- data/lib/bioroebe/data/data.txt +192 -0
- data/lib/bioroebe/data/fasta/GFP_mutant_3_coding_sequence.fasta +12 -0
- data/lib/bioroebe/data/fasta/alu_elements.fasta +42 -0
- data/lib/bioroebe/data/fasta/lady_slippers_orchid.fasta +1197 -0
- data/lib/bioroebe/data/fasta/loxP.fasta +2 -0
- data/lib/bioroebe/data/fasta/ls_orchid.fasta +1197 -0
- data/lib/bioroebe/data/fasta/pax6_in_mouse.fasta +1 -0
- data/lib/bioroebe/data/fasta/test.fasta +7 -0
- data/lib/bioroebe/data/fasta/test_DNA.fasta +1 -0
- data/lib/bioroebe/data/fastq/fastq_example_file.fastq +32 -0
- data/lib/bioroebe/data/fastq/fastq_example_file_SP1.fastq +1000 -0
- data/lib/bioroebe/data/fastq/one_random_fastq_entry.fastq +4 -0
- data/lib/bioroebe/data/genbank/sample_file.genbank +15 -0
- data/lib/bioroebe/data/genbank/standard.fasta +3 -0
- data/lib/bioroebe/data/gff/Escherichia_coli_K12_plasmid_F_DNA_NC_002483.1.gff3 +345 -0
- data/lib/bioroebe/data/gff/sample.gff +2 -0
- data/lib/bioroebe/data/gff/test_gene.gff +4 -0
- data/lib/bioroebe/data/gff/transcripts.gff +16 -0
- data/lib/bioroebe/data/gtf/README.md +1 -0
- data/lib/bioroebe/data/json/example_config.json +48 -0
- data/lib/bioroebe/data/pdb/1VII.pdb +754 -0
- data/lib/bioroebe/data/pdb/ala_phe_ala.pdb +228 -0
- data/lib/bioroebe/data/pdb/rcsb_pdb_1VII.fasta +2 -0
- data/lib/bioroebe/data/phylip/README.md +11 -0
- data/lib/bioroebe/data/phylip/example.phylip +7 -0
- data/lib/bioroebe/data/svg/example.svg +301 -0
- data/lib/bioroebe/databases/README.md +1 -0
- data/lib/bioroebe/databases/download_taxonomy_database.rb +102 -0
- data/lib/bioroebe/dotplots/README.md +5 -0
- data/lib/bioroebe/dotplots/advanced_dotplot.rb +256 -0
- data/lib/bioroebe/dotplots/dotplot.rb +184 -0
- data/lib/bioroebe/electron_microscopy/coordinate_analyzer.rb +191 -0
- data/lib/bioroebe/electron_microscopy/fix_pos_file.rb +102 -0
- data/lib/bioroebe/electron_microscopy/generate_em2em_file.rb +122 -0
- data/lib/bioroebe/electron_microscopy/parse_coordinates.rb +197 -0
- data/lib/bioroebe/electron_microscopy/read_file_xmd.rb +282 -0
- data/lib/bioroebe/electron_microscopy/simple_star_file_generator.rb +131 -0
- data/lib/bioroebe/encoding/README.md +2 -0
- data/lib/bioroebe/encoding/encoding.rb +45 -0
- data/lib/bioroebe/enzymes/README.md +2 -0
- data/lib/bioroebe/enzymes/has_this_restriction_enzyme.rb +46 -0
- data/lib/bioroebe/enzymes/restriction_enzyme.rb +200 -0
- data/lib/bioroebe/enzymes/restriction_enzymes_file.rb +72 -0
- data/lib/bioroebe/enzymes/return_restriction_enzyme_sequence_and_cut_position.rb +80 -0
- data/lib/bioroebe/enzymes/return_sequence_that_is_cut_via_restriction_enzyme.rb +65 -0
- data/lib/bioroebe/enzymes/show_restriction_enzymes.rb +119 -0
- data/lib/bioroebe/exceptions/README.md +2 -0
- data/lib/bioroebe/exceptions/exceptions.rb +17 -0
- data/lib/bioroebe/ext/LICENCE.md +5 -0
- data/lib/bioroebe/ext/README.md +7 -0
- data/lib/bioroebe/ext/main.cpp +45 -0
- data/lib/bioroebe/ext/nucleotide.cpp +24 -0
- data/lib/bioroebe/ext/nussinov_algorithm.cpp +348 -0
- data/lib/bioroebe/ext/sequence +0 -0
- data/lib/bioroebe/ext/sequence.cpp +162 -0
- data/lib/bioroebe/fasta_and_fastq/README.md +6 -0
- data/lib/bioroebe/fasta_and_fastq/autocorrect_the_name_of_this_fasta_file.rb +88 -0
- data/lib/bioroebe/fasta_and_fastq/compact_fasta_file/compact_fasta_file.rb +151 -0
- data/lib/bioroebe/fasta_and_fastq/display_how_many_fasta_entries_are_in_this_directory.rb +111 -0
- data/lib/bioroebe/fasta_and_fastq/download_fasta.rb +248 -0
- data/lib/bioroebe/fasta_and_fastq/fasta_defline/README.md +2 -0
- data/lib/bioroebe/fasta_and_fastq/fasta_defline/fasta_defline.rb +113 -0
- data/lib/bioroebe/fasta_and_fastq/fasta_parser.rb +167 -0
- data/lib/bioroebe/fasta_and_fastq/fastq_format_explainer.rb +131 -0
- data/lib/bioroebe/fasta_and_fastq/length_modifier/length_modifier.rb +87 -0
- data/lib/bioroebe/fasta_and_fastq/parse_fasta/constants.rb +50 -0
- data/lib/bioroebe/fasta_and_fastq/parse_fasta/initialize.rb +86 -0
- data/lib/bioroebe/fasta_and_fastq/parse_fasta/menu.rb +117 -0
- data/lib/bioroebe/fasta_and_fastq/parse_fasta/misc.rb +981 -0
- data/lib/bioroebe/fasta_and_fastq/parse_fasta/parse_fasta.rb +27 -0
- data/lib/bioroebe/fasta_and_fastq/parse_fasta/report.rb +156 -0
- data/lib/bioroebe/fasta_and_fastq/parse_fasta/reset.rb +128 -0
- data/lib/bioroebe/fasta_and_fastq/parse_fasta/run.rb +20 -0
- data/lib/bioroebe/fasta_and_fastq/parse_fastq/parse_fastq.rb +83 -0
- data/lib/bioroebe/fasta_and_fastq/return_fasta_subsection_of_this_file.rb +112 -0
- data/lib/bioroebe/fasta_and_fastq/show_fasta_headers.rb +135 -0
- data/lib/bioroebe/fasta_and_fastq/show_fasta_statistics.rb +188 -0
- data/lib/bioroebe/fasta_and_fastq/simplify_fasta_header/simplify_fasta_header.rb +111 -0
- data/lib/bioroebe/fasta_and_fastq/split_this_fasta_file_into_chromosomes/constants.rb +26 -0
- data/lib/bioroebe/fasta_and_fastq/split_this_fasta_file_into_chromosomes/menu.rb +41 -0
- data/lib/bioroebe/fasta_and_fastq/split_this_fasta_file_into_chromosomes/misc.rb +23 -0
- data/lib/bioroebe/fasta_and_fastq/split_this_fasta_file_into_chromosomes/reset.rb +68 -0
- data/lib/bioroebe/fasta_and_fastq/split_this_fasta_file_into_chromosomes/split_this_fasta_file_into_chromosomes.rb +290 -0
- data/lib/bioroebe/genbank/README.md +1 -0
- data/lib/bioroebe/genbank/genbank_flat_file_format_generator.rb +275 -0
- data/lib/bioroebe/genbank/genbank_parser.rb +291 -0
- data/lib/bioroebe/gene/gene.rb +64 -0
- data/lib/bioroebe/genomes/genome_pattern.rb +165 -0
- data/lib/bioroebe/genomes/genome_retriever.rb +79 -0
- data/lib/bioroebe/gui/experimental/README.md +1 -0
- data/lib/bioroebe/gui/experimental/snapgene/snapgene.rb +147 -0
- data/lib/bioroebe/gui/gtk3/README.md +2 -0
- data/lib/bioroebe/gui/gtk3/alignment/alignment.rb +337 -0
- data/lib/bioroebe/gui/gtk3/aminoacid_composition/aminoacid_composition.rb +510 -0
- data/lib/bioroebe/gui/gtk3/aminoacid_composition/customized_dialog.rb +55 -0
- data/lib/bioroebe/gui/gtk3/anti_sense_strand/anti_sense_strand.config +6 -0
- data/lib/bioroebe/gui/gtk3/anti_sense_strand/anti_sense_strand.rb +29 -0
- data/lib/bioroebe/gui/gtk3/blosum_matrix_viewer/blosum_matrix_viewer.rb +196 -0
- data/lib/bioroebe/gui/gtk3/calculate_cell_numbers_of_bacteria/calculate_cell_numbers_of_bacteria.config +6 -0
- data/lib/bioroebe/gui/gtk3/calculate_cell_numbers_of_bacteria/calculate_cell_numbers_of_bacteria.rb +106 -0
- data/lib/bioroebe/gui/gtk3/controller/controller.rb +406 -0
- data/lib/bioroebe/gui/gtk3/dna_to_aminoacid_widget/dna_to_aminoacid_widget.rb +609 -0
- data/lib/bioroebe/gui/gtk3/dna_to_reverse_complement_widget/dna_to_reverse_complement_widget.config +6 -0
- data/lib/bioroebe/gui/gtk3/dna_to_reverse_complement_widget/dna_to_reverse_complement_widget.rb +189 -0
- data/lib/bioroebe/gui/gtk3/fasta_table_widget/fasta_table_widget.rb +245 -0
- data/lib/bioroebe/gui/gtk3/format_converter/format_converter.rb +346 -0
- data/lib/bioroebe/gui/gtk3/gene/gene.rb +182 -0
- data/lib/bioroebe/gui/gtk3/hamming_distance/hamming_distance.config +6 -0
- data/lib/bioroebe/gui/gtk3/hamming_distance/hamming_distance.rb +370 -0
- data/lib/bioroebe/gui/gtk3/levensthein_distance/levensthein_distance.config +6 -0
- data/lib/bioroebe/gui/gtk3/levensthein_distance/levensthein_distance.rb +175 -0
- data/lib/bioroebe/gui/gtk3/nucleotide_analyser/nucleotide_analyser.rb +428 -0
- data/lib/bioroebe/gui/gtk3/parse_pdb_file/parse_pdb_file.rb +342 -0
- data/lib/bioroebe/gui/gtk3/primer_design_widget/primer_design_widget.rb +580 -0
- data/lib/bioroebe/gui/gtk3/protein_to_DNA/protein_to_DNA.config +6 -0
- data/lib/bioroebe/gui/gtk3/protein_to_DNA/protein_to_DNA.rb +182 -0
- data/lib/bioroebe/gui/gtk3/random_sequence/random_sequence.rb +566 -0
- data/lib/bioroebe/gui/gtk3/restriction_enzymes/restriction_enzymes.config +6 -0
- data/lib/bioroebe/gui/gtk3/restriction_enzymes/restriction_enzymes.rb +329 -0
- data/lib/bioroebe/gui/gtk3/show_codon_table/misc.rb +556 -0
- data/lib/bioroebe/gui/gtk3/show_codon_table/show_codon_table.rb +171 -0
- data/lib/bioroebe/gui/gtk3/show_codon_usage/show_codon_usage.rb +146 -0
- data/lib/bioroebe/gui/gtk3/sizeseq/sizeseq.rb +207 -0
- data/lib/bioroebe/gui/gtk3/three_to_one/three_to_one.rb +279 -0
- data/lib/bioroebe/gui/gtk3/three_to_one/title.rb +23 -0
- data/lib/bioroebe/gui/gtk3/www_finder/www_finder.config +6 -0
- data/lib/bioroebe/gui/gtk3/www_finder/www_finder.rb +368 -0
- data/lib/bioroebe/gui/jruby/blosum_matrix_viewer/blosum_matrix_viewer.rb +82 -0
- data/lib/bioroebe/gui/libui/README.md +4 -0
- data/lib/bioroebe/gui/libui/alignment/alignment.rb +114 -0
- data/lib/bioroebe/gui/libui/blosum_matrix_viewer/blosum_matrix_viewer.rb +112 -0
- data/lib/bioroebe/gui/libui/calculate_cell_numbers_of_bacteria/calculate_cell_numbers_of_bacteria.rb +60 -0
- data/lib/bioroebe/gui/libui/dna_to_aminoacid_widget/dna_to_aminoacid_widget.rb +161 -0
- data/lib/bioroebe/gui/libui/dna_to_reverse_complement_widget/dna_to_reverse_complement_widget.rb +76 -0
- data/lib/bioroebe/gui/libui/hamming_distance/hamming_distance.rb +135 -0
- data/lib/bioroebe/gui/libui/levensthein_distance/levensthein_distance.rb +118 -0
- data/lib/bioroebe/gui/libui/protein_to_DNA/protein_to_DNA.rb +115 -0
- data/lib/bioroebe/gui/libui/random_sequence/random_sequence.rb +174 -0
- data/lib/bioroebe/gui/libui/show_codon_table/show_codon_table.rb +132 -0
- data/lib/bioroebe/gui/libui/show_codon_usage/show_codon_usage.rb +89 -0
- data/lib/bioroebe/gui/libui/three_to_one/three_to_one.rb +111 -0
- data/lib/bioroebe/gui/shared_code/blosum_matrix_viewer/blosum_matrix_viewer_module.rb +42 -0
- data/lib/bioroebe/gui/shared_code/calculate_cell_numbers_of_bacteria/calculate_cell_numbers_of_bacteria_module.rb +216 -0
- data/lib/bioroebe/gui/shared_code/dna_to_reverse_complement_widget/dna_to_reverse_complement_widget_module.rb +284 -0
- data/lib/bioroebe/gui/shared_code/levensthein_distance/levensthein_distance_module.rb +402 -0
- data/lib/bioroebe/gui/shared_code/protein_to_DNA/protein_to_DNA_module.rb +192 -0
- data/lib/bioroebe/gui/shared_code/show_codon_table/show_codon_table_module.rb +72 -0
- data/lib/bioroebe/gui/shared_code/show_codon_usage/show_codon_usage_module.rb +213 -0
- data/lib/bioroebe/gui/tk/aminoacid_composition/aminoacid_composition.rb +206 -0
- data/lib/bioroebe/gui/tk/blosum_matrix_viewer/blosum_matrix_viewer.rb +140 -0
- data/lib/bioroebe/gui/tk/hamming_distance/hamming_distance.rb +262 -0
- data/lib/bioroebe/gui/tk/levensthein_distance/levensthein_distance.rb +243 -0
- data/lib/bioroebe/gui/tk/three_to_one/three_to_one.rb +199 -0
- data/lib/bioroebe/gui/unified_widgets/anti_sense_strand/anti_sense_strand.rb +519 -0
- data/lib/bioroebe/images/BIOROEBE.png +0 -0
- data/lib/bioroebe/images/BIOROEBE_NEW_LOGO.png +0 -0
- data/lib/bioroebe/images/BlosumMatrixViewer.png +0 -0
- data/lib/bioroebe/images/DnaToAminoacidWidget.png +0 -0
- data/lib/bioroebe/images/PRINTED_AMINOACID_TABLE.png +0 -0
- data/lib/bioroebe/images/class_ConvertAminoacidToDNA.png +0 -0
- data/lib/bioroebe/images/class_SimpleStringComparer.png +0 -0
- data/lib/bioroebe/images/example_of_FASTA_coloured_output.png +0 -0
- data/lib/bioroebe/images/libui_hamming_distance_widget.png +0 -0
- data/lib/bioroebe/images/pretty_DNA_picture.png +0 -0
- data/lib/bioroebe/images/primer_design_widget.png +0 -0
- data/lib/bioroebe/images/restriction_enzyme_commandline_result.png +0 -0
- data/lib/bioroebe/images/ruby-gtk_three_to_one_widget.png +0 -0
- data/lib/bioroebe/images/small_DNA_logo.png +0 -0
- data/lib/bioroebe/images/small_drosophila_image.png +0 -0
- data/lib/bioroebe/java/README.md +6 -0
- data/lib/bioroebe/java/bioroebe/AllInOne.class +0 -0
- data/lib/bioroebe/java/bioroebe/AllInOne.java +214 -0
- data/lib/bioroebe/java/bioroebe/Base.class +0 -0
- data/lib/bioroebe/java/bioroebe/Base.java +102 -0
- data/lib/bioroebe/java/bioroebe/BisulfiteTreatment.class +0 -0
- data/lib/bioroebe/java/bioroebe/BisulfiteTreatment.java +23 -0
- data/lib/bioroebe/java/bioroebe/Cat.class +0 -0
- data/lib/bioroebe/java/bioroebe/Codons.class +0 -0
- data/lib/bioroebe/java/bioroebe/Codons.java +22 -0
- data/lib/bioroebe/java/bioroebe/Esystem.class +0 -0
- data/lib/bioroebe/java/bioroebe/Esystem.java +47 -0
- data/lib/bioroebe/java/bioroebe/GUI/BaseFrame.class +0 -0
- data/lib/bioroebe/java/bioroebe/GUI/BaseFrame.java +65 -0
- data/lib/bioroebe/java/bioroebe/GenerateRandomDnaSequence.class +0 -0
- data/lib/bioroebe/java/bioroebe/GenerateRandomDnaSequence.java +32 -0
- data/lib/bioroebe/java/bioroebe/IsPalindrome.class +0 -0
- data/lib/bioroebe/java/bioroebe/IsPalindrome.java +18 -0
- data/lib/bioroebe/java/bioroebe/PartnerNucleotide.class +0 -0
- data/lib/bioroebe/java/bioroebe/PartnerNucleotide.java +19 -0
- data/lib/bioroebe/java/bioroebe/README.md +4 -0
- data/lib/bioroebe/java/bioroebe/RemoveFile.class +0 -0
- data/lib/bioroebe/java/bioroebe/RemoveFile.java +24 -0
- data/lib/bioroebe/java/bioroebe/RemoveNumbers.class +0 -0
- data/lib/bioroebe/java/bioroebe/RemoveNumbers.java +14 -0
- data/lib/bioroebe/java/bioroebe/SanitizeNucleotideSequence.class +0 -0
- data/lib/bioroebe/java/bioroebe/SanitizeNucleotideSequence.java +20 -0
- data/lib/bioroebe/java/bioroebe/SaveFile.java +44 -0
- data/lib/bioroebe/java/bioroebe/Sequence.java +28 -0
- data/lib/bioroebe/java/bioroebe/ToCamelcase.class +0 -0
- data/lib/bioroebe/java/bioroebe/ToCamelcase.java +32 -0
- data/lib/bioroebe/java/bioroebe/ToplevelMethods.class +0 -0
- data/lib/bioroebe/java/bioroebe/ToplevelMethods.java +15 -0
- data/lib/bioroebe/java/bioroebe/enums/DNA.java +6 -0
- data/lib/bioroebe/java/bioroebe.jar +0 -0
- data/lib/bioroebe/matplotlib/matplotlib_generator.rb +104 -0
- data/lib/bioroebe/misc/quiz/README.md +6 -0
- data/lib/bioroebe/misc/quiz/three_letter_to_aminoacid.rb +163 -0
- data/lib/bioroebe/misc/ruler.rb +244 -0
- data/lib/bioroebe/misc/useful_formulas.rb +129 -0
- data/lib/bioroebe/ncbi/efetch.rb +253 -0
- data/lib/bioroebe/ncbi/ncbi.rb +93 -0
- data/lib/bioroebe/ngs/README.md +2 -0
- data/lib/bioroebe/ngs/phred_quality_score_table.rb +123 -0
- data/lib/bioroebe/nucleotides/complementary_dna_strand.rb +166 -0
- data/lib/bioroebe/nucleotides/molecular_weight_of_nucleotides.rb +135 -0
- data/lib/bioroebe/nucleotides/most_likely_nucleotide_sequence_for_this_aminoacid_sequence.rb +198 -0
- data/lib/bioroebe/nucleotides/sanitize_nucleotide_sequence.rb +133 -0
- data/lib/bioroebe/nucleotides/show_nucleotide_sequence.rb +556 -0
- data/lib/bioroebe/palindromes/palindrome_2D_structure.rb +139 -0
- data/lib/bioroebe/palindromes/palindrome_finder.rb +208 -0
- data/lib/bioroebe/palindromes/palindrome_generator.rb +272 -0
- data/lib/bioroebe/parsers/biolang_parser.rb +156 -0
- data/lib/bioroebe/parsers/blosum_parser.rb +222 -0
- data/lib/bioroebe/parsers/genbank_parser.rb +78 -0
- data/lib/bioroebe/parsers/gff.rb +346 -0
- data/lib/bioroebe/parsers/parse_embl.rb +76 -0
- data/lib/bioroebe/parsers/stride_parser.rb +117 -0
- data/lib/bioroebe/patterns/README.md +5 -0
- data/lib/bioroebe/patterns/analyse_glycosylation_pattern.rb +149 -0
- data/lib/bioroebe/patterns/is_this_sequence_a_EGF2_pattern.rb +66 -0
- data/lib/bioroebe/patterns/profile_pattern.rb +182 -0
- data/lib/bioroebe/patterns/rgg_scanner.rb +160 -0
- data/lib/bioroebe/patterns/scan_for_repeat.rb +157 -0
- data/lib/bioroebe/pdb/download_this_pdb.rb +67 -0
- data/lib/bioroebe/pdb/fetch_fasta_sequence_from_pdb.rb +164 -0
- data/lib/bioroebe/pdb/parse_mmCIF_file.rb +63 -0
- data/lib/bioroebe/pdb/parse_pdb_file.rb +1086 -0
- data/lib/bioroebe/pdb/report_secondary_structures_from_this_pdb_file.rb +225 -0
- data/lib/bioroebe/perl/README.md +7 -0
- data/lib/bioroebe/perl/local_to_global.pl +694 -0
- data/lib/bioroebe/project/project.rb +264 -0
- data/lib/bioroebe/protein_structure/alpha_helix.rb +96 -0
- data/lib/bioroebe/protein_structure/helical_wheel.rb +205 -0
- data/lib/bioroebe/raw_sequence/README.md +17 -0
- data/lib/bioroebe/raw_sequence/raw_sequence.rb +557 -0
- data/lib/bioroebe/readline/README.md +2 -0
- data/lib/bioroebe/readline/readline.rb +31 -0
- data/lib/bioroebe/regexes/README.md +2 -0
- data/lib/bioroebe/regexes/regexes.rb +34 -0
- data/lib/bioroebe/requires/commandline_application.rb +5 -0
- data/lib/bioroebe/requires/require_all_aminoacids_files.rb +28 -0
- data/lib/bioroebe/requires/require_all_calculate_files.rb +26 -0
- data/lib/bioroebe/requires/require_all_codon_files.rb +26 -0
- data/lib/bioroebe/requires/require_all_colour_scheme_files.rb +26 -0
- data/lib/bioroebe/requires/require_all_count_files.rb +26 -0
- data/lib/bioroebe/requires/require_all_dotplot_files.rb +28 -0
- data/lib/bioroebe/requires/require_all_electron_microscopy_files.rb +26 -0
- data/lib/bioroebe/requires/require_all_enzymes_files.rb +28 -0
- data/lib/bioroebe/requires/require_all_fasta_and_fastq_files.rb +32 -0
- data/lib/bioroebe/requires/require_all_nucleotides_files.rb +28 -0
- data/lib/bioroebe/requires/require_all_palindromes_files.rb +29 -0
- data/lib/bioroebe/requires/require_all_parser_files.rb +28 -0
- data/lib/bioroebe/requires/require_all_pattern_files.rb +29 -0
- data/lib/bioroebe/requires/require_all_pdb_files.rb +26 -0
- data/lib/bioroebe/requires/require_all_sequence_files.rb +26 -0
- data/lib/bioroebe/requires/require_all_string_matching_files.rb +28 -0
- data/lib/bioroebe/requires/require_all_svg_files.rb +12 -0
- data/lib/bioroebe/requires/require_all_taxonomy_files.rb +35 -0
- data/lib/bioroebe/requires/require_all_utility_scripts_files.rb +32 -0
- data/lib/bioroebe/requires/require_cleave_and_digest.rb +24 -0
- data/lib/bioroebe/requires/require_colours.rb +20 -0
- data/lib/bioroebe/requires/require_encoding.rb +7 -0
- data/lib/bioroebe/requires/require_sequence.rb +7 -0
- data/lib/bioroebe/requires/require_the_bioroebe_project.rb +162 -0
- data/lib/bioroebe/requires/require_the_bioroebe_shell.rb +7 -0
- data/lib/bioroebe/requires/require_the_bioroebe_sinatra_components.rb +7 -0
- data/lib/bioroebe/requires/require_the_constants.rb +23 -0
- data/lib/bioroebe/requires/require_the_toplevel_methods.rb +29 -0
- data/lib/bioroebe/requires/require_yaml.rb +94 -0
- data/lib/bioroebe/sequence/alignment.rb +214 -0
- data/lib/bioroebe/sequence/dna.rb +211 -0
- data/lib/bioroebe/sequence/nucleotide_module/nucleotide_module.rb +404 -0
- data/lib/bioroebe/sequence/protein.rb +281 -0
- data/lib/bioroebe/sequence/reverse_complement.rb +148 -0
- data/lib/bioroebe/sequence/sequence.rb +706 -0
- data/lib/bioroebe/shell/add.rb +108 -0
- data/lib/bioroebe/shell/assign.rb +360 -0
- data/lib/bioroebe/shell/chop_and_cut.rb +281 -0
- data/lib/bioroebe/shell/colours/colours.rb +235 -0
- data/lib/bioroebe/shell/configuration/additionally_set_xorg_buffer.yml +1 -0
- data/lib/bioroebe/shell/configuration/may_we_show_the_startup_information.yml +1 -0
- data/lib/bioroebe/shell/configuration/upcase_nucleotides.yml +1 -0
- data/lib/bioroebe/shell/configuration/use_silent_startup.yml +1 -0
- data/lib/bioroebe/shell/constants.rb +166 -0
- data/lib/bioroebe/shell/download.rb +335 -0
- data/lib/bioroebe/shell/enable_and_disable.rb +158 -0
- data/lib/bioroebe/shell/enzymes.rb +310 -0
- data/lib/bioroebe/shell/fasta.rb +345 -0
- data/lib/bioroebe/shell/gtk.rb +76 -0
- data/lib/bioroebe/shell/help/class.rb +443 -0
- data/lib/bioroebe/shell/help/help.rb +25 -0
- data/lib/bioroebe/shell/history.rb +132 -0
- data/lib/bioroebe/shell/initialize.rb +217 -0
- data/lib/bioroebe/shell/loop.rb +74 -0
- data/lib/bioroebe/shell/menu.rb +5320 -0
- data/lib/bioroebe/shell/misc.rb +4341 -0
- data/lib/bioroebe/shell/prompt.rb +107 -0
- data/lib/bioroebe/shell/random.rb +289 -0
- data/lib/bioroebe/shell/readline/readline.rb +91 -0
- data/lib/bioroebe/shell/reset.rb +335 -0
- data/lib/bioroebe/shell/scan_and_parse.rb +135 -0
- data/lib/bioroebe/shell/search.rb +337 -0
- data/lib/bioroebe/shell/sequences.rb +200 -0
- data/lib/bioroebe/shell/shell.rb +41 -0
- data/lib/bioroebe/shell/show_report_and_display.rb +2901 -0
- data/lib/bioroebe/shell/startup.rb +127 -0
- data/lib/bioroebe/shell/taxonomy.rb +14 -0
- data/lib/bioroebe/shell/tk.rb +23 -0
- data/lib/bioroebe/shell/user_input.rb +88 -0
- data/lib/bioroebe/shell/xorg.rb +45 -0
- data/lib/bioroebe/siRNA/README.md +2 -0
- data/lib/bioroebe/siRNA/siRNA.rb +93 -0
- data/lib/bioroebe/string_matching/README.md +13 -0
- data/lib/bioroebe/string_matching/find_longest_substring.rb +162 -0
- data/lib/bioroebe/string_matching/find_longest_substring_via_LCS_algorithm.rb +175 -0
- data/lib/bioroebe/string_matching/hamming_distance.rb +313 -0
- data/lib/bioroebe/string_matching/levensthein.rb +698 -0
- data/lib/bioroebe/string_matching/simple_string_comparer.rb +294 -0
- data/lib/bioroebe/string_matching/smith_waterman.rb +276 -0
- data/lib/bioroebe/svg/README.md +1 -0
- data/lib/bioroebe/svg/glyph.rb +719 -0
- data/lib/bioroebe/svg/mini_feature.rb +111 -0
- data/lib/bioroebe/svg/page.rb +570 -0
- data/lib/bioroebe/svg/primitive.rb +70 -0
- data/lib/bioroebe/svg/svgee.rb +326 -0
- data/lib/bioroebe/svg/track.rb +263 -0
- data/lib/bioroebe/taxonomy/README.md +1 -0
- data/lib/bioroebe/taxonomy/chart.rb +95 -0
- data/lib/bioroebe/taxonomy/class_methods.rb +181 -0
- data/lib/bioroebe/taxonomy/colours.rb +26 -0
- data/lib/bioroebe/taxonomy/constants.rb +218 -0
- data/lib/bioroebe/taxonomy/edit.rb +97 -0
- data/lib/bioroebe/taxonomy/help/help.rb +65 -0
- data/lib/bioroebe/taxonomy/help/helpline.rb +53 -0
- data/lib/bioroebe/taxonomy/info/check_available.rb +143 -0
- data/lib/bioroebe/taxonomy/info/info.rb +337 -0
- data/lib/bioroebe/taxonomy/info/is_dna.rb +150 -0
- data/lib/bioroebe/taxonomy/interactive.rb +1933 -0
- data/lib/bioroebe/taxonomy/menu.rb +905 -0
- data/lib/bioroebe/taxonomy/node.rb +118 -0
- data/lib/bioroebe/taxonomy/parse_fasta.rb +383 -0
- data/lib/bioroebe/taxonomy/shared.rb +287 -0
- data/lib/bioroebe/taxonomy/taxonomy.rb +521 -0
- data/lib/bioroebe/toplevel_methods/ad_hoc_task.rb +56 -0
- data/lib/bioroebe/toplevel_methods/aminoacids_and_proteins.rb +715 -0
- data/lib/bioroebe/toplevel_methods/atomic_composition.rb +198 -0
- data/lib/bioroebe/toplevel_methods/base_composition.rb +121 -0
- data/lib/bioroebe/toplevel_methods/blast.rb +153 -0
- data/lib/bioroebe/toplevel_methods/calculate_n50_value.rb +57 -0
- data/lib/bioroebe/toplevel_methods/cat.rb +71 -0
- data/lib/bioroebe/toplevel_methods/chunked_display.rb +92 -0
- data/lib/bioroebe/toplevel_methods/cliner.rb +81 -0
- data/lib/bioroebe/toplevel_methods/complement.rb +58 -0
- data/lib/bioroebe/toplevel_methods/convert_global_env.rb +39 -0
- data/lib/bioroebe/toplevel_methods/databases.rb +73 -0
- data/lib/bioroebe/toplevel_methods/delimiter.rb +19 -0
- data/lib/bioroebe/toplevel_methods/digest.rb +71 -0
- data/lib/bioroebe/toplevel_methods/download_and_fetch_data.rb +146 -0
- data/lib/bioroebe/toplevel_methods/e.rb +20 -0
- data/lib/bioroebe/toplevel_methods/editor.rb +21 -0
- data/lib/bioroebe/toplevel_methods/esystem.rb +22 -0
- data/lib/bioroebe/toplevel_methods/exponential_growth.rb +74 -0
- data/lib/bioroebe/toplevel_methods/extract.rb +56 -0
- data/lib/bioroebe/toplevel_methods/fasta_and_fastq.rb +353 -0
- data/lib/bioroebe/toplevel_methods/file_and_directory_related_actions.rb +257 -0
- data/lib/bioroebe/toplevel_methods/frequencies.rb +92 -0
- data/lib/bioroebe/toplevel_methods/hamming_distance.rb +60 -0
- data/lib/bioroebe/toplevel_methods/infer.rb +66 -0
- data/lib/bioroebe/toplevel_methods/is_on_roebe.rb +39 -0
- data/lib/bioroebe/toplevel_methods/leading_five_prime_and_trailing_three_prime.rb +101 -0
- data/lib/bioroebe/toplevel_methods/levensthein.rb +63 -0
- data/lib/bioroebe/toplevel_methods/log_directory.rb +109 -0
- data/lib/bioroebe/toplevel_methods/longest_common_substring.rb +55 -0
- data/lib/bioroebe/toplevel_methods/map_ncbi_entry_to_eutils_id.rb +88 -0
- data/lib/bioroebe/toplevel_methods/matches.rb +259 -0
- data/lib/bioroebe/toplevel_methods/misc.rb +432 -0
- data/lib/bioroebe/toplevel_methods/nucleotides.rb +715 -0
- data/lib/bioroebe/toplevel_methods/number_of_clones.rb +63 -0
- data/lib/bioroebe/toplevel_methods/open_in_browser.rb +77 -0
- data/lib/bioroebe/toplevel_methods/open_reading_frames.rb +236 -0
- data/lib/bioroebe/toplevel_methods/opn.rb +34 -0
- data/lib/bioroebe/toplevel_methods/palindromes.rb +127 -0
- data/lib/bioroebe/toplevel_methods/parse.rb +59 -0
- data/lib/bioroebe/toplevel_methods/phred_error_probability.rb +68 -0
- data/lib/bioroebe/toplevel_methods/rds.rb +24 -0
- data/lib/bioroebe/toplevel_methods/remove.rb +86 -0
- data/lib/bioroebe/toplevel_methods/return_source_code_of_this_method.rb +35 -0
- data/lib/bioroebe/toplevel_methods/return_subsequence_based_on_indices.rb +68 -0
- data/lib/bioroebe/toplevel_methods/rna_splicing.rb +73 -0
- data/lib/bioroebe/toplevel_methods/rnalfold.rb +69 -0
- data/lib/bioroebe/toplevel_methods/searching_and_finding.rb +116 -0
- data/lib/bioroebe/toplevel_methods/shuffleseq.rb +37 -0
- data/lib/bioroebe/toplevel_methods/statistics.rb +53 -0
- data/lib/bioroebe/toplevel_methods/sum_of_odd_integers.rb +62 -0
- data/lib/bioroebe/toplevel_methods/taxonomy.rb +187 -0
- data/lib/bioroebe/toplevel_methods/three_delimiter.rb +34 -0
- data/lib/bioroebe/toplevel_methods/time_and_date.rb +53 -0
- data/lib/bioroebe/toplevel_methods/to_camelcase.rb +26 -0
- data/lib/bioroebe/toplevel_methods/truncate.rb +48 -0
- data/lib/bioroebe/toplevel_methods/url.rb +36 -0
- data/lib/bioroebe/toplevel_methods/verbose.rb +59 -0
- data/lib/bioroebe/utility_scripts/align_open_reading_frames.rb +191 -0
- data/lib/bioroebe/utility_scripts/analyse_local_dataset.rb +119 -0
- data/lib/bioroebe/utility_scripts/check_for_mismatches/check_for_mismatches.rb +230 -0
- data/lib/bioroebe/utility_scripts/compacter.rb +131 -0
- data/lib/bioroebe/utility_scripts/compseq/compseq.rb +529 -0
- data/lib/bioroebe/utility_scripts/consensus_sequence.rb +374 -0
- data/lib/bioroebe/utility_scripts/create_batch_entrez_file.rb +130 -0
- data/lib/bioroebe/utility_scripts/determine_antigenic_areas.rb +115 -0
- data/lib/bioroebe/utility_scripts/determine_missing_nucleotides_percentage.rb +137 -0
- data/lib/bioroebe/utility_scripts/display_open_reading_frames/determine.rb +73 -0
- data/lib/bioroebe/utility_scripts/display_open_reading_frames/display_open_reading_frames.rb +31 -0
- data/lib/bioroebe/utility_scripts/display_open_reading_frames/initialize.rb +37 -0
- data/lib/bioroebe/utility_scripts/display_open_reading_frames/menu.rb +49 -0
- data/lib/bioroebe/utility_scripts/display_open_reading_frames/misc.rb +471 -0
- data/lib/bioroebe/utility_scripts/display_open_reading_frames/report.rb +113 -0
- data/lib/bioroebe/utility_scripts/display_open_reading_frames/reset.rb +56 -0
- data/lib/bioroebe/utility_scripts/dot_alignment.rb +177 -0
- data/lib/bioroebe/utility_scripts/download_files_from_rebase.rb +72 -0
- data/lib/bioroebe/utility_scripts/find_gene.rb +202 -0
- data/lib/bioroebe/utility_scripts/mirror_repeat.rb +235 -0
- data/lib/bioroebe/utility_scripts/move_file_to_its_correct_location.rb +151 -0
- data/lib/bioroebe/utility_scripts/parse_taxonomy.rb +168 -0
- data/lib/bioroebe/utility_scripts/pathways.rb +152 -0
- data/lib/bioroebe/utility_scripts/permutations.rb +145 -0
- data/lib/bioroebe/utility_scripts/punnet/punnet.rb +126 -0
- data/lib/bioroebe/utility_scripts/show_this_dna_sequence.rb +148 -0
- data/lib/bioroebe/utility_scripts/showorf/constants.rb +36 -0
- data/lib/bioroebe/utility_scripts/showorf/help.rb +33 -0
- data/lib/bioroebe/utility_scripts/showorf/initialize.rb +52 -0
- data/lib/bioroebe/utility_scripts/showorf/menu.rb +68 -0
- data/lib/bioroebe/utility_scripts/showorf/reset.rb +39 -0
- data/lib/bioroebe/utility_scripts/showorf/run.rb +152 -0
- data/lib/bioroebe/utility_scripts/showorf/show.rb +97 -0
- data/lib/bioroebe/utility_scripts/showorf/showorf.rb +488 -0
- data/lib/bioroebe/version/version.rb +44 -0
- data/lib/bioroebe/viennarna/README.md +3 -0
- data/lib/bioroebe/viennarna/rnafold_wrapper.rb +196 -0
- data/lib/bioroebe/with_gui.rb +18 -0
- data/lib/bioroebe/www/bioroebe.cgi +44 -0
- data/lib/bioroebe/www/embeddable_interface.rb +686 -0
- data/lib/bioroebe/www/sinatra/sinatra.rb +1013 -0
- data/lib/bioroebe/yaml/agarose/agarose_concentrations.yml +21 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids.yml +92 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_abbreviations.yml +31 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_average_mass_table.yml +33 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_classification.yml +18 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_english.yml +26 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_frequency.yml +44 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_long_name_to_one_letter.yml +61 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_molecular_formula.yml +32 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_monoisotopic_mass_table.yml +38 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_reste.yml +35 -0
- data/lib/bioroebe/yaml/aminoacids/amino_acids_three_to_one.yml +34 -0
- data/lib/bioroebe/yaml/aminoacids/hydropathy_table.yml +44 -0
- data/lib/bioroebe/yaml/aminoacids/molecular_weight.yml +29 -0
- data/lib/bioroebe/yaml/aminoacids/simple_aminoacids.yml +66 -0
- data/lib/bioroebe/yaml/aminoacids/weight_of_common_proteins.yml +33 -0
- data/lib/bioroebe/yaml/antisense/antisense.yml +9 -0
- data/lib/bioroebe/yaml/base_composition_of_dna.yml +37 -0
- data/lib/bioroebe/yaml/blosum/blosum45.yml +36 -0
- data/lib/bioroebe/yaml/blosum/blosum50.yml +34 -0
- data/lib/bioroebe/yaml/blosum/blosum62.yml +35 -0
- data/lib/bioroebe/yaml/blosum/blosum80.yml +37 -0
- data/lib/bioroebe/yaml/blosum/blosum90.yml +36 -0
- data/lib/bioroebe/yaml/blosum/blosum_matrix.yml +200 -0
- data/lib/bioroebe/yaml/chromosomes/chromosome_numbers.yml +30 -0
- data/lib/bioroebe/yaml/codon_tables/1.yml +113 -0
- data/lib/bioroebe/yaml/codon_tables/10.yml +89 -0
- data/lib/bioroebe/yaml/codon_tables/11.yml +91 -0
- data/lib/bioroebe/yaml/codon_tables/12.yml +89 -0
- data/lib/bioroebe/yaml/codon_tables/13.yml +89 -0
- data/lib/bioroebe/yaml/codon_tables/14.yml +89 -0
- data/lib/bioroebe/yaml/codon_tables/15.yml +94 -0
- data/lib/bioroebe/yaml/codon_tables/16.yml +89 -0
- data/lib/bioroebe/yaml/codon_tables/2.yml +86 -0
- data/lib/bioroebe/yaml/codon_tables/21.yml +89 -0
- data/lib/bioroebe/yaml/codon_tables/22.yml +89 -0
- data/lib/bioroebe/yaml/codon_tables/23.yml +91 -0
- data/lib/bioroebe/yaml/codon_tables/24.yml +89 -0
- data/lib/bioroebe/yaml/codon_tables/25.yml +89 -0
- data/lib/bioroebe/yaml/codon_tables/26.yml +96 -0
- data/lib/bioroebe/yaml/codon_tables/27.yml +104 -0
- data/lib/bioroebe/yaml/codon_tables/28.yml +97 -0
- data/lib/bioroebe/yaml/codon_tables/29.yml +93 -0
- data/lib/bioroebe/yaml/codon_tables/3.yml +98 -0
- data/lib/bioroebe/yaml/codon_tables/30.yml +91 -0
- data/lib/bioroebe/yaml/codon_tables/31.yml +94 -0
- data/lib/bioroebe/yaml/codon_tables/33.yml +101 -0
- data/lib/bioroebe/yaml/codon_tables/4.yml +96 -0
- data/lib/bioroebe/yaml/codon_tables/5.yml +100 -0
- data/lib/bioroebe/yaml/codon_tables/6.yml +96 -0
- data/lib/bioroebe/yaml/codon_tables/9.yml +97 -0
- data/lib/bioroebe/yaml/codon_tables/overview.yml +42 -0
- data/lib/bioroebe/yaml/configuration/README.md +12 -0
- data/lib/bioroebe/yaml/configuration/browser.yml +1 -0
- data/lib/bioroebe/yaml/configuration/colourize_fasta_sequences.yml +14 -0
- data/lib/bioroebe/yaml/configuration/default_colours_for_the_aminoacids.yml +28 -0
- data/lib/bioroebe/yaml/configuration/temp_dir.yml +1 -0
- data/lib/bioroebe/yaml/configuration/try_to_use_matplotlib.yml +1 -0
- data/lib/bioroebe/yaml/configuration/use_opn.yml +1 -0
- data/lib/bioroebe/yaml/configuration/use_this_database.yml +1 -0
- data/lib/bioroebe/yaml/create_these_directories_on_startup/create_these_directories_on_startup.yml +9 -0
- data/lib/bioroebe/yaml/default_dna_input.yml +3 -0
- data/lib/bioroebe/yaml/enzymes/enzyme_classes.yml +15 -0
- data/lib/bioroebe/yaml/enzymes/pH-Optima.yml +11 -0
- data/lib/bioroebe/yaml/fasta_and_fastq/fastq_quality_schemes.yml +44 -0
- data/lib/bioroebe/yaml/genomes/README.md +16 -0
- data/lib/bioroebe/yaml/humans/README.md +2 -0
- data/lib/bioroebe/yaml/humans/human_chromosomes.yml +53 -0
- data/lib/bioroebe/yaml/laboratory/README.md +1 -0
- data/lib/bioroebe/yaml/laboratory/pipettes.yml +8 -0
- data/lib/bioroebe/yaml/mRNA/mRNA.yml +16 -0
- data/lib/bioroebe/yaml/nuclear_localization_sequences.yml +15 -0
- data/lib/bioroebe/yaml/nucleotides/abbreviations_for_nucleotides.yml +29 -0
- data/lib/bioroebe/yaml/nucleotides/nucleotide_density.yml +10 -0
- data/lib/bioroebe/yaml/nucleotides/nucleotides.yml +34 -0
- data/lib/bioroebe/yaml/nucleotides/nucleotides_weight.yml +12 -0
- data/lib/bioroebe/yaml/pathways/README.md +2 -0
- data/lib/bioroebe/yaml/pathways/citric_acid_cycle.yml +16 -0
- data/lib/bioroebe/yaml/pathways/glycolysis.yml +20 -0
- data/lib/bioroebe/yaml/pathways/shikimate_pathway.yml +23 -0
- data/lib/bioroebe/yaml/pathways/urea_cycle.yml +11 -0
- data/lib/bioroebe/yaml/primers/README.md +4 -0
- data/lib/bioroebe/yaml/primers/primers.yml +3 -0
- data/lib/bioroebe/yaml/promoters/35S.yml +15 -0
- data/lib/bioroebe/yaml/promoters/strong_promoters.yml +24 -0
- data/lib/bioroebe/yaml/proteases/proteases.yml +3 -0
- data/lib/bioroebe/yaml/proteins/ubiquitin.yml +4 -0
- data/lib/bioroebe/yaml/remote_urls/README.md +4 -0
- data/lib/bioroebe/yaml/remote_urls/remote_urls.yml +3 -0
- data/lib/bioroebe/yaml/restriction_enzymes/restriction_enzymes.yml +630 -0
- data/lib/bioroebe/yaml/sequences/JX472995_Green_fluorescent_protein_from_Aequorea_victoria.fasta +14 -0
- data/lib/bioroebe/yaml/sequences/README.md +2 -0
- data/lib/bioroebe/yaml/talens.yml +22 -0
- data/lib/bioroebe/yaml/viruses/ecoli_phages.yml +63 -0
- data/lib/bioroebe/yaml/viruses/viruses.yml +6 -0
- data/lib/bioroebe.rb +5 -0
- data/spec/testing_toplevel_method_editor.rb +20 -0
- data/spec/testing_toplevel_method_url.rb +15 -0
- data/spec/testing_toplevel_method_verbose.rb +13 -0
- data/test/advanced_svg_example.rb +307 -0
- data/test/testing_bioroebe.rb +25 -0
- data/test/testing_codons.rb +45 -0
- data/test/testing_dna_to_rna_conversion.rb +15 -0
- data/test/testing_parse_pdb_file.rb +23 -0
- data/test/testing_reverse_complement.rb +32 -0
- data/test/testing_svg_component_of_bioroebe.rb +311 -0
- data/test/testing_svg_component_of_bioroebe_from_json_dataset.rb +34 -0
- data/test/testing_taxonomy.rb +22 -0
- metadata +1059 -0
data/doc/blosum.md
ADDED
@@ -0,0 +1,37 @@
|
|
1
|
+
One goal of the Bioroebe project is to offer everything that BioPHP
|
2
|
+
offers as well, in ruby.
|
3
|
+
|
4
|
+
BioPHP can be found here:
|
5
|
+
|
6
|
+
http://www.biophp.org/
|
7
|
+
|
8
|
+
This file here (BIO_PHP.md) keeps track of this ongoing endeavour.
|
9
|
+
|
10
|
+
Not all functionality will be ported, though. For example,
|
11
|
+
the "PCR Amplification" functionality of biophp is utterly
|
12
|
+
useless.
|
13
|
+
|
14
|
+
First, some more relevant links:
|
15
|
+
|
16
|
+
http://www.biophp.org/resources.php?mode=functions
|
17
|
+
|
18
|
+
Status of the rewrites:
|
19
|
+
|
20
|
+
Restriction digest of DNA | | http://www.biophp.org/minitools/restriction_digest/demo.php
|
21
|
+
Find Palindromic Sequences | [IMPLEMENTED FULLY]
|
22
|
+
Protein to DNA | [IMPLEMENTED FULLY]
|
23
|
+
Sequence manipulation and data | [not implemented yet]
|
24
|
+
Melting Temperature (Tm) Calculator | [not implemented yet]
|
25
|
+
PCR Amplification | [will not be implemented] | http://www.biophp.org/minitools/pcr_amplification/
|
26
|
+
Microsatellite Repeats Finder | [not implemented yet]
|
27
|
+
Alignment of DNA/Protein sequences | http://www.biophp.org/functions/alignment_of_dna/
|
28
|
+
DNA to protein | [not implemented yet]
|
29
|
+
Microarray analysis: adaptive quantification | [not implemented yet]
|
30
|
+
Protein sequence information | [not implemented yet]
|
31
|
+
Reduced alphabets for proteins | [not implemented yet]
|
32
|
+
Chaos Game Representation | [not implemented yet]
|
33
|
+
GC-, AT-, KETO- and oligo-skews generator | [not implemented yet]
|
34
|
+
Oligonucleotide Frequency | [not implemented yet]
|
35
|
+
Oligonucleotides for distance among sequences | [not implemented yet]
|
36
|
+
Random DNA | http://www.biophp.org/functions/random_dna/
|
37
|
+
Useful formulas |
|
@@ -0,0 +1,56 @@
|
|
1
|
+
This file will keep track of which functionality of EMBOSS is either
|
2
|
+
fully implemented, partially implemented or not yet implemented.
|
3
|
+
|
4
|
+
Implemented here means "available as part of the Bioroebe" project -
|
5
|
+
either via some webinterface or as its own standalone file.
|
6
|
+
|
7
|
+
Overview:
|
8
|
+
|
9
|
+
aligncopy |
|
10
|
+
aligncopypair |
|
11
|
+
biosed |
|
12
|
+
cgplot | https://www.bioinformatics.nl/cgi-bin/emboss/cpgplot
|
13
|
+
codcopy |
|
14
|
+
cutseq | [90% IMPLEMENTED; missing output format possibilities]
|
15
|
+
degapseq |
|
16
|
+
descseq |
|
17
|
+
entret |
|
18
|
+
extractalig |
|
19
|
+
extractfeat |
|
20
|
+
extractseq |
|
21
|
+
featcopy |
|
22
|
+
featmerge |
|
23
|
+
featreport |
|
24
|
+
feattext |
|
25
|
+
listor | https://www.bioinformatics.nl/cgi-bin/emboss/help/listor
|
26
|
+
makenucseq |
|
27
|
+
makeprotseq |
|
28
|
+
maskambignuc |
|
29
|
+
maskambigprot |
|
30
|
+
maskfeat |
|
31
|
+
maskseq |
|
32
|
+
newseq |
|
33
|
+
nohtml |
|
34
|
+
noreturn | [Will not be re-implemented, as it is not necessary.]
|
35
|
+
nospace |
|
36
|
+
notab |
|
37
|
+
notseq |
|
38
|
+
nthseq |
|
39
|
+
nthseqset |
|
40
|
+
pasteseq |
|
41
|
+
revseq |
|
42
|
+
seqcount | [Will not be re-implemented, as it is not necessary.]
|
43
|
+
seqret |
|
44
|
+
seqretsetall |
|
45
|
+
seqretsplit |
|
46
|
+
sizeseq |
|
47
|
+
skipredundant |
|
48
|
+
skipseq | https://www.bioinformatics.nl/cgi-bin/emboss/skipseq
|
49
|
+
splitsource |
|
50
|
+
splitter |
|
51
|
+
trimest |
|
52
|
+
trimseq | Implemented in the file bioroebe/utility_scripts/compacter.rb.
|
53
|
+
trimspace |
|
54
|
+
union |
|
55
|
+
vectorstrip |
|
56
|
+
yank |
|
@@ -0,0 +1,35 @@
|
|
1
|
+
This file here attempts to note down what should be possible and doable
|
2
|
+
within the Bioroebe main sequence class.
|
3
|
+
|
4
|
+
It is closely modelled towards this file here from the bioruby project:
|
5
|
+
|
6
|
+
https://raw.githubusercontent.com/bioruby/bioruby/master/doc/Tutorial.rd
|
7
|
+
|
8
|
+
There will not be a lot of explanation going on here in this file - the
|
9
|
+
whole purpose of this file is to simply show what is possible with the
|
10
|
+
Bioroebe sequence, in a succinct manner.
|
11
|
+
|
12
|
+
For a more extensive explanation, either consider looking at the online
|
13
|
+
documentation for the various methods/classes, or simply have a look at
|
14
|
+
the .md file (README.md) that is distributed with the bioroebe-gem
|
15
|
+
as such.
|
16
|
+
|
17
|
+
require 'bioroebe'
|
18
|
+
|
19
|
+
seq = Bioroebe::Sequence.new("ATGCATGCAAAA")
|
20
|
+
seq.complement # => "TTTTGCATGCAT"
|
21
|
+
seq.subseq(3,8) # => "GCATGC"
|
22
|
+
seq.composition # => {"A"=>6, "T"=>2, "G"=>2, "C"=>2}
|
23
|
+
seq.gc_percent # => 33
|
24
|
+
seq.translate # => "MHAK"
|
25
|
+
seq.translate(2) # => "CMQ" # This translates from frame 2
|
26
|
+
seq.translate(:frame_two) # => "CMQ" # Alias to the above.
|
27
|
+
seq.translate(:from_frame_two) # => "CMQ" # Alias to the above.
|
28
|
+
seq.translate.names # => ["methionine", "histidine", "alanine", "lysine"]
|
29
|
+
seq.translate.codes # => ["Met", "His", "Ala", "Lys"]
|
30
|
+
seq.translate(1,11) # codon table 11
|
31
|
+
seq.translate(2) # translate from frame 2 ==> "CMQ"
|
32
|
+
seq.translate(1,11) # codon table 11 ==> "MHAK"
|
33
|
+
seq.translate.composition # => {"K"=>1, "A"=>1, "M"=>1, "H"=>1}
|
34
|
+
seq.translate.molecular_weight # => 485.605
|
35
|
+
seq.complement.translate # => "FCMH"
|
@@ -0,0 +1,27 @@
|
|
1
|
+
# =========================================================================== #
|
2
|
+
# === German names for the aminoacids
|
3
|
+
#
|
4
|
+
# This file just carries the german names for the specific aminoacid
|
5
|
+
# residues.
|
6
|
+
# =========================================================================== #
|
7
|
+
|
8
|
+
Alanin: -CH3
|
9
|
+
Arginin: -CH2CH2CH2NH-C(NH)NH2
|
10
|
+
Asparagin: -CH2CONH2
|
11
|
+
Asparaginsäure: -CH2COOH
|
12
|
+
Cystein: -CH2SH
|
13
|
+
Glutamin: -CH2CH2CONH2
|
14
|
+
Glutaminsäure: -CH2CH2COOH
|
15
|
+
Glycin: -H
|
16
|
+
Histidin: -CH2(C3H3N2)
|
17
|
+
Isoleucin: -CH(CH3)CH2CH3
|
18
|
+
Leucin: -CH2CH(CH3)2
|
19
|
+
Lysin: -CH2CH2CH2CH2NH2
|
20
|
+
Methionin: -CH2CH2SCH3
|
21
|
+
Phenylalanin: -CH2(C6H5)
|
22
|
+
Prolin: -CH2CH2CH2-
|
23
|
+
Serin: -CH2OH
|
24
|
+
Threonin: -CH(OH)CH3
|
25
|
+
Tryptophan: -CH2(C8H6N)
|
26
|
+
Tyrosin: -CH2(C6H4)OH
|
27
|
+
Valin: -CH(CH3)2
|
@@ -0,0 +1,504 @@
|
|
1
|
+
=== ABOUT
|
2
|
+
|
3
|
+
This document refers to the taxonomy/ subdirectory of the bioroebe
|
4
|
+
project.
|
5
|
+
|
6
|
+
The Taxonomy module is written in ruby and loosely built around the
|
7
|
+
official NCBI Taxonomy Database, which is available via ftp at
|
8
|
+
the following URL:
|
9
|
+
|
10
|
+
ftp://ftp.ncbi.nih.gov/pub/taxonomy/taxdump.tar.gz
|
11
|
+
|
12
|
+
=== INSTALLATION
|
13
|
+
|
14
|
+
The taxonomy-related code is distributed with the bioroebe project,
|
15
|
+
so if you install the latter then you will also have the Taxonomy
|
16
|
+
submodule available.
|
17
|
+
|
18
|
+
The archetypical way to install bioroebe is through this way:
|
19
|
+
|
20
|
+
gem install bioroebe
|
21
|
+
|
22
|
+
Sometimes you may not have server-wide access or it may be unwanted
|
23
|
+
to install it globally.
|
24
|
+
|
25
|
+
In this case, you can install bioroebe into the home directory by
|
26
|
+
using the --user-install comandline switch.
|
27
|
+
|
28
|
+
Example:
|
29
|
+
|
30
|
+
gem install --user-install bioroebe
|
31
|
+
|
32
|
+
If you have a local .gem then you can also install this, e. g.
|
33
|
+
via:
|
34
|
+
|
35
|
+
gem install --user-install ./*.gem
|
36
|
+
|
37
|
+
The Taxonomy module itself can be used primarily for the following
|
38
|
+
three main tasks:
|
39
|
+
|
40
|
+
(1)
|
41
|
+
|
42
|
+
- Update the remote NCBI Taxonomy database into a local database:
|
43
|
+
|
44
|
+
The URL that will be used for this main task can be found here:
|
45
|
+
|
46
|
+
ftp://ftp.ncbi.nih.gov/pub/taxonomy/taxdump.tar.gz
|
47
|
+
|
48
|
+
This URL is assigned to the constant called REMOTE_URL, in
|
49
|
+
the file constants/constants.rb of the Taxonomy module.
|
50
|
+
|
51
|
+
You can obtain the file path of this file inside the interactive
|
52
|
+
module by issuing "shared_code?" as a command, without the quotes,
|
53
|
+
or from the very commandline itself, via:
|
54
|
+
|
55
|
+
taxonomy shared_code?
|
56
|
+
taxonomy scode? # Shortcuts are also possible.
|
57
|
+
|
58
|
+
To update the NCBI database (which will download the remote
|
59
|
+
taxonomy database), issue this command:
|
60
|
+
|
61
|
+
update ncbi_database
|
62
|
+
|
63
|
+
Also see the rest of the file for a more thorough explanation.
|
64
|
+
|
65
|
+
(2)
|
66
|
+
|
67
|
+
- Generate and Populate a table that holds information from(2)
|
68
|
+
the /data/curated/* directory subset, and subsequently,
|
69
|
+
from localome/ as well.
|
70
|
+
|
71
|
+
This is the dataset that derives from local datasets.
|
72
|
+
The directory that will fetch this dataset is derived from
|
73
|
+
incoming/.
|
74
|
+
|
75
|
+
(3)
|
76
|
+
|
77
|
+
- Query data from the dataset interactively, via the file
|
78
|
+
interactive.rb, but also from the commandline directly.
|
79
|
+
|
80
|
+
Note Bene: all commands that work from the file interactive.rb
|
81
|
+
will also work from the commandline.
|
82
|
+
|
83
|
+
The following examples will demonstrate how all of this can be
|
84
|
+
done - some more options are available, though not all of them
|
85
|
+
are extensively documented and commented; see the case menu in
|
86
|
+
the file interactive.rb for a really comprehensive and complete
|
87
|
+
list.
|
88
|
+
|
89
|
+
Without further ado, let's start:
|
90
|
+
|
91
|
+
(1) In order to update the (local mirror of the) NCBI Taxonomy,
|
92
|
+
'update ncbi' as instruction to use should suffice:
|
93
|
+
|
94
|
+
update ncbi
|
95
|
+
update_ncbi # <-- This works as well, if you dislike the ' '
|
96
|
+
|
97
|
+
This instruction will attempt to download the Taxonomy database,
|
98
|
+
extract the archive, generate the necessary .sql files and
|
99
|
+
then connect to postgresql, so that it can then add this
|
100
|
+
information into the postgresql database.
|
101
|
+
|
102
|
+
For this to work, a connection to the database should be
|
103
|
+
established prior to running this command.
|
104
|
+
|
105
|
+
Note that the login information is stored in the file
|
106
|
+
postgresql_login_command.rb, so if at any moment in time
|
107
|
+
the information how to access the postgreSQL database gets
|
108
|
+
changed, that file can be changed as well.
|
109
|
+
|
110
|
+
The location of that file can be queried via the instruction:
|
111
|
+
'login_information?'
|
112
|
+
so:
|
113
|
+
|
114
|
+
taxonomy login_information?
|
115
|
+
|
116
|
+
And if you want to modify it in vim, simply do: 'edit login'.
|
117
|
+
|
118
|
+
taxonomy edit login
|
119
|
+
taxonomy edit_login
|
120
|
+
|
121
|
+
At the time of writing this file, the default location mirror
|
122
|
+
for the download of the NCBI Taxonomy database was:
|
123
|
+
|
124
|
+
/tmp/robert/Downloads
|
125
|
+
|
126
|
+
This of course can be changed, also in shared/shared.rb,
|
127
|
+
see the constant DOWNLOAD_DIR. Simply set it to another directory
|
128
|
+
if you wish to change the download location.
|
129
|
+
|
130
|
+
Also note that generating the .sql files and populating the
|
131
|
+
postgresql table from that information, may take about an
|
132
|
+
hour. This could possibly be optimized by using COPY rather
|
133
|
+
than INSERT INTO statements, but it seemed better to not
|
134
|
+
spend too much time into researching how to optimize and
|
135
|
+
instead see that more features are made possible in general.
|
136
|
+
|
137
|
+
(2) Generating and populating the second table, the one that holds all
|
138
|
+
the information from the "localomes", can most conveniently
|
139
|
+
be done by issuing either 'default' or simply by doing
|
140
|
+
'update localome'.
|
141
|
+
|
142
|
+
This requires a working postgresql database at present as well.
|
143
|
+
|
144
|
+
* Note that the above two commands can be combined by issuing
|
145
|
+
'update all' or 'update everything', but this will also
|
146
|
+
update (overwrite) the localome table, so 'update ncbi'
|
147
|
+
is usually better. (If we want to build up the localome
|
148
|
+
table anew anyway, we can safely use "update localome", which
|
149
|
+
will take about 24 minutes.)
|
150
|
+
|
151
|
+
(3) The third part, the query subsection, is the largest part of the
|
152
|
+
Taxonomy module. There are many aliases to the main instructions
|
153
|
+
to have - for instance, rather than having to type 'update', 'upd'
|
154
|
+
would also suffice, and so it is for many other commands as well;
|
155
|
+
most of them have shorter aliases.
|
156
|
+
|
157
|
+
To start the interactive shell, you can either invoke the file
|
158
|
+
interactive.rb directly, or you just call the 'taxonomy'
|
159
|
+
file like so:
|
160
|
+
|
161
|
+
taxonomy interactive
|
162
|
+
|
163
|
+
Of course you can also use 'taxonomy' without having to use the
|
164
|
+
interactive component, i.e. from the commandline directly.
|
165
|
+
|
166
|
+
Once inside the interactive shell, typing 'help' will give
|
167
|
+
some information about the subcommands available.
|
168
|
+
|
169
|
+
Now let's look at a few specific examples:
|
170
|
+
|
171
|
+
taxid 33 # Obtain the Organism that has taxID 33, which is
|
172
|
+
# Myxococcus fulvus. This queries the NCBI dataset.
|
173
|
+
|
174
|
+
scientific_name 33 # This will attempt to fetch the scientific
|
175
|
+
# name of an organism. "sname" is a shorter
|
176
|
+
# alias.
|
177
|
+
|
178
|
+
parent_id 33 # In order to obtain the ParentID, use
|
179
|
+
# this command. In our case, it gives us
|
180
|
+
# back 32, so we now know that the parentID
|
181
|
+
# of 33 is 32, so 33 -> 32.
|
182
|
+
|
183
|
+
last_update? # This tells us when we last updated the NCI
|
184
|
+
# database. The constant SHALL_WE_LOG_LAST_UPDATE
|
185
|
+
# determines whether we will log - set it to false
|
186
|
+
# if you don't need logging.
|
187
|
+
|
188
|
+
The update() or lupdate() method can be used to specifically
|
189
|
+
update a database. For instance:
|
190
|
+
|
191
|
+
update ncbi # Update the NCBI database.
|
192
|
+
update localomes # Update the database from localomes.
|
193
|
+
update lineage # Update the lineage - this is only required after
|
194
|
+
# the NCBI database was downloaded (at the weekend).
|
195
|
+
|
196
|
+
update all # This will perform both updates, first ncbi,
|
197
|
+
# then localomes. Also note that an update
|
198
|
+
# in this way will drop all tables, which
|
199
|
+
# is ok for the NCBI database, but possibly
|
200
|
+
# unwanted for the localomes database. In general,
|
201
|
+
# it is recommended to do only "update ncbi".
|
202
|
+
|
203
|
+
available? # Query which species are available in the localomes table,
|
204
|
+
# in total. This is quite similar to doing this:
|
205
|
+
# name,taxid,path
|
206
|
+
|
207
|
+
size? # This instruction queries the size of the 2 NCBI
|
208
|
+
# tables, names and nodes, and the localomes (should
|
209
|
+
# be less than 200 MB in total). "table_size?" is
|
210
|
+
# a more explicit alias to this. In order for this
|
211
|
+
# to correctly work, these entries must exist of
|
212
|
+
# course.
|
213
|
+
|
214
|
+
Helpful queries can be done like so:
|
215
|
+
|
216
|
+
query localome # Queries how many entries are in the localome.
|
217
|
+
# "localome?" or just "loc?" is an alias to this.
|
218
|
+
|
219
|
+
More specific queries can be simplified by separating with a ',' character,
|
220
|
+
and will default to the localome table, such as inputting the following
|
221
|
+
as examples:
|
222
|
+
|
223
|
+
taxid,name # Show taxid,name relation from the localome table.
|
224
|
+
name,taxid # As above, but in reverse order - name first, then taxid.
|
225
|
+
name,taxid,path # As above, but also includes the local path.
|
226
|
+
name,path,taxid # As above but a different order.
|
227
|
+
taxid,name,path,filesize # Similar to the above but also gives us the filesize.
|
228
|
+
taxid,comment_field # Show the taxid and the comment_field
|
229
|
+
name,taxid,comment_field
|
230
|
+
name,taxid,comment_field,filesize
|
231
|
+
taxid,lineage_ids # Display taxid and lineage_ids.
|
232
|
+
taxid,lineage_ids,path
|
233
|
+
|
234
|
+
There are ten entries in the localome table, so you can combine these
|
235
|
+
ten in any way to yield whatever information you desire.
|
236
|
+
|
237
|
+
To get all comments, do:
|
238
|
+
|
239
|
+
comments?
|
240
|
+
|
241
|
+
Searching can be done like this:
|
242
|
+
|
243
|
+
search_by_name Blastocystis hominis # This would search for Blastocystis
|
244
|
+
# hominis in the names table.
|
245
|
+
search_in_localomes Blastocystis hominis # As above, but we search
|
246
|
+
# in localomes rather than
|
247
|
+
# in NCBI. "slocalomes" is a
|
248
|
+
# shorter alias to this.
|
249
|
+
|
250
|
+
To sort the localome table by modtime, we can do this:
|
251
|
+
|
252
|
+
modtime
|
253
|
+
|
254
|
+
To feedback the paths stored in the localome table, do this:
|
255
|
+
|
256
|
+
paths?
|
257
|
+
|
258
|
+
If you want to modify the path manually, you can use either set_path
|
259
|
+
or path (both are the same). We require the ID to update the path.
|
260
|
+
|
261
|
+
For example:
|
262
|
+
|
263
|
+
set_path TAXONOMY_ID_HERE new_path_goes_here
|
264
|
+
set_path 45157 /data/curated/sequences/localome/proteome/Cyanidioschyzon_merolae_pep.fa
|
265
|
+
|
266
|
+
Keep in mind that this can all be done through the commandline as well:
|
267
|
+
|
268
|
+
taxonomy set_path 45157 /data/curated/sequences/localome/proteome/Cyanidioschyzon_merolae_pep.fa
|
269
|
+
|
270
|
+
If we do not want to see any colours, we can disable them via:
|
271
|
+
|
272
|
+
nocolours # There may be some warnings about redefines taking place,
|
273
|
+
# but they are harmless. They happen because some methods
|
274
|
+
# get redefined.
|
275
|
+
|
276
|
+
If we don't want to see the help hint after every hit to the return-key,
|
277
|
+
do:
|
278
|
+
|
279
|
+
nohelp
|
280
|
+
|
281
|
+
If we wish to see the main entries in the nodes and names tables
|
282
|
+
from the NCBI Taxonomy, we can do this:
|
283
|
+
|
284
|
+
nodes?
|
285
|
+
names?
|
286
|
+
|
287
|
+
The above two commands will fetch 10 random entries from the
|
288
|
+
two tables, which should aid a little bit in ensuring that the
|
289
|
+
dataset is correct.
|
290
|
+
|
291
|
+
If we wish to find out which Eukaryota and which Prokaryota are
|
292
|
+
available in the localome table (the name of that table is "fasta"
|
293
|
+
at the time of writing this file here), we can do this:
|
294
|
+
|
295
|
+
eukarya
|
296
|
+
prokarya
|
297
|
+
|
298
|
+
Note that this, by default, won't give us the path, so in order
|
299
|
+
to find out the local path as well, we need to issue this command:
|
300
|
+
|
301
|
+
eukarya path
|
302
|
+
prokarya path # There were no entries from prokarya at the time
|
303
|
+
# of writing this file here though.
|
304
|
+
|
305
|
+
If we wish to change the path of a localome entry, we must know
|
306
|
+
the TaxID, then we can do something like this here:
|
307
|
+
|
308
|
+
path 1257118 /resources/seqdata/curated/sequences/localome/proteomes/Acanthamoeba_castellanii_Neff_pep.fa
|
309
|
+
|
310
|
+
The first argument is the mandatory TaxID, the second entry is the
|
311
|
+
new path that we wish to assign. Note that if you need to modify
|
312
|
+
several paths, you can also load up a file instead:
|
313
|
+
|
314
|
+
path name_of_file_here
|
315
|
+
|
316
|
+
This file should be in CSV format with two entries - to the left
|
317
|
+
the TaxID, to the right the new path, separated by | or \t (tabulator),
|
318
|
+
for example:
|
319
|
+
|
320
|
+
1257118 | /new/path/Acanthamoeba_castellanii_Neff_pep.fa
|
321
|
+
8128 | /new/path/Bla_ble.fa
|
322
|
+
|
323
|
+
To eliminate an entry from the localomes, do this:
|
324
|
+
|
325
|
+
remove 1257118
|
326
|
+
|
327
|
+
The argument to this must be the Taxonomic ID for now. Obviously
|
328
|
+
you should be sure that you want to remove this entry.
|
329
|
+
|
330
|
+
If you wish to load a dataset into the localome database, we
|
331
|
+
need the .INFO file. Then, do this:
|
332
|
+
|
333
|
+
load_from_info foo.INFO
|
334
|
+
|
335
|
+
The argument must be the path to the .INFO file in question.
|
336
|
+
Alternatively, simply provide the .INFO file here. If as
|
337
|
+
part of the location we can find the string 'incoming/', we
|
338
|
+
will assume that you want to load it into the table.
|
339
|
+
|
340
|
+
To show the lineage directly from the localome table, we can do this:
|
341
|
+
|
342
|
+
show_lineage_from_localome_table 2762
|
343
|
+
|
344
|
+
* Note that the TaxID must exist before we can query it (remember,
|
345
|
+
you can do: "taxid,names" to quickly fetch the table data).
|
346
|
+
|
347
|
+
If you wish to find out the amount of .INFO files available, do
|
348
|
+
this here:
|
349
|
+
|
350
|
+
n_info?
|
351
|
+
|
352
|
+
This will look at the directory /data/curated/sequences/INFO/ and
|
353
|
+
the directory /resources/seqdata/curated/sequences/localome/incoming.
|
354
|
+
|
355
|
+
You can parse an info file by doing:
|
356
|
+
|
357
|
+
info foo.INFO
|
358
|
+
|
359
|
+
or by simply putting the full location of the .INFO file
|
360
|
+
in the interactive shell.
|
361
|
+
|
362
|
+
Note that the code that handles parsing of .INFO files will also
|
363
|
+
attempt to locate a corresponding .fa entry. It will first check
|
364
|
+
for the local directory before checking the directory
|
365
|
+
/data/curated/sequences/aa/ or /data/curated/sequences/nt/
|
366
|
+
|
367
|
+
To parse a fasta file, we can do this:
|
368
|
+
|
369
|
+
pfasta foo.fa
|
370
|
+
|
371
|
+
or by simply putting the full location of the .fa file in the interactive
|
372
|
+
shell. The argument to this should be the fasta file in question of course.
|
373
|
+
|
374
|
+
If you quickly wish to obtain the taxtree in a tree display-like
|
375
|
+
fashion, try these two ways:
|
376
|
+
|
377
|
+
taxtree 77166
|
378
|
+
taxtree 106583
|
379
|
+
|
380
|
+
If we wish to update the lineage IDs and lineage scientific names in
|
381
|
+
the localome table, we can do this by issuing the following command:
|
382
|
+
|
383
|
+
update lineage
|
384
|
+
|
385
|
+
This will only update these two entries.
|
386
|
+
|
387
|
+
If you need to show the password, do:
|
388
|
+
|
389
|
+
password?
|
390
|
+
|
391
|
+
If you need to modify the password, try:
|
392
|
+
|
393
|
+
edit password
|
394
|
+
|
395
|
+
This will open vim at the file position.
|
396
|
+
|
397
|
+
The password is also defined in the postgresql_login_command.rb file,
|
398
|
+
which you can simply open via: "edit login"
|
399
|
+
|
400
|
+
If you need to display the port we will use, do any of these:
|
401
|
+
|
402
|
+
show_port
|
403
|
+
port?
|
404
|
+
--port
|
405
|
+
|
406
|
+
== Commandline Usage (no interactive shell)
|
407
|
+
|
408
|
+
In principle, all that can be done in the interactive shell, can also
|
409
|
+
be done from the commandline alone. The main "binary" (it is just a
|
410
|
+
text file of course, but resides in the bin/ subdirectory) to
|
411
|
+
invoke is the file 'taxonomy':
|
412
|
+
|
413
|
+
bin/taxonomy
|
414
|
+
taxonomy
|
415
|
+
|
416
|
+
Here are some example for how to use it in cron scripts, with arguments -
|
417
|
+
of course all of the above commands should also work as well, but we
|
418
|
+
assume that in the interactive mode we need to feedback something
|
419
|
+
to the user, whereas in a cron-like task we don't need to give any output
|
420
|
+
to the user at all:
|
421
|
+
|
422
|
+
taxonomy update ncbi # Download the remote NCBI database and generate
|
423
|
+
# a new nodes and names table.
|
424
|
+
|
425
|
+
taxonomy update lineage # Just as the above, we will update the lineage
|
426
|
+
# in the localome table (the fasta table).
|
427
|
+
|
428
|
+
== Aliases
|
429
|
+
|
430
|
+
In general, for most available actions in the main menu of the Taxonomy
|
431
|
+
module, aliases exist. So the subsection "query", for instance, can be
|
432
|
+
shortened to "qu". In case of ambiguity, we try to decide which component
|
433
|
+
is more important from the point of view of the Taxonomy module, so there
|
434
|
+
should always be a clear "winner" in these cases.
|
435
|
+
|
436
|
+
== Readline support
|
437
|
+
|
438
|
+
If available, the Taxonomy module will make use of GNU Readline, which
|
439
|
+
basically means that the <TAB> key can be used for autocompletion and
|
440
|
+
that the cursor arrow keys (up and down) will work as well for the
|
441
|
+
input history, just as it works in shells like bash or zsh.
|
442
|
+
|
443
|
+
== Dependencies
|
444
|
+
|
445
|
+
The Taxonomy module depends on a few ruby gems (addons), which can
|
446
|
+
be queried in the interactive taxonomy shell via:
|
447
|
+
|
448
|
+
dependencies?
|
449
|
+
|
450
|
+
== Packaging, Licensing and Redistribution
|
451
|
+
|
452
|
+
If you need to package the taxonomy module for any reason, you can do
|
453
|
+
so via the interactive menu by issuing:
|
454
|
+
|
455
|
+
gem
|
456
|
+
|
457
|
+
or
|
458
|
+
|
459
|
+
make_gem
|
460
|
+
|
461
|
+
This will try to build a .gem file and tell you the full location
|
462
|
+
to this file as well. This .gem file can be used and redistributed.
|
463
|
+
(A .gem file is Ruby's way to distribute addons, similar to perl's
|
464
|
+
CPAN.)
|
465
|
+
|
466
|
+
Permission is explicitly granted to allow the I.M.P. team to
|
467
|
+
modify all code parts that were produced specifically for the
|
468
|
+
Taxonomy module in any way they see fit - in other words, the full
|
469
|
+
content of this .gem archive is licensed to the I.M.P, in particular
|
470
|
+
to Alex Schleiffer. This should allow modifications to everything
|
471
|
+
that belongs to the Taxonomy module at any moment of time.
|
472
|
+
|
473
|
+
The .gem file would allow installation by doing something like this:
|
474
|
+
|
475
|
+
gem install taxonomy.gem # global installation
|
476
|
+
|
477
|
+
or into the user home directory via:
|
478
|
+
|
479
|
+
gem install --user-install taxonomy.gem
|
480
|
+
|
481
|
+
== Shebang line
|
482
|
+
|
483
|
+
At the ray server, the Shebang line to ruby appears to be this line:
|
484
|
+
|
485
|
+
#!/biosw/debian5-x86_64/ruby/1.9.3/bin/ruby -w
|
486
|
+
|
487
|
+
On another server this may have to be adjusted, or one may have to call
|
488
|
+
the "env" binary to find the proper location for ruby. But this of course
|
489
|
+
works the same as if one would have Shebang point to perl - it all comes
|
490
|
+
down to the real location of the binary in question. (On localomics,
|
491
|
+
another ruby version was installed by default by Petair, so another
|
492
|
+
Shebang would have to be used - also, /biosw seems to not exist on
|
493
|
+
localomics; I assume this is a completely different machine with virtual
|
494
|
+
and different slices assigned to particular tasks).
|
495
|
+
|
496
|
+
== TEMP_DIR: the temporary directory
|
497
|
+
|
498
|
+
Downloads will happen to the temporary directory, stored in the constant
|
499
|
+
TEMP_DIR, which defaults to /tmp/robert at the time of writing this file.
|
500
|
+
|
501
|
+
This is defined in the file shared/shared.rb. If that directory does not
|
502
|
+
exist, we will attempt to create it. This is important because we will
|
503
|
+
download data and generate files into that directory, so it must exist
|
504
|
+
for the module to work.
|
data/doc/legacy_paths.md
ADDED
@@ -0,0 +1,9 @@
|
|
1
|
+
This small file here just is a reminder of the paths that were in
|
2
|
+
use during February 2014 for the Taxonomy project when it was still
|
3
|
+
a standalone project:
|
4
|
+
|
5
|
+
/biosw/debian5_x86_64/
|
6
|
+
/biosw/debian5_x86_64/postgresql/8.4.16/bin
|
7
|
+
|
8
|
+
One day this file here may be removed, but for the time being it will
|
9
|
+
remain as it is.
|