RubyGems - bioroebe - Versions diffs - 0.10.80 → 0.11.24 - Mend

bioroebe 0.10.80 → 0.11.24

Potentially problematic release.

This version of bioroebe might be problematic. Click here for more details.

Files changed (129) hide show

checksums.yaml +4 -4
data/README.md +1204 -772
data/bioroebe.gemspec +3 -3
data/doc/README.gen +1203 -771
data/doc/todo/bioroebe_todo.md +391 -365
data/lib/bioroebe/aminoacids/aminoacid_substitution.rb +1 -9
data/lib/bioroebe/aminoacids/codon_percentage.rb +1 -9
data/lib/bioroebe/aminoacids/deduce_aminoacid_sequence.rb +1 -9
data/lib/bioroebe/aminoacids/display_aminoacid_table.rb +1 -0
data/lib/bioroebe/aminoacids/show_hydrophobicity.rb +1 -6
data/lib/bioroebe/base/colours_for_base/colours_for_base.rb +18 -8
data/lib/bioroebe/base/commandline_application/commandline_arguments.rb +13 -11
data/lib/bioroebe/base/commandline_application/misc.rb +18 -8
data/lib/bioroebe/base/misc.rb +16 -0
data/lib/bioroebe/base/prototype/misc.rb +1 -1
data/lib/bioroebe/codons/show_codon_tables.rb +6 -2
data/lib/bioroebe/codons/show_codon_usage.rb +2 -1
data/lib/bioroebe/constants/aminoacids_and_proteins.rb +1 -0
data/lib/bioroebe/constants/database_constants.rb +1 -1
data/lib/bioroebe/constants/files_and_directories.rb +20 -1
data/lib/bioroebe/constants/misc.rb +20 -0
data/lib/bioroebe/count/count_amount_of_nucleotides.rb +3 -0
data/lib/bioroebe/crystal/README.md +2 -0
data/lib/bioroebe/crystal/to_rna.cr +19 -0
data/lib/bioroebe/data/README.md +11 -8
data/lib/bioroebe/data/electron_microscopy/pos_example.pos +396 -0
data/lib/bioroebe/data/electron_microscopy/test_particles.star +36 -0
data/lib/bioroebe/{shell/tk.rb → electron_microscopy/electron_microscopy_module.rb} +15 -10
data/lib/bioroebe/electron_microscopy/simple_star_file_generator.rb +4 -9
data/lib/bioroebe/fasta_and_fastq/show_fasta_headers.rb +27 -12
data/lib/bioroebe/genome/README.md +4 -0
data/lib/bioroebe/genome/genome.rb +67 -0
data/lib/bioroebe/gui/gtk3/protein_to_DNA/protein_to_DNA.rb +18 -18
data/lib/bioroebe/gui/gtk3/random_sequence/random_sequence.rb +19 -11
data/lib/bioroebe/gui/shared_code/protein_to_DNA/protein_to_DNA_module.rb +14 -14
data/lib/bioroebe/misc/ruler.rb +1 -0
data/lib/bioroebe/parsers/genbank_parser.rb +353 -24
data/lib/bioroebe/parsers/gff.rb +1 -9
data/lib/bioroebe/pdb/parse_pdb_file.rb +1 -9
data/lib/bioroebe/project/project.rb +1 -1
data/lib/bioroebe/python/README.md +1 -0
data/lib/bioroebe/python/__pycache__/mymodule.cpython-39.pyc +0 -0
data/lib/bioroebe/python/gui/gtk3/all_in_one.css +4 -0
data/lib/bioroebe/python/gui/gtk3/all_in_one.py +59 -0
data/lib/bioroebe/python/gui/gtk3/widget1.py +20 -0
data/lib/bioroebe/python/gui/tkinter/all_in_one.py +91 -0
data/lib/bioroebe/python/mymodule.py +8 -0
data/lib/bioroebe/python/protein_to_dna.py +33 -0
data/lib/bioroebe/python/shell/shell.py +19 -0
data/lib/bioroebe/python/to_rna.py +14 -0
data/lib/bioroebe/python/toplevel_methods/open_in_browser.py +20 -0
data/lib/bioroebe/python/toplevel_methods/palindromes.py +42 -0
data/lib/bioroebe/python/toplevel_methods/rds.py +13 -0
data/lib/bioroebe/python/toplevel_methods/three_delimiter.py +34 -0
data/lib/bioroebe/python/toplevel_methods/time_and_date.py +43 -0
data/lib/bioroebe/python/toplevel_methods/to_camelcase.py +11 -0
data/lib/bioroebe/requires/require_the_bioroebe_project.rb +3 -1
data/lib/bioroebe/sequence/nucleotide_module/nucleotide_module.rb +28 -25
data/lib/bioroebe/sequence/protein.rb +105 -3
data/lib/bioroebe/sequence/sequence.rb +61 -2
data/lib/bioroebe/shell/menu.rb +3451 -3366
data/lib/bioroebe/shell/misc.rb +51 -4311
data/lib/bioroebe/shell/readline/readline.rb +1 -1
data/lib/bioroebe/shell/shell.rb +11192 -28
data/lib/bioroebe/siRNA/siRNA.rb +81 -1
data/lib/bioroebe/string_matching/find_longest_substring.rb +3 -2
data/lib/bioroebe/taxonomy/class_methods.rb +3 -8
data/lib/bioroebe/taxonomy/constants.rb +4 -3
data/lib/bioroebe/taxonomy/edit.rb +2 -1
data/lib/bioroebe/taxonomy/help/help.rb +10 -10
data/lib/bioroebe/taxonomy/info/check_available.rb +15 -9
data/lib/bioroebe/taxonomy/info/info.rb +17 -2
data/lib/bioroebe/taxonomy/info/is_dna.rb +46 -36
data/lib/bioroebe/taxonomy/interactive.rb +139 -95
data/lib/bioroebe/taxonomy/menu.rb +27 -18
data/lib/bioroebe/taxonomy/parse_fasta.rb +3 -1
data/lib/bioroebe/taxonomy/shared.rb +1 -0
data/lib/bioroebe/taxonomy/taxonomy.rb +1 -0
data/lib/bioroebe/toplevel_methods/aminoacids_and_proteins.rb +31 -24
data/lib/bioroebe/toplevel_methods/databases.rb +1 -1
data/lib/bioroebe/toplevel_methods/fasta_and_fastq.rb +101 -63
data/lib/bioroebe/toplevel_methods/misc.rb +17 -16
data/lib/bioroebe/toplevel_methods/nucleotides.rb +22 -5
data/lib/bioroebe/toplevel_methods/open_in_browser.rb +2 -0
data/lib/bioroebe/toplevel_methods/palindromes.rb +1 -2
data/lib/bioroebe/toplevel_methods/taxonomy.rb +2 -2
data/lib/bioroebe/toplevel_methods/to_camelcase.rb +5 -0
data/lib/bioroebe/utility_scripts/align_open_reading_frames.rb +1 -9
data/lib/bioroebe/utility_scripts/check_for_mismatches/check_for_mismatches.rb +1 -9
data/lib/bioroebe/utility_scripts/compacter.rb +1 -9
data/lib/bioroebe/utility_scripts/compseq/compseq.rb +1 -9
data/lib/bioroebe/utility_scripts/create_batch_entrez_file.rb +1 -9
data/lib/bioroebe/utility_scripts/dot_alignment.rb +1 -9
data/lib/bioroebe/utility_scripts/move_file_to_its_correct_location.rb +1 -4
data/lib/bioroebe/utility_scripts/showorf/constants.rb +0 -5
data/lib/bioroebe/utility_scripts/showorf/reset.rb +1 -4
data/lib/bioroebe/version/version.rb +2 -2
data/lib/bioroebe/www/embeddable_interface.rb +101 -52
data/lib/bioroebe/www/sinatra/sinatra.rb +186 -70
data/lib/bioroebe/yaml/aminoacids/amino_acids_long_name_to_one_letter.yml +2 -2
data/lib/bioroebe/yaml/configuration/browser.yml +1 -1
data/lib/bioroebe/yaml/genomes/README.md +3 -4
data/lib/bioroebe/yaml/restriction_enzymes/restriction_enzymes.yml +3 -3
metadata +32 -35
data/doc/setup.rb +0 -1655
data/lib/bioroebe/genbank/genbank_parser.rb +0 -291
data/lib/bioroebe/shell/add.rb +0 -108
data/lib/bioroebe/shell/assign.rb +0 -360
data/lib/bioroebe/shell/chop_and_cut.rb +0 -281
data/lib/bioroebe/shell/constants.rb +0 -166
data/lib/bioroebe/shell/download.rb +0 -335
data/lib/bioroebe/shell/enable_and_disable.rb +0 -158
data/lib/bioroebe/shell/enzymes.rb +0 -310
data/lib/bioroebe/shell/fasta.rb +0 -345
data/lib/bioroebe/shell/gtk.rb +0 -76
data/lib/bioroebe/shell/history.rb +0 -132
data/lib/bioroebe/shell/initialize.rb +0 -217
data/lib/bioroebe/shell/loop.rb +0 -74
data/lib/bioroebe/shell/prompt.rb +0 -107
data/lib/bioroebe/shell/random.rb +0 -289
data/lib/bioroebe/shell/reset.rb +0 -335
data/lib/bioroebe/shell/scan_and_parse.rb +0 -135
data/lib/bioroebe/shell/search.rb +0 -337
data/lib/bioroebe/shell/sequences.rb +0 -200
data/lib/bioroebe/shell/show_report_and_display.rb +0 -2901
data/lib/bioroebe/shell/startup.rb +0 -127
data/lib/bioroebe/shell/taxonomy.rb +0 -14
data/lib/bioroebe/shell/user_input.rb +0 -88
data/lib/bioroebe/shell/xorg.rb +0 -45

data/doc/todo/bioroebe_todo.md CHANGED Viewed

@@ -1,16 +1,124 @@
--------------------------------------------------------------------------------
-(1) → https://biopython.org/DIST/docs/tutorial/Tutorial.html#sec15
+------------------------------------------------------------------------------------------
+(1) → set_dna_sequence alu
+      ^^^ fetch random alu
+      ^^^ alu sequence
+          Ok we started this now adding more details, but we
+          need to become better at searching for this
+          sequence.
+------------------------------------------------------------------------------------------
+(2)  →  draw things based on GR
+------------------------------------------------------------------------------------------
+(3)  → https://mycocosm.jgi.doe.gov/help/screenshots/browser_viewer.png
+^^^ offer the same functionality
+------------------------------------------------------------------------------------------
+(4)  →  https://genome.cshlp.org/content/12/10/1611/F3.expansion.html
+^^^ enable this, we must obtain a sequence then store into genbank format
+so, first fetch; then store as-is.
+------------------------------------------------------------------------------------------
+(5)  →  be able to generate nice graphics
+https://genome.cshlp.org/content/12/10/1611/F1.large.jpg
+------------------------------------------------------------------------------------------
+(6)  →  add rmagicks wrappre, perhaps via imageparadise or something
+        the idea is that we can make fancy drawings and generate
+        an image for the end user to see
+------------------------------------------------------------------------------------------
+(7)  →  https://bioperl.org/howtos/Beginners_HOWTO.html#item13
+extend the sequence object and document it
+also add:
+class Genome
+and:
+  def is_circular?
+    @internal_hash[:is_circular]
+  end; alias circular? is_circular? # === circular?
+  def species?
+    @internal_hash[:species] # return the species here
+  end
+------------------------------------------------------------------------------------------
+(2) http://lib.ysu.am/open_books/312400.pdf
+clone:
+Primer.pl
+This program was written to support the required informatics for a sequencing
+lab. The desire was to quickly generate primer pair candidates for use in STS
+mapping. We use Bioperl modules to fetch the sequences from GenBank.
+#! /usr/bin/perl
+#
+# primers.pl
+#
+# Reads a list of
+% primers.pl AC013798
+AC013798
+Left Right Length Penalty
+CCTCCTGGACAACCTGTGTT TGAAGTCAGGGGACATAGGG 280 0.0823
+CCTCCTGGACAACCTGTGTT AGGCCAGTAGACTGGGTGTG 298 0.1758
+CCTCCTGGACAACCTGTGTT GGTGTGAAGTCAGGGGACAT 284 0.1852
+TTCCCGCATCTCTTAGCAGT AGGCCAGTAGACTGGGTGTG 209 0.1962
+CTTCCCGCATCTCTTAGCAG GACACTAGTGGCAAGGAGGC 226 0.2362
+Most of the primers.pl program is extremely simple. The real guts and power
+of the program lie in the classes and the methods we call. The next section
+examines the Primer3 module, which is similar to many Bioperl modules
+------------------------------------------------------------------------------------------
+(1)  →  Clone all of Emboss. :)
+     → Clone and document the getorf functionality properly.
+      See: http://emboss.sourceforge.net/apps/cvs/emboss/apps/getorf.html
+http://emboss.sourceforge.net
+http://emboss.sourceforge.net/apps/cvs/emboss/apps/getorf.html
+------------------------------------------------------------------------------------------
+(3) → Add useful formulas for bioshell.
+------------------------------------------------------------------------------------------
+(1)  → Polish the GUI sets:
+https://i.imgur.com/djElIMh.png
+------------------------------------------------------------------------------------------
+(4)  → The taxonomy part should be fully integrated, without it
+       being a standalone part anymore.
+       continue on the taxonomy stuff.
+       ne day this will work again *shake fist*
+------------------------------------------------------------------------------------------
+(1) → Show the frequency of codons in different tables
+  This works quite ok, but right now the approach is to store
+  this in a .yml file which is not ideal.
+  Thus, we have to add two things:
+    - The ability to store this into a SQL database
+    - The ability to batch-download all of these codons,
+      which first requires that we have a way to obtain all
+      taxonomic ids.
+Add where this can be found.
+IMPROVE THIS ALL!!!!!!!
+------------------------------------------------------------------------------------------
+(2) improve docu + tests for melting temperature analysis again
++ usage example + GUI + web-use
+------------------------------------------------------------------------------------------
+(3) → https://biopython.org/DIST/docs/tutorial/Tutorial.html#sec15
     ^^^ work through the above, also integrate it + write docs
 https://raw.githubusercontent.com/biopython/biopython/master/Doc/examples/ls_orchid.fasta
--------------------------------------------------------------------------------
-(2) → integrate electrno microscopy slowly and also add documentation
+------------------------------------------------------------------------------------------
+(4) → integrate electrno microscopy slowly and also add documentation
       about this AS YOU GO!!!!!
       ^^^ yup add more of it
--------------------------------------------------------------------------------
-(3) → Add save session support
+------------------------------------------------------------------------------------------
+(5) → Add save session support
       to reload our last activity completely ...
   hmmm..
   This has to be well designed...
@@ -27,9 +135,8 @@ https://raw.githubusercontent.com/biopython/biopython/master/Doc/examples/ls_orc
   upon startup of the bioroebe shell.
   This is in preparation for save-session support.
--------------------------------------------------------------------------------
-(5) → Lys-Asp-Glu-Leu
+------------------------------------------------------------------------------------------
+(6) → Lys-Asp-Glu-Leu
   if i.include?('-') and Bioroebe.is_in_the_three_letter_code?(i)
   end
@@ -47,11 +154,11 @@ https://raw.githubusercontent.com/biopython/biopython/master/Doc/examples/ls_orc
 ^^ yep this is also called KDEL
 https://en.wikipedia.org/wiki/KDEL_(amino_acid_sequence)
--------------------------------------------------------------------------------
-(6) → Add "orthologs". this shall show us the top 25 orthologs or
+------------------------------------------------------------------------------------------
+(7) → Add "orthologs". this shall show us the top 25 orthologs or
       something. In the bioshell? Hmm. Not sure yet.
--------------------------------------------------------------------------------
-(7) → clone the functionality of this:
+------------------------------------------------------------------------------------------
+(8) → clone the functionality of this:
     http://www.kazusa.or.jp/codon/cgi-bin/countcodon.cgi
     http://www.kazusa.or.jp/codon/countcodon.html
@@ -63,18 +170,18 @@ https://en.wikipedia.org/wiki/KDEL_(amino_acid_sequence)
   widget first. And sinatra output too.
 AND document it as well
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (8) → SARS genom analyisere in bioroebe
       eventuell auch graphisch
 Gibt es neue GUIs die wir kombinieren könnten? Hmmm.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (9) → In bioroebe, generate that .ps thingy graphical thing from the
       vienna RNA tutorial. Hmmm.
 https://www.tbi.univie.ac.at/RNA/tutorial/
--------------------------------------------------------------------------------
-(1) → get insulin squence frmo NCBI
+------------------------------------------------------------------------------------------
+(10) → get insulin squence frmo NCBI
         human
       then apply trypsin onto it
       and try it like this:
@@ -88,13 +195,13 @@ Also add:
 ^^^ to show it
 Hmm. Perhaps also auto-download or something.
--------------------------------------------------------------------------------
-(1) → in bioroebe: UAG?
+------------------------------------------------------------------------------------------
+(11) → in bioroebe: UAG?
       ^^^ show all stop codons with that in the bioshell
       all UAG sequences... hmm. and TAG?
       Finish that.
-..........................................................................
-(1) → The position of a symbol in a string is the total number of
+------------------------------------------------------------------------------------------
+(12) → The position of a symbol in a string is the total number of
       symbols found to its left, including itself (e.g., the positions
       of all occurrences of 'U' in "AUGCUUCAGAAAGGUCUUACG" are 2, 5,
       6, 15, 17, and 18). The symbol at position i
@@ -102,70 +209,70 @@ Hmm. Perhaps also auto-download or something.
   ^^^ add a solution there, a toplevel API
   !!!!!
--------------------------------------------------------------------------------
-(1) → http://bioruby.org/rdoc/Bio/Blast.html
+------------------------------------------------------------------------------------------
+(13) → http://bioruby.org/rdoc/Bio/Blast.html
       ^^^ add support for BLAST
-..........................................................................
-(1) → add: parse_pdb()
+------------------------------------------------------------------------------------------
+(14) → add: parse_pdb()
       With this we shall just show some info, about a given
       .pdb file at hand.
       Also make it commandline based too + bioshell variant
       here, and a sinatra interface once this all works.
       Don't forget to document it!!!!!
       ^^^ and google a bit how others do that
-..........................................................................
-(2) → pdb 1a6m
+------------------------------------------------------------------------------------------
+(15) → pdb 1a6m
       ^^^ download this when that is used in the bioshell; we also have
           to use the download directory for this, so make sure that
           we do.
       ^^^ And then, also document this clearly.
--------------------------------------------------------------------------------
-(3) show_string
+------------------------------------------------------------------------------------------
+(16) show_string
 ^^^ slowly port this ... find out differences
     then unify into one method. right now we used
     two or something.
--------------------------------------------------------------------------------
-(4) → Try to see if we can integrate this into our GUI:
+------------------------------------------------------------------------------------------
+(17) → Try to see if we can integrate this into our GUI:
   https://cdn.snapgene.com/assets/7.6.11/assets/images/snapgene/homepage/homepage-hero.png
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (5) → Scan for leucine zipper!
       This is ~25% implemented. We need to double-check what
       exactly is a leucine zipper.
-..........................................................................
+------------------------------------------------------------------------------------------
 (6) → Extend the sinatra-interface for the Rosalind task,
       perhaps add a sub-link to show which parts are solved
       as-is. Hmm. I am not continuing on this though.
 ^^^^
 well - make rosalind anew again or something.
-...........................................................................
+------------------------------------------------------------------------------------------
 (7) - Add a blast interface; both via the web-interface, GUI,
       and also from the commandline.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (8) - Write a tutorial about primer design.
       also make sure that the GUI has support for this.
-..........................................................................
+------------------------------------------------------------------------------------------
 (9) - In the documentation examples, show some exampls for how to work
       with different organisms.
-..........................................................................
+------------------------------------------------------------------------------------------
 (10) - In the bioshell, if "stop?" is issued, then the colouring isn't
        correct. It currently does not show any result. This has to
        be fixed.
-..........................................................................
+------------------------------------------------------------------------------------------
 (11)  →  https://www.rubydoc.info/gems/biomart
          ^^^ integrate biomart
          p biomart.list_datasets
          p biomart.datasets?
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (12) Add Trypsin und Trypsinogen sequences, both as FASTA
     but also as shortcut via the commandline such as:
       show_orf :trypsine
       show_orf :trypsin
     Or something like this; and document it as well.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (13) → 1..60
   setdna 57
@@ -177,12 +284,12 @@ well - make rosalind anew again or something.
   5' - ATGTGCAGTCAGGTGAATTTATTGAAAAATTTGAGGCTCCTGGTGGTGCAAATCAAAGAACTGCTCCTCAGTGGATGTTGCCTTTACTTCTAG - 3'
   ^^^ hier beim colourize, wenn das letzte codon ein STOP codon ist
        dann colourizen wir das auch.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (14) → MG1655
       ^^^ input this to download the sequence. Also show it to the user.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (15) → extend virus-information into the bioroebe project.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (16) → Add a way to analyse the chemical structure of all
       aminoacids. We wish to show the chemical formula.
@@ -196,22 +303,22 @@ well - make rosalind anew again or something.
        I don't understand why it removes H and 0 so perhaps
        dont remove that part. But still show the -R.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (17) FIX THE COLOURIZATION BUG; THIS ONE TRIGGERED THE WHOLE
     REWRITE AFTER ALL!
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (18) FIX TAXONOMY related-problems AS WELL
   ^^^^^^ AND DOCUMENT THIS related-problems.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (19) Do note that z will then be a String, not a sequence object anymore.
   (This may be subject to change in the future, but for now, aka
   **February 2020**, it is that way.)
 ^^^^
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (20) ^^^ colours are appended. That should not be the case!
         ADD SOMETHING NEW ... some todo entries
          and some python tool
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (21) → rewrite the whole project anew
       - improve the documentation
        - focus on class Protein first and add
@@ -219,10 +326,10 @@ well - make rosalind anew again or something.
          that, as well as:
            .backtrans
            .reverse_translate
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (22) → AND THEN test on windows as well.
       ^^^^^^^^^^^^^^
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (23) →
 Reduced alphabets for proteins                  | [not implemented yet]
 ^^^ check this as well
@@ -252,9 +359,9 @@ First focus on bioroebe.
 efetch "https://www.ncbi.nlm.nih.gov/gene/744779"
 ^^^ test this. again
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (25) fix tk-levensthein
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (26) → rewrite the whole project anew
       - improve the documentation
        - rework the WHOLE tutorial as well
@@ -263,13 +370,13 @@ efetch "https://www.ncbi.nlm.nih.gov/gene/744779"
          that
            .backtrans
            .reverse_translate
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (27) → analyze /Depot/Temp/Bioroebe/1CEZ.pdb
        ^^^
        support this. Already works half-way, we started writing a pdb parser.
        this should work in general, for .fasta files as well.
-..........................................................................
+------------------------------------------------------------------------------------------
 (28) → SINATRA STUFF:
       FIX AND EXTEND SINATRA IN BIOROEBE.
       extend it too.
@@ -281,7 +388,7 @@ efetch "https://www.ncbi.nlm.nih.gov/gene/744779"
       and special-dispaly on sinatra kaa
       where the nucleotide sequence has numbers
       ^^^
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (29) pick any virus and begin to amass tons of data; and then when done
     also connect this into a GUI for use therein.
@@ -302,7 +409,7 @@ https://www.ncbi.nlm.nih.gov/nuccore/NC_038391.1
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1)  → Fix:
        require 'bioroebe/toplevel_methods/open_reading_frames.rb'
@@ -310,10 +417,10 @@ https://www.ncbi.nlm.nih.gov/nuccore/NC_038391.1
        Something is wrong; it returns regions that contain
        a stop codon, which can not be true.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (3)  → Fix: extend glycovirology parts
        seek stuff in viral genomes
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (4)  →
        seq = Bio::Sequence::NA.new("atgcatgcaaaaaaa")
@@ -336,13 +443,13 @@ https://www.ncbi.nlm.nih.gov/nuccore/NC_038391.1
        seq = Bioroebe::Sequence.new("atgcatgcaaaaaaa")
        puts seq
        puts seq.complement
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (5)  →
      make sure we have a good fasta-showing widget
      show how many nucleotides are
      AND add support to modify this as-is
      ^^^^
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (6)  → In BioRoebe:
        Add a table showing how compatible bioroebe is compared to the other
@@ -352,7 +459,7 @@ https://www.ncbi.nlm.nih.gov/nuccore/NC_038391.1
        including Bio (ruby-bio) the main ruby project here.
        And add a table which functionality is implemented
        in Java already.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (7) →
 ********************************************************************************
 Was passiert wenn wir das Lambda-Genom mit EcoRI behandeln?
@@ -375,19 +482,19 @@ Bioroebe.digest_this_dna("/root/Bioroebe/fasta/NC_001416.1_Enterobacteria_phage_
 DNA.
 ^^^ this now works kind of ... but it must be better
     documented and we must test this with more data.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (8) → add the bioroebe logo to sinatra, but as appropriate size,
       via base64. perhaps width 50 or so. need to determine
       which size fits here.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (9) → Integrate http://nc2.neb.com/NEBcutter2/cutshow.php?name=ffe1d68e-
       in particular the visual part.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (10) → https://international.neb.com/products/r0196-ncii#Product%20Information
        ^^^ autogenerate such an image, aka restriction cutting enzyme
            to indicate the target sequence.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (6) → how to do codon optimiation in e.coli? bioroebe must support this!
 we must first get a display which codon is very commonly used in
@@ -399,34 +506,23 @@ and then we look which codons may be improvable - display
 them on the commandline
 class: OptimizeCodons.new(of_this_sequence)
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (7) → Molekulare Grösse von "Ubiquitin"? "8.5 kd".
       ^^^ das sollte automatisch ausgerechnet werden
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (8) → taxonomy !!!!!!!!!!!!!!!!!!
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (9) → Given a list of gene names that I would like to get chromosome/position
       information for (in mm10). Is there some service online where I can
       paste this list? ^^^ enable this
--------------------------------------------------------------------------------
-(10) → Show the frequency of codons in different tables
-  This works quite ok, but right now the approach is to store
-  this in a .yml file which is not ideal.
-  Thus, we have to add two things:
-    - The ability to store this into a SQL database
-    - The ability to batch-download all of these codons,
-      which first requires that we have a way to obtain all
-      taxonomic ids.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (11) → Add a way in bioroebe to store a gene into a yaml file
       or so, and to also load it up again. Perhaps simplify
       this automatically. Need some ways to describe that.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (12) → Make bioroebe very useful from the www, no matter if via sinatra
       or rails. It should be a tool-set project on the www as well.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (13) → Suppose you have a GenBank file which you want to turn into a
       Fasta file. For example, lets consider the file cor6_6.gb
       which is included in the Biopython unit tests under the
@@ -441,12 +537,12 @@ call it format-converter or so
 the GUI works somewhat but needs to be polished up.
 THEN THIS CAN BE REMOVED!!!!!!!
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (14) → Wir brauchen eine table wo wir die starken promotoren verschiedener
       Organismen zusammenstellen und vergleichen können.
         strong_promoters.yml
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (15) → add:
   start position of exons
   and show the sequence based on that file
@@ -454,9 +550,9 @@ THEN THIS CAN BE REMOVED!!!!!!!
   Normally there's a "gene" entry for each gene, so:
   awk 'BEGIN{FS="\t"; OFS="\t"}{if($3 == "gene") print $1, $4, $5}' foo.gtf
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (16) → also add 30-33 to aminoacids hmmm difficult.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (17) → http://bioinformatics.oxfordjournals.org/content/18/8/1135
       "TFBS: Computational framework for transcription factor
       binding site analysis"
@@ -464,7 +560,7 @@ THEN THIS CAN BE REMOVED!!!!!!!
     into bioroebe
     http://tfbs.genereg.net/
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (18) → They include trypsin, chymotrypsin, thrombin, plasmin, papain and factor Xa.
       ^^^ provide means to identify where they cut,
           and show this then by simualting a digest.
@@ -472,7 +568,7 @@ THEN THIS CAN BE REMOVED!!!!!!!
           also document this on bioroebe todo
       this is done via digestion/digestions
       but it's not quite perfect yet.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (19) → a) add a commandline way to generate a random protein
          with a specified length and then display it on the
       commandline [DONE] !!!
@@ -498,29 +594,29 @@ THEN THIS CAN BE REMOVED!!!!!!!
   Enable this BOTH from the commandline AND from the
   interactive variant and from sinatra! Hmmmm.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → add an option to design a
     degenerate primer
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (2) Add upcase to sequences and ensure that it works; also document it
     internally and in the .pdf tutorial
     what does that mean? upcase as method? hmmm.
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → http://www.biomart.org/other/user-docs.pdf
        ^^^ work through this
 ^^^ integrate the old .cgi part and improve as you go
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → Access geninfo numbers easily.
       Die suchen und runterladen.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Add all of bioruby into bioroebe:
     continous project
     https://github.com/biopython/biopython
     https://github.com/bioruby/bioruby/tree/master/lib/bio
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (3) → https://github.com/bioruby/bioruby/issues/134
 ^^^ check this, for restriction enzymes
     http://rebase.neb.com/rebase/enz/MboII.html
@@ -530,9 +626,9 @@ THEN THIS CAN BE REMOVED!!!!!!!
 > seq = seq.reverse_complement
 > Bio::RestrictionEnzyme.cut(seq, 'MboII').primary rescue [seq]
 => ["atcatcaatcctaatcttct"]
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (4) → Document how an ORF is defined for the bioroebe project.
-..........................................................................
+------------------------------------------------------------------------------------------
 (5) Continue with biojava in bioroebe.
     → We need to make some table that tells us what is implemented
@@ -547,7 +643,7 @@ THEN THIS CAN BE REMOVED!!!!!!!
     dprimer M-T-T-Y-Y-T-A-A-A-STOP
-..........................................................................
+------------------------------------------------------------------------------------------
 (1)  →  The codon tables:
      →  In January we added a codon-table GUI to ruby-gtk3.
@@ -576,31 +672,29 @@ THEN THIS CAN BE REMOVED!!!!!!!
 This now sorta works semi-ok.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → In the bioroebe-shell, enable input such as:
          NC_000011.10
        This shall quickly download this sequence into the
        local file, and also rename it properly.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 → clone all of bioruby
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → bioinf bücher udrhclesen und zeug inkludiere !!!
 ^^^^^ mehr bilderchen hinzufügen ... auchv on den GUIs eventuell.
     Und auch biopython durcharbeiten und alles wichtige nach
     bioroebe übertragen.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Add: DetectMotif
   This class shall be used for detecting subsequences.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Neue funktionälit rein
--------------------------------------------------------------------------------
-- mehr doku!
--------------------------------------------------------------------------------
-- continue on bioroebe, and when it is done, write to the guy.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
+- mehr doku!!!
+------------------------------------------------------------------------------------------
 - Rewrite bioroebe completely - add some tests, too or so, to
   test this. ^^^
   That way we learn how to write tests.
@@ -643,22 +737,13 @@ extend bioroebe sinatra interface
 also add a footer to show which entries are available or so
 → in bioroebe, mach das die postgresql datenbank wieder funktioniert ...
-..........................................................................
+------------------------------------------------------------------------------------------
 → ^^^ improve this whole project a lot
   before uploading then send email
-- 1fat.pdb
-  ^^^ download this, also via bioshell
-  download 1fat
-  ^^^ notify the user about this
-  but put it into the dir of bioshell
 → add:
     set_dna :insulin
@@ -674,44 +759,26 @@ also add a footer to show which entries are available or so
   → becomes: http://www.ncbi.nlm.nih.gov/gene/3630
   wtf ... better to learn how NCBI uworks
--------------------------------------------------------------------------------
-- Add a seuqence table int obioroebe for GFP, YFP etc
+------------------------------------------------------------------------------------------
+- Add a seuqence table into bioroebe for GFP, YFP etc
   and mae this show in both the interactio bioshell but
   also the main README.md
--------------------------------------------------------------------------------
-- stop_frame1?
-  ^^^ add support for this
-  and stop_frame2?
-  etcc
-  to show stop-codons in this colour
-  THEN UPLOAD!
-  ^^^ this works now but is not documented
--------------------------------------------------------------------------------
-- chop to first ATG
-  chop :ATG
-  ^^^^ enable this, to chop towards the first ATG
-       sequence in the string
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 → http://www.biophp.org/stats/describe_data/demo.php?show=formula
   ^^^ should also add documentation like this, also via www interface
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 → add mouse chromsoome URL, also in the bioshell
   and the main README, to be of help for the
   user. add a mouse subsection.
-..........................................................................
+------------------------------------------------------------------------------------------
 → fix the taxonomy stuff...
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → add 2nd_orf
     → this shall scan for the 2nd orf
     → and third ORF as well, then, and document it.
-..........................................................................
+------------------------------------------------------------------------------------------
 (2) → Add a "cutter-range example" in restriction enzymes +
       table + examples + tutorial
@@ -719,21 +786,16 @@ also add a footer to show which entries are available or so
       Also, add in the documentation where this
       can be found.
-..........................................................................
-(3) → Add aaruler, similar to "ruler"; in the bioshell.
-      But we want to do this on the dna-sequence rather
-      than the aminoacid sequence.
-      This works but the display is not ideal.
-..........................................................................
+------------------------------------------------------------------------------------------
 (4) → Add some codon-usage analyzer. What shall it show? It
       should show how many codons are used, frequencies etc...
       by an organism, and compare that to other data.
-..........................................................................
+------------------------------------------------------------------------------------------
 (5) → Implement a GPCR interface.
       This is for "G-protein coupled receptors."
       Denote which variants exist and so forth. Document it as well.
-..........................................................................
+------------------------------------------------------------------------------------------
 (6) → alu?
       Will read from the file `/Programs/Ruby/2.3.0/lib/ruby/site_ruby/2.3.0/bioroebe/yaml/alu_elements.yml`.
@@ -756,7 +818,7 @@ also add a footer to show which entries are available or so
        ^^^ add this and document it or something like that
            And perhaps add a small protein as an example how to
            work with .pdb files instead.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (4) → Extend bioroebe to allow download
        PDB files
@@ -770,13 +832,13 @@ also add a footer to show which entries are available or so
        in 3EML 2VTP 2VEZ
        do
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → Fully integrate electron microscopy then remove the old entry.
       Test it though.
       Hmm... but ... we will first polish the main bioroebe
       gem AND the taxonomy gem and THEN AFTERWARDS
       integate elctron microsopcy.
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → ORF Finder:
       We must add an ORF finder for the bioroebe project,
@@ -785,23 +847,23 @@ also add a footer to show which entries are available or so
       This works partially... start_stop works but we do not
       yet find all subsequences.
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → must change determine whether we have protein or nucleotide or
       so via a topelvel method!
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) →  there is a talens module.
        we have to improve on it for a while
        better docu
        more testing
        then we can get rid of this entry here
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → 33.44
       Next showing the nucleotides 33 to 44 (including 33 and 44).
         The length of the fragment will be 12 nucleotides.
         5' - 2;70;130;180 - 3'
         ^^^ there is some problem; we somehow embed the colour codes,
             which should not happen.
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → set_aa DTLCIGYHAN NSTDTVDTVL EKNVTVTHSV NLLEDKHNGK LCKLRGVAPL HLGKCNIAGW ILGNPECESL STASSWSYIV ETSNSDNGTC YPGDFINYEE LREQLSSVSS FERFEIFPKT SSWPNHDNKG VTAACPHAGA KSFYKNLIWL VKKGNSYPKL NQSYINDKGK EVLVLWGIHH PSTTADQQSL YQNADAYVFV GTSRYSKKFK PEIATRPKVR DQEGRMNYYW TLVEPGDKIT FEATGNLVVP RYAFMERNAG SGIIISDTPV HDCNTTCQTP EGAINTSLPF QNIHPITIGK CPKYVKSTKL RLATGLRNVP SIQSRGLFGA IAGFIEGGWT GMVDGWYGYH HQNEQGSGYA ADLKSTQNAI DKITNKVNSV IKMNTQFTAV GKEFNHLEKR IENLNKKVDD GFLDIWTYNA ELLVLLENER TLDYHDSNVK NLYEKVRNQL KNNAKEIGNG CFEFYHKCDN TCMESVKNGT YDYPKYSEEA KLNREKIDGV KLESTRIYHH HHHH
     ^^^ enable copy/pasting,
@@ -816,7 +878,7 @@ also add a footer to show which entries are available or so
     This sequence has 50 aminoacids.
     ^^^ das stimmt net.
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → add this functionality:
       meting temper
@@ -853,70 +915,57 @@ also add a footer to show which entries are available or so
     and also provide a commandline-way to calculate them,
     using ruby. The latter may be useful and rather easy for
     scripted use.
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → show insulin
       ^^^ to show the insulin structure
           how to find it? no idea...
           but we should have these structures already made available somewhere.
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → Todo: find family of enzymes, based on sequence structure
       alone.
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → https://pubchem.ncbi.nlm.nih.gov/compound/16131099#section=Top
       ^^^ this website is quite interesting; try to use components
           from it.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → Add some option   to show the aminoacid sequence, at the least
       store it; and optionally show it.
       possibly always report how many aminoacids are
       part of that file; and optionally also show
       the whole sequence.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → WORK THROUGH the PROTOCOL AT BOKU. THEN WORK THROUGH THE VARIOUST
       TIDBIDS AT UNI WIEN STARTING WITH HEIKO.
         ^^^ da sind wir nun.
       wir sind an beginn von 1b ... hmmmm, also zerst mal das an der
       BOKU durchgehen. Dann das löschen.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → Begin tk-bindings for bioroebe, following the gtk stuff.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (2) → frame_value = position_of_the_stop_codon - position_of_the_start_codon
       ^^^ continue on this ...
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → improve both the gtk-apps parts, and the sinatra web-interface,
       and other GUI-like elements. The idea is to make this software
       more useful for people around the world, which should help
       increase its adoption rate.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (2) → Look to integrate this:
         http://www.ncbi.nlm.nih.gov/nuccore/NM_007315.3?report=fasta&log$=seqview&format=text
       ^^^
--------------------------------------------------------------------------------
-(1) → Clone and document the getorf functionality properly.
-      See: http://emboss.sourceforge.net/apps/cvs/emboss/apps/getorf.html
--------------------------------------------------------------------------------
-(2) → set_dna_sequence alu
-      ^^^ fetch random alu
-      ^^^ alu sequence
-          Ok we started this now adding more details, but we
-          need to become better at searching for this
-          sequence.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (3) → We need to make available the ... thingy magick
       emboss functionality. that may seem useful
       but also feel free to extend these parts for
       bioroebe as necessary.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (4) → integrate electron_microscopy fully
       This will take more time, so first we finish with the
       taxonomy module instead.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (5) → Improve support for BLAST up until
       middle of 2015 so that I am better prepared
@@ -927,7 +976,7 @@ also add a footer to show which entries are available or so
       So, work on BLAST tutorial at bioinf page:
         bl bioinf; rf bioinf
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (3) → integrate a "codon usage database",  whatever this means.
       It is a cool database anyway. Then document this.
       First, create a codon-usage analyze on a per-FASTA
@@ -935,7 +984,7 @@ also add a footer to show which entries are available or so
       and calculate the codon usage from there.
    ^^^ and add some GUI to this. hmmm
-..........................................................................
+------------------------------------------------------------------------------------------
 (4) → Input sequence:
       MFLMVSPTAYHQNKDECFLP
@@ -951,46 +1000,40 @@ also add a footer to show which entries are available or so
       ^^^ we should also show this on the commandline AND the
           www ... hmmm.
-..........................................................................
+------------------------------------------------------------------------------------------
 (5) → enable a graphical layer so that we can find out which
       transcription factor activates which gene(s). This
       should show e. g. a transcription factor highlighting
       a target genetic area.
-..........................................................................
+------------------------------------------------------------------------------------------
 (2) → We should add more screenshots, make them available on imgur
       as well, after storing them locally. Start with the more
       important functionality.
-..........................................................................
+------------------------------------------------------------------------------------------
 (2)  → clone serial cloner or whatever the name was, that GUI,
        so that we can offer the same functionality.
-..........................................................................
+------------------------------------------------------------------------------------------
 (1)  →
        # * searching for PubMed IDs given a query string:
        #   * Bio::PubMed#esearch  (recommended)
        #   * Bio::PubMed#search   (only retrieves top 20 hits; will be deprecated)
        ^^^ implement this
-..........................................................................
+------------------------------------------------------------------------------------------
 (3)  → Aufgabe 16 in bioroebe lösen könnnen
-..........................................................................
-(4)  → The taxonomy part should be fully integrated, without it
-       being a standalone part anymore.
-       continue on the taxonomy stuff.
-       ne day this will work again *shake fist*
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (5) → re1 = Bio::RestrictionEnzyme::DoubleStranded.new(enzyme1)
     ^^^ add this? hmmmm
     ^^^ from here.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → Colourize exon/intron boundaries.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (2) → In bioroebe: enhance phylogeny stuff and perhaps automatically
       generate pictures here.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → In sinatra: add a backtranseq entry point, perhaps
       alias it as well.
@@ -1000,7 +1043,7 @@ bioroebe --protein-to-dna
 ^^^ this shall start the GTK3 variant
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → require 'rubygems/text'
        include Gem::Text
        levenshtein_distance 'shevy', 'chevy' # => 1
@@ -1012,13 +1055,13 @@ bioroebe --protein-to-dna
        https://github.com/rubygems/rubygems/blob/master/lib/rubygems/text.rb
      ^^^ actually move that part into bioroebe itself...
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → add _source to all APIs in sinatra there. Ensure that this works
       too. The user should be able to view the source code.
       ^^^ it has been added for 2 methods so far in sinatra; we need
           to add it for the remaining ones too. Then we can remove
           this entry point.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (2) → Check out expasy
         peptidcutter
       also offer this functionality, through commandline, GUI
@@ -1026,16 +1069,12 @@ bioroebe --protein-to-dna
         https://web.expasy.org/peptide_cutter/
       We now have added trypsin but we should add more here; and
       still have to add support for sinatra here.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (3) → melting temperature subsection
       hmmm .... molecular weight calculation works now ... but
       ... is it correct for a ssDNA string? hmm...
--------------------------------------------------------------------------------
-(3) → Add useful formulas for bioshell.
-...........................................................................
+------------------------------------------------------------------------------------------
 (1) →  Degenerate Primers
        You can try to determine the degenerate primers via the Shell
@@ -1046,7 +1085,7 @@ bioroebe --protein-to-dna
        ^^^ epxnad that subsection
        more explanations and examples
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → Copy the functionality of plotorf:
       See:
@@ -1062,7 +1101,7 @@ bioroebe --protein-to-dna
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (2) → Start nucleotide position is at: 142
   See the following example:
@@ -1072,24 +1111,24 @@ bioroebe --protein-to-dna
   BIO SHELL>
       ^^^ this does not work; nothing is highlighted.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (2) → Add a myristoylierung-signal
         Met-Gly-Xaa-Xaa-YXaa-Ser/Thr-Lys-Lys
      1^^ but check first.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (3) → integrate the bioroebe_tutorial.cgi into the .md file completely.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (4) → Integrate everything from the biopython tutorial, if it makes
       sense.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (5) → Improve the codon-optimizer in Bioroebe, including the
       documentation. We need to make this really useful.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (6) →
            5'- TACACGGCACAT -3'
            3'- ATGTGCCGTGTA -5'
@@ -1098,7 +1137,7 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
 ^^^ integrate mirror repeats creation
     and searching for them. Hmmm.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (7) → continue porting bioroebe/taxonomy
   ^^^^^^^^^^
@@ -1108,12 +1147,12 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
       ^^^ das ist der nächste schritt, so das
           wir das nit mehr benötigen.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (8) → find out which bacteria all contain the needle complex; find out
       the sequence for the needle complex as well and   study it;
       find the positions of the genes responsible.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (9) → Add trypsin_digest, also in the shell, but possibly
       on toplevel as well (if the input is a protein sequence.
@@ -1127,29 +1166,24 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
      And document it; but do not digest if a prolin
      follows !!!
      ^^^ document this too into .md
--------------------------------------------------------------------------------
-(10) → in bioroebe, add a commassie check... do we include
-       arginine or not.
-..........................................................................
+------------------------------------------------------------------------------------------
 (11) → add codon usage in bioroebe
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (12) → Clone the following functionality.
         http://www.bioinformatics.nl/cgi-bin/emboss/help/sirna
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (13) → Improve the "find and scan" subsection. We must be able to find
        subsequences; check for "matches" as well, including the bioshell.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (14) → Clone the CLUSTAL format aligment.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (15) → We need to be able to load up a whole geneome into bioroebe,
        and then be able to manipulate it.
    ^^^ perhaps test this with some example
        data or so...
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (16) → Restriction enzymes:
        Add a subsection about restritction enzymes including
@@ -1163,7 +1197,7 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
        general, so that we can reproduce and verify the
        information there.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (18) → clone pepinfo
        The program "pepinfo" plots various amino acid properties in
@@ -1181,7 +1215,7 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
        The data are also written out to an output file.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (19) → gff?
     There are 6 .gff3 files in the current directory.
@@ -1193,23 +1227,22 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
     ^^^ we need an analyze-mode as well.
-..........................................................................
+------------------------------------------------------------------------------------------
 (20) →  ^^^^ add the ability to
              show a ruler AND highlighting as well
              ^^^ then document it.
-..........................................................................
+------------------------------------------------------------------------------------------
 (21)  →  https://github.com/bioperl/bioperl-live
          Look what we can take from ^^^.
          https://github.com/bioperl/bioperl-live/tree/master/examples
-..........................................................................
+------------------------------------------------------------------------------------------
 (23)  → continue biojava, and bioroebe a bit
         Ideally we should have biojava o a working point.
-..........................................................................
-(24)  →  Clone all of Emboss. :)
-..........................................................................
+------------------------------------------------------------------------------------------
 (25)  → clone the functionality found at https://web.expasy.org/protparam/
        https://web.expasy.org/cgi-bin/protparam/protparam
@@ -1219,7 +1252,7 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
   Theoretical pI: 5.78
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (27) → NP_417539.1
       https://www.ncbi.nlm.nih.gov/protein/NP_417539.1
@@ -1227,26 +1260,26 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
   ^^^ if the input is exactly like the above, on the first line,
       download the sequence.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (28) → Integrate these nice GUI parts parts:
     https://dev.to/kojix2/introduction-to-gr-rb-data-visualization-with-ruby-2c39
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (29) → http://insilico.ehu.es/
   ^^^ check if we have all of this incorporated
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (30) → http://www.biostars.org/
        ^^^ regularly work through this
        and try to help
        and extend bioruby at the same time.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (31) → The taxonomy-submodule should work one day, and be properly
        documented as well. Perhaps integrate the parts of Taxonomy
        that can be included into the toplevel domain.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (32) → Enable:
          Bioroebe.set_genetic_code()
@@ -1262,7 +1295,7 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
       ^^^ enable this as well; extent documentation too.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (34) → We have found a restriction enzyme called NheI.
        The sequence this 6-cutter relates to is: `5' - GCTAGC - 3'`
@@ -1270,23 +1303,23 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
        This restriction enzyme will produce a blunt overhang.
          ^^^ nope das ist falsch
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (35) → Sau3A?
          ^^^ enable this restriction site
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (37) → Add matplotlib support.
          try_to_use_matplotlib
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (38) → https://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/tmTools/RESTfulAPIs.html
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (39) → The following input:
          downcase; orf?; seq?
        leads to strange display. Something is wrong here, must be checked.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (40) → Continue with rosalind problems.
        These challenges can be found here:
@@ -1295,42 +1328,42 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
        Also integrate these rosalind-quizzes into bioroebe
        when possible.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (41) → https://web.expasy.org/cgi-bin/peptide_mass/peptide-mass.pl
        ^^^ make the above usable in sinaitra as well
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (42) → Integrate a way to search for commonly known promoters:
         promoters?
         ^^^ this functionality
         ^^^ this has to be expanded
         and ...
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (43) → Integrate:
        http://biotools.nubic.northwestern.edu/OligoCalc.html
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (44) → Extend the Java part of BioRoebe systematically..
   What should come next? Let's make a list.
        → remove_numbers [DONE]
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (46) → Study gnuplot; one day we have to draw graphs.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (47) → Add a genome browser, both ascii without GUI and also
        with. In ruby-gtk.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (48) → Clone the functionality of:
          http://www.biophp.org/minitools/restriction_digest/demo.php
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (50) → Add the loxP sequence to readme [DONE] and explain this
        better on the main readme; and perhaps also assign
        the sequence via the bioshell.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (51) → 33. Cephalodiscidae Mitochondrial UAA-Tyr Code (transl_table=33)
             AAs  = FFLLSSSSYYY*CCWWLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSSKVVVVAAAADDEEGGGG
@@ -1341,7 +1374,7 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
         ^^^ add a parser, and document it, that can take this input
             and output the corresponding code, in a valid .yml file.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (52) → Add to bioroebe the ability to add cloning vectors
        and molecular_weight calcuation
        for this
@@ -1363,19 +1396,19 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
      ^^^ we also need a way to find out what resistance genes
          are carried there.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (53) → In the lambda genome sequence there are 10 EcoB and
        5 EcoK sites.
        ^^^ verify this too, as an example as well
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (54) → show restriction sites, composable and compatible with
        serial clone ... hmm
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (55) → enable:
          BIOROEBE_USE_COLOURS:
        can be 0 or 1
        what is this?
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (56) → Burrows-Wheeler-Transform (BWT)
        ^^^ add some method here
@@ -1388,15 +1421,15 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
        also test this against my paper-result
        with input being: "GATAG$".
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (56) → Enable working with several genes... hmm and store that somewhere.
        Something like a per-project workspace thingy.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (57) → Add:
        http://nar.oxfordjournals.org/content/35/suppl_2/W71.long
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (58) → Now, you may want to translate the nucleotides up to
        the first in frame stop codon, and then stop (as
        happens in nature):
@@ -1410,14 +1443,14 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
        Then continue from here:
          https://people.duke.edu/~ccc14/pcfb/biopython/BiopythonSequences.html
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (59) → Add:
          set_dna :Ubiquitin
          set_dna :ubiquitin
        ^^^ we want to obtain the ubuiqitin sequence
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (59) → Telomers
        Telomeres are listed from 5' to 3'.
@@ -1431,28 +1464,28 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
          doc_telomeres
          ^^^ add this to say the human telomere sequence
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (60) → ORF_positions?
       ^^^ change this a bit, to actually show the positions
           of the various ORFs with the start-position.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (62) → add:
        setgene2
        add_dna2
        dna2
        dna? <--- this one is not a setter but a query.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (63) → improve the TM calculation. must be better, must have more
        documentation, and a small tutorial.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (64) → Compare bioroebe to:
          https://www.ncbi.nlm.nih.gov/orffinder
        whether both return the same
        also possibly add a web-gui
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (65) → Find out ratios from:
        Doolittle RF. 1989. Redundancies in protein sequences. I
@@ -1478,16 +1511,16 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
     Bioroebe::Blosum[50] as an API.
     and document it in general.
-..........................................................................
+------------------------------------------------------------------------------------------
 (65) → http://www.biomart.org/other/user-docs.pdf
        ^^^ work through this
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (66) → add:
            class Cell
             ^^^ simulate a cell
       Hmmm. Needs specific components ... and needs a better plan.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (68) → class Protein:
         add glycosyslation patteren
@@ -1496,18 +1529,18 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
         need to somehow add the modiication type
         https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5358406/
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (69) → In the BioShell we must be able to do probes - completementary
        to amino acids.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (70) → Add www-related functionality to bioroebe eventually make use
        of rails, but start with sinatra possibly. In the long run,
        make it flexible to work with as many different frameworks
        as possible, though.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (71) → Spaltstellen anzeigen zum beispiel lambda-DNA verdau
        BgI II.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (72) → dnaanalyze
   In the DNA string `TCCGTCGCAACACATCGCCTCAACAAACCGACCGGGATATGCAATACCGGAATCCGATCCTTTAGAAGCTGCATTCCAAACGCTTGCAATAACACCCACTCGACTATTCAGCATTGGCAAAGGGTACGAATTCGACGAAGGGAGGGTGCTATATTTTCCAAGTTGCTCGCCGATTGATACGGAGCCTGTGGAAAGATTTCGCGGCTCTAGTCTTTAGCTTTGATGTCACCCCTGAGTAGTAACCCGGCGTGGTAGCTTTCATTAGACTTCTCGGAGAGAGTATTAAGCAAAGGTGGAGGTCCCAGGGGTCCAGTGAGCTGTATCGCACTAAAAGCATGCCTACGGGCAATGCTATTTTGCTCACAGGAACTTTGGGGGAGCCACAAACTCTCGAAGCCGGATTGTTGTGGCGGCTAACTTTCCAAAGGCGACCATTCATGGTCTGAATGGGCCCTCACCAGAAGAACGTTTTCGACGGGCATTCTTCCCCGGGGTTTCGAAGGCAAGGGTCAGCACGGCGCGGAAAAGTACGCGACGCATACCGGACTAGTCATGCAACTCCCTCGGAACTGGCGATTCCCACCCAAGAGACGCACGCTGATCATTGCCCATGCCGACTGGAGATGCTGAATTTGGTATGCGGGTCTGTTGCCAGCGCTGACATTATCGGACATTGTGGGGAGAACCGTGTGATTGATTGAGCTGGCGCATTTGTCCGCATGCTCTCCTCATGTGGACACCTTCGCAGGTTCTTTCCGCGGCCACAGTGTCGGGATCTACCCCTGGTGCGTCGCCGCGAGTACAGGTGGGGTTTCGCGCATGAGAACCAATGTTGCACGCCTCAAAACATGGCTGTAACATATTAGCGCCAATAAAAATTTTTGGCAACAAAGAAACAAGGCCAACCGAAGTGCTAAGCCGCGATCATGAAGGGGCGATGCCAGAATGGGAGTCTGCCTTTCCTGTGTGGACGTGAGATTGTACCTAGACAGAGAACGCC` we found these Nucleotides:
@@ -1532,11 +1565,11 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
         we need to make it so that an input sequence
         can be assigned, and dnaanalyse --GUI should
         start it too. ALSO document it once this works.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (73) → go through the individual components slowly and improve them,
        step by step, including the documentation. Then eventually
        remove this todo-entry here.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (74) → Add a consensus sequence for:
          Asn-X-Ser/Thr-Conesnsus
@@ -1548,13 +1581,13 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
         NGlyc
         /N-?Glyc/i
         ^^^ use that regex
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (74) → make sure that newly generated files respect the
        default chmod value on the system. from bioroebe.
        right now we default to 755 which I assume is
        hardcoded but perhaps this is wrong.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (75) → require 'bio'
   # creating a Bio::Sequence::NA object containing ambiguous alphabets
@@ -1579,34 +1612,34 @@ Imperfect DNA mirror repeats (IMRs) are less than 100% symmetrical.
   part nto a standalone file
   so taht it can be used by both the .cgi and
   well rdoc...
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Add more protein-specific thingies to bioroebe.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Die bioshell vorantreiben und durch std_biology.rb abarbeiten.
   Vielleicht können wir ja etwas davon auslagern in eine Klasse
   oder so.
   Das ganze sollte auch mit Webmin (biomin) verknüpft werden, so das
   wir die Bioshell auch elegant über das www verwenden können!
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - ^^^ when we find restriction enzyme sites in a DNA
       string, colourize them RED.
    also set it to
      set_restriction_size()
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - also ... while learning C++ we extend the project here...
   Useful C++ things will be combined.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - As of April 2003, there were 176,890 total taxa represented.
   ^^^ we need a way to also output how many entries we
   have there.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Replace bioruby with bioroebe completely!
   In order for this to work, we first need to find out
   what bioruby is able to do. :P
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - append 33
   # ^^^ in the bioshell
   Only numbers were given: Adding 33 random nucleotides to the main string next.
@@ -1626,7 +1659,7 @@ Did you mean?  return_random_codon_sequence_for_this_aminoacid_sequence
  ^^^^^ BUG!
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 > rest?
 We found these restriction sites within the sequence `TTCAGAACTCAACGCCTGGTTGGCCGTCCAGTAAGCTGACTAAGTAAGTCTATGCCCGCGATAACCAGGATACAGATATCGTGAAACCTGGTTTATCTCCTTCTATAAGAGTCTGCACATCTAGC`:
@@ -1656,7 +1689,7 @@ We found these restriction sites within the sequence `TTCAGAACTCAACGCCTGGTTGGCCG
 ^^^^ also show the position
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
     PMID entries are:
@@ -1728,7 +1761,7 @@ We found these restriction sites within the sequence `TTCAGAACTCAACGCCTGGTTGGCCG
-..........................................................................
+------------------------------------------------------------------------------------------
 Bei der Datenbanksuche werden die gemessenen Massen mit den Peptidmassen
 aller Proteine bzw. Gene in einer Datenbank (NCBI, Uniprot) verglichen. DNA-
 Sequenzen werden dazu in Proteinsequenzen übersetzt und in silico mit der beim
@@ -1738,7 +1771,7 @@ Verdau benutzten Protease geschnitten.
-..........................................................................
+------------------------------------------------------------------------------------------
 Complexity of libraries:
 How many independent clones are necessary to represent a genome (plant,
 animal/fungus) or how many such clones have to be screened to have realistic
@@ -1773,7 +1806,7 @@ have to be hybridized.
-..........................................................................
+------------------------------------------------------------------------------------------
 BIO SHELL> BglI?
@@ -1818,12 +1851,12 @@ List all enzymes that produce compatible ends for the enzyme.
 http://biopython.org/DIST/docs/api/Bio.Restriction.Restriction.Blunt-class.html
-..........................................................................
+------------------------------------------------------------------------------------------
 https://www.reddit.com/r/bioinformatics/comments/5o3kn8/bioinformatics_contest_2017_jan_23rd29th_solve_as/
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) → Finish all of biophp integration into bioroebe.
 http://www.biophp.org/
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 locate oriC here:
@@ -1858,13 +1891,13 @@ But I do not know how to locate ORIs.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 ^^^ also integrate git into bioroebe.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 WIR MÜSSEN DAS HIER EXTREM VERBESSERN.
 DANN UPLOADEN UND ALS BASIS FÜR APPLICATIONS NUTZEN.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 Study MetaCyc
 ^^^ study metabolic pathways.
@@ -1873,7 +1906,7 @@ http://metacyc.org/
 → Create KuroMetaCyc, in Analogy towards Metabolic Cycle.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 Welcome to BioShell May 2012. Type "help" to get some help.
@@ -1895,7 +1928,7 @@ When we type this, we then ask:
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 http://biopython.org/DIST/docs/cookbook/Restriction.html#mozTocId101269
@@ -1985,16 +2018,16 @@ ausreichend.
-..........................................................................
+------------------------------------------------------------------------------------------
   BioTodo - GENESIS, science fiction.
      - create virus(:which_one, :amount) # Note the difference to the below
      - create hydra(:amount)
      - create bread
-..........................................................................
+------------------------------------------------------------------------------------------
   → both
      ^ should work, does not work right now.
-..........................................................................
+------------------------------------------------------------------------------------------
 → Taxonomy is now integrated into bioroebe. This is good but we need more
   documentation, some more tests, a rethinking of the layout and the
   structures, and a fixing of the query-part of the database.
@@ -2008,13 +2041,13 @@ ausreichend.
   at about the same time \o/
   AND document this related-problems too
   Integrate this some other day...
-..........................................................................
+------------------------------------------------------------------------------------------
 - http://www.restrictionmapper.org/cgi-bin/sitefind3.pl
   ^^^ Das sollte man integrieren, die Funktionalität, so das
       man ALLE Restriktion-Enzymes ausprobiert ausgehend von
       einer bestimmten Sequenz.
-..........................................................................
+------------------------------------------------------------------------------------------
 → A search is essentially substring search across a database of strings
   (albeit with a smaller alphabet). Some common use cases: one,
   scientists will search for certain genes that they've used in engineered
@@ -2033,13 +2066,13 @@ ausreichend.
     Bioroebe::DetermineOptimalCodons
     ^^^ this is currently incomplete.
-..........................................................................
+------------------------------------------------------------------------------------------
 → Redo restrictions enzymes completely.
   And polish this a LOT.
   This may take some days. But we want this to be REALLY good and
   lasting for a long time.
   Need to keep on working at that!
-..........................................................................
+------------------------------------------------------------------------------------------
 → Add: average_aminoacid_weight?
@@ -2077,7 +2110,7 @@ end
 → We must be able to align not only nucleotides but also aminoacids.
   But where is the alignment comparer? perhaps hamming distance?
   hmm we have to see.
-..........................................................................
+------------------------------------------------------------------------------------------
 → /Programs/Ruby/2.3.1/lib/ruby/site_ruby/2.3.0/bioroebe/bioshell/menu.rb:311:in `menu': undefined method `upcase' for ["EcoRI"]:Array (NoMethodError)
      from /Programs/Ruby/2.3.1/lib/ruby/site_ruby/2.3.0/bioroebe/bioshell/user_input.rb:31:in `block in enter_main_loop'
      from /Programs/Ruby/2.3.1/lib/ruby/site_ruby/2.3.0/bioroebe/bioshell/user_input.rb:12:in `loop'
@@ -2106,12 +2139,12 @@ end
         at this date.'
         SendEmail.new to: Roebe.email?, data
-..........................................................................
+------------------------------------------------------------------------------------------
 → Document which parts of emboss have already been copied.
   → EMBOSS.md
-..........................................................................
+------------------------------------------------------------------------------------------
@@ -2168,7 +2201,7 @@ Traceback (most recent call last):
     http://www.snapgene.com/products/snapgene_viewer/
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → Wir sollten GFP tagging unterstützen, also wie das
       Protein-Konstrukt aussehen soll und so weiter.
       Das geht teilweise...
@@ -2177,22 +2210,22 @@ Traceback (most recent call last):
        fügt die sequence asl main dna sequenz ein.
        Was fehlt? Hmmmm... eventuell noch mehr an
        dokumentation.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - in bioroebe, create subsequences for siRNA, then scan for
   submatcher + report where these are. Should be fast too.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Reverse complement now works quite well, also via the sinatra
   interface. We still should have a way to show 5' and
   3', both on the commandline, and via sinatra.
   Perhaps via --fancy commandline flag or so.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Cn3D files?
   ^^^ add support for these; research what they are, too.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Consider adding graphviz, perhaps to the taxonomy project
   where we make graphs towards different nodes or so...
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - in parse fasta
    @colourize_sequence = false
    ^^^ change this lateron...
@@ -2200,7 +2233,7 @@ Traceback (most recent call last):
    this method now exists, but we still have to make
    the check better whether it is a protein or a DNA/RNA
    add a toplevel method for this.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - clone the BLast ident matcher functionality for aminacids into
   Bioroebe.
@@ -2215,7 +2248,7 @@ Traceback (most recent call last):
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Be able to mark exon/intron boundaries.
 - Add "taxid?" to tell us the name of the organism. This works now.
@@ -2259,9 +2292,9 @@ Traceback (most recent call last):
   ^^^
   study sumoplot ...
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - http://a-little-book-of-r-for-bioinformatics.readthedocs.io/en/latest/src/chapter7.html
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - http://biopython.org/DIST/docs/tutorial/Tutorial.html#htoc22
   ^^^ continue here; "You can also specify the table using the
   NCBI table number which is shorter, and often included in
@@ -2269,7 +2302,7 @@ Traceback (most recent call last):
 ^^^ work through this and see if it is good.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Clone ALL of biophp, if it us useful.
@@ -2316,7 +2349,7 @@ Palindromic sequences finder
 We should also put this poart into doc/ subsection
 to keep track of what is missing and what is not.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → sizeseq
   ^^^ clone this functionality and describe it in detail.
@@ -2353,7 +2386,7 @@ foobar.fasta
 ALSO ADD A GUI; sizeseq.rb was added in February 2021.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - In the sinatra-web-interface for Bioroebe:
   continue quiz in rosalind !!!
   also, at to_dna: default to RNA
@@ -2372,8 +2405,8 @@ ALSO ADD A GUI; sizeseq.rb was added in February 2021.
        → formatted_view
        111^^^^ in ncbi format
        and document all of this.
-..........................................................................
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Add a ruby-GUI stuff, probably the old biology/ subsection
   will be moved into the project.
@@ -2470,7 +2503,7 @@ ALSO ADD A GUI; sizeseq.rb was added in February 2021.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Identifying amino acid cleavage sites (Sigcleave)
   For amino acid sequences we may be interested to know whether
@@ -2533,29 +2566,22 @@ ALSO ADD A GUI; sizeseq.rb was added in February 2021.
   ^^^ da gibt es einen bug. später nochmals probieren.
-- We will read from NM_001180897.3_Saccharomyces_cerevisiae_S288c_Aga2p_AGA2.fasta
-  The file NM_001180897.3_Saccharomyces_cerevisiae_S288c_Aga2p_AGA2.fasta has this FASTA header:
-    >gi|398364826|ref|NM_001180897.3| Saccharomyces cerevisiae S288c Aga2p (AGA2), mRNA
-  ^^^ this should also (optionally) tell us the organism, via a switch.
-      for this we need some way to return the taxonomic ID of an organism
 - we have to add expasy...
   functionality to the cmdline too.
   Which one specifically? Let's see...
     https://www.expasy.org/
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - https://biopython.org/wiki/Category%3ACookbook
  ^^^ clone that
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - include covid genome, and begin to analyse it in bioroebe
     "Das Genom von SARS-CoV-2 sei doppelt so groß wie jenes
      von Influenzaviren, daher scheinen letztere viermal
      so schnell zu mutieren, schrieb Moshiri."
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Look at the GUIs that are part of the BioRoebe project.
   Polish these part, at the least one widget, then
@@ -2570,7 +2596,7 @@ ALSO ADD A GUI; sizeseq.rb was added in February 2021.
   Hmmm. And then, also consider transitioning into gtk3,
   and make mroe screenshots.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - https://www.ebi.ac.uk/Tools/seqstats/emboss_pepstats/
   http://www.ebi.ac.uk/Tools/services/web/toolresult.ebi?jobId=emboss_pepstats-I20160208-020243-0564-53154194-oy
@@ -2582,7 +2608,7 @@ ALSO ADD A GUI; sizeseq.rb was added in February 2021.
 - Improve on temperature content and how it is calculated
     someone googled for it in 2014 so build on it
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - pfasta /Depot/Temp/bioroebe/NM_000539.3_Homo_sapiens_rhodopsin_RHO.fasta
   Will read from the file `/Depot/Temp/bioroebe/NM_000539.3_Homo_sapiens_rhodopsin_RHO.fasta`.
@@ -2593,7 +2619,7 @@ ALSO ADD A GUI; sizeseq.rb was added in February 2021.
   Now assigning aminoacid sequence to:
     AGAGTCATCCAGCTGGAGCCCTGAGTGGCTGAGCTCAGGCCTTCGCAG
     AGAGTCATCCAGCTGGAGCCCTGAGTGGCTGAGCTCAGGCCTTCGCAG
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 - Formats
@@ -2647,7 +2673,7 @@ ALSO ADD A GUI; sizeseq.rb was added in February 2021.
   tinyseq   NCBI TinySeq XML
   ztr   ZTR tracefile   ztr
-..........................................................................
+------------------------------------------------------------------------------------------
 (1) Look at f1 display:
@@ -2670,7 +2696,7 @@ we probably have to rewrite the whole thing
 BEFORE we add ANY COLOURS.
 OH WELL.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (100) → Add a primer-design widget
        The idea is to be able to manipulate forward and
@@ -2684,7 +2710,7 @@ perfect but it is a start.
 https://www.bioinformatics.nl/molbi/SCLResources/sequence_notation.htm
 ^^^ and check what is useful there. perhaps also add
     nicer visual cues to pretty it up a bit.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → Compare bioroebe to:
         https://www.ncbi.nlm.nih.gov/orffinder
@@ -2694,18 +2720,18 @@ whether both return the same also possibly add a web-gui
     check this... so that we can search in standard ORF
     but also in different ORFs
     und die länge angeben, zumindest vom längsten ORF start + stop... also so das das ergebnis auch passt
-...........................................................................
+------------------------------------------------------------------------------------------
 test reverse complement in bioroebe
 ^^^
 new_WWW/
 ^^^ this should eventually become the new web-related interface.
 Ah well. Perhaps not ... ruby-cgi is soooooo annoying ...
-...........................................................................
+------------------------------------------------------------------------------------------
 (154) → the blosum-viewer should be supported in the cgi part
       and sinatra part as well.
       This now works for sinatra. Need to enable this for
       the cgi-part too eventually.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (155) → port the sinatra stuff together in bioroebe
   create a dir: web_api
   ^^^ also make params? usable in both sinatra and cgi page
@@ -2716,18 +2742,18 @@ Ah well. Perhaps not ... ruby-cgi is soooooo annoying ...
   add tons of HtmlTemplate[]
   and replace the ad-hoc code otherwise...
   ^^^ yeah, finish the HtmlTemplate stuff.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1)  → https://i.imgur.com/ptcSn12.png
     ^^^ enable such an overview; this shows mass compuation e.g
         peptide mass and such
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (80) Bioroebe.sanitize_nucleotide_sequence
      ^^^ port this into java. The code has been written for this already,
          but we currently fail to link it.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (81) Bioroebe.base_composition
      ^^^^^^^^^ port this into java
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (82) - work a bit more on tk!!!
   in particular to start it from the bioshell as-is.
   ^^^ this is mostly done for quick
@@ -2740,20 +2766,20 @@ Ah well. Perhaps not ... ruby-cgi is soooooo annoying ...
 hamming_distance 	[PARTIALLY IMPLEMENTED; ~80%]
 protein_to_DNA
 ^^^^ improve both while improving tk_paradise docu as well.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (83) Batch-create the .exe files on windows for libui, once
     the first has been added. And then test it too
     AND document it. This should be done with the controller
     eventually. Once this works, we can remove this entry
     here.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (84) port more libui stuff in bioroebe. We have two widgets ported so far;
      add more such entries.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (85) after libui has been ported, explore how gosu works on windows.
      if possible add things to a gosu-specific UI as well, but
      we may need a common, unified GUI base for that.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (86)
 add libui bindings AND once done make sure the controller works in
@@ -2762,22 +2788,22 @@ libui as well. Embed the various things into it.
 Tab 	A set named tabs for placing items in
 ^^^ use this perhaps also in bioroebe hmmm
     yeah.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (87) https://github.com/cnjinhao/nana/wiki/User-Works-using-Nana
 ^^^ port the "DNA hybrid"
 https://camo.githubusercontent.com/4c27d554ca4d698d288628f21255f917c2c577e35d7e11dd67e21880d56b6b0a/687474703a2f2f6e616e6170726f2e6f72672f696d616765732f73637265656e73686f74732f746864795f7365715f6578706c2e706e67
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (88) Bioroebe::Cell
      ^^^ think about what to do with it. If we don't need it then perhaps
          we should just remove it. Think about this more at 2022, before
          deciding what to do.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (89) - Add emboss cgplot functionality.
 https://www.bioinformatics.nl/cgi-bin/emboss/cpgplot
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (90) - integrate calculation of the Instability index (II)
 The instability index provides an estimate of the
@@ -2815,9 +2841,9 @@ that the protein may be unstable.
 The instability index (II) is computed to be 65.43
 This classifies the protein as unstable.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------
 (1) → We have now added a method to show all hydrophobic amino acids, via the
       method .hydrophobic_amino_acids?. This works and has been documented
       in May 2022. However had, we also still need a way to PREDICT
       hydrophobic segments in a polypeptide sequence.
--------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------