Abstract
Traditional sequence-based search methods such as BLAST and FASTA can be used to
identify sequence similarities. Recently, there has been growing interest in
performing RNA shape similarity searches inside selected genes to locate RNA
structure motifs that are known to play functionally important roles. For
example, in the newly discovered RNA genetic control elements called
"riboswitches", the box domain is known to be highly conserved among various
bacterial species in both its nucleotide composition and shape. However, in
non-bacterial species, shape conservation is likely to be more important
than sequence conservation when searching for riboswitch patterns. For this
purpose, we present an approach tailored for detecting RNA shape similarities.
We extend the Structure to String (
