Abstract
The RNA G-quadruplex (rG4) is a kind of non-canonical high-order secondary structure with important biological functions and is enriched in untranslated regions (UTRs) of protein-coding genes. However, how rG4 structures evolve is largely unknown. Here, we systematically investigated the evolution of RNA sequences around UTR rG4 structures in 5 eukaryotic organisms. We found universal selection on UTR sequences, which facilitated rG4 formation in all the organisms that we analyzed. While G-rich sequences were preferred in the rG4 structural region, C-rich sequences were selectively not preferred. The selective pressure acting on rG4 structures in the UTRs of genes with higher G content was significantly smaller. Furthermore, we found that rG4 structures experienced smaller evolutionary selection near the translation initiation region in the 5′ UTR, near the polyadenylation signals in the 3′ UTR, and in regions flanking the miRNA targets in the 3′ UTR. These results suggest universal selection for rG4 formation in the UTRs of eukaryotic genomes and the selection may be related to the biological functions of rG4s.
Get full access to this article
View all access options for this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
