Abstract
Life on Earth builds genetically encoded proteins by using a standard alphabet of just 20 L-α-amino acids, although many others were available to life's origins and early evolution. To better understand the causes of this foundational evolutionary outcome, we extend previous analyses which have identified a highly unusual distribution of biophysical properties within the set used by life. Specifically, we use a heuristic search algorithm to identify other sets of amino acids, from a library of plausible alternatives, that emulate life's signature. We find that a subset of amino acids seems predisposed to forming such sets. We present other examples of such alphabets under various assumptions, along with analysis and reasoning about why each might be simplistic. We do so to introduce the central, open question that remains: while fundamental biophysics related to protein folding can potentially reduce a library of 1054 possible amino acid alphabets by 7 orders of magnitude, the framework of assumptions that does so leaves a further 1045 possibilities. It is therefore tempting to ask what additional assumptions can further reduce these 45 orders of magnitude? We thus conclude with a focus on library and alphabet construction as a useful target for subsequent research that may help future science speak with more confidence about what an alien amino acid alphabet would look like and why.
Get full access to this article
View all access options for this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
