Abstract
The idea of computational processes, which take place in nature, for example, DNA computation, is discussed in the literature. DNA computation that is going on in the immunoglobulin locus of vertebrates shows how the computations in the biological possibility space could operate during evolution. We suggest that the origin of evolutionarily novel genes and genome evolution constitute the original intrinsic computation of the information about new structures in the space of unrealized biological possibilities. Due to DNA computation, the information about future structures is generated and stored in DNA as genetic information. In evolving ontogenies, search algorithms are necessary, which can search for information about evolutionary innovations and morphological novelties. We believe that such algorithms include stochastic gene expression, gene competition, and compatibility search at different levels of structural organization. We formulate the increase in complexity principle in terms of biological computation and hypothesize the possibility of in silico computing of future functions of evolutionarily novel genes.
Introduction
In modern literature, the idea of computational processes, which take place in nature, is widely discussed, including the computational notion of life. 1 So, the concept of intrinsic computation, which refers to “how dynamical systems store, structure, and transform historical and special information,” is being used. 2 The concepts of molecular computation are also actively discussed and implemented.3-5 Molecular computation means the capability of organic molecules to process the information. The authors consider various kinds of molecular operations with DNA sequences as natural computational processes. DNA computing by human is based on DNA structure and on the processes, in which DNA participates in nature. 4 Based on these operations, the authors build DNA computers and create new computing paradigms.3,4
In our previous publications6,7 we introduced the notion of the search engine in the space of biological possibilities in relation to the evolutionary origin of morphological novelties. In the present paper, we will study the broader concept of biological computation and compatibility search as the mechanism of the complexity growth in evolution. We will use the terms “biological computation” and “DNA computation” to describe computations done by life, and the terms “DNA computing” and “biocomputing” in relation to computing by human. a
The Space of Biological Possibilities
The substances and structures existing in nature may be considered as realized possibilities.
Popper 8 developed a theory of propensities and the possibility space. According to this theory, the universe is a world of propensities, “an unfolding process of realizing possibilities and of unfolding new possibilities.” The possibility space is growing. All non-zero possibilities will realize themselves in time. Possibilities that have not yet realized themselves have a kind of reality. The future is actively present at every moment, as “a reality in the making.” Causation in a world of propensities is probabilistic causation, and natural laws have probabilistic character. 8 Similar ideas continue developing in the scientific literature.7,9
Based on Popper’s theory, we may formulate a concept of the space of biological possibilities, in which evolution occurs. The concept of biological possibility space is connected in some way with “omics” technologies and biological disciplines, which generate and use “big data.” We think that in the course of evolution the search for new structural entities took place in the biological possibility space. 7 The evolutionarily new gene, which does not have a function yet, may be considered as the “reality in the making.”
The Increase in Complexity Principle
The capability to increase the degree of aggregation and complexity of the structure is an intrinsic property of the matter. In our earlier publications, we considered the increase of the degree of aggregation and complexity as the fundamental principle7,10,11 although it may realize on short-term/localized scale in cosmological terms. In non-living matter, the increase in structural complexity occurs under the laws of physics (ie, the increase in complexity of the elements in the Mendeleev periodic table). The origin of life is considered a result of chemical evolution. 12 Organic molecules originate from mixtures of simple gases if the energy source is available. 13 In this way, the potential of atoms and simple molecules to generate more complex molecules was realized. Biological evolution is characterized by the further increase in the complexity of biological molecules and evolving organisms. Lamarck formulated the concept of gradation, which is the concept of the gradual evolution of the living matter from simple organization to complex and perfect organization. 14 The increase in complexity of the living matter may give a direction to evolution. 15 Pervasive and long-term evolutionary trends of morphological complexity increase have been described. 16 The increase in complexity is the basis of the concept of progressive evolution (see eg, Shanahan 17 for a review of the idea of evolutionary progress). The increase of complexity may not be adaptive, at least at the initial stages of complexification (reviewed in Kozlov 6 ).
Using Popper’s terminology, the evolution of complexity takes place in the possibility space and is determined by pre-existing structures.
The Anticipatory Role of Evolution at the DNA Level
Since the origin of the genetic code, the evolution of proteins depended on preliminary evolution at the DNA level, although the latter reckoned with regularities of protein evolution. For example, gene exons generally correspond to protein domains, and exon shuffling corresponds to domain rearrangement. 18
During progressive evolution, which is characterized by the origin of multigenic morphological novelties, evolution at the DNA level took the lead over the evolution of morphological structures.6,19-21
So, the coding of new structures may start before the origin of structures themselves. In other words, the information about non-existent structures may be computed at the DNA level. b
DNA Computation of the Information About Future Structures in Progressive Evolution
Natural computational processes participate in progressive evolution and the increase in complexity of biological structures. DNA computation plays a unique role in the progressive evolution of living organisms because the information about future structures is first computed at the DNA level.
An example of DNA computation that is going on in the immunoglobulin locus of vertebrates shows how the computations in the biological possibility space could operate during evolution.
Due to recombination in immunoglobulin locus (V(D)J-recombination) and subsequent mutations, the multitude of various immunoglobulins is created (1011 of possible combinations in humans), 22 which corresponds to the multiplicity (the possibility space) of all possible antigens, which vertebrates can meet. The next-generation sequencing (NGS) analysis of antibody diversity has detected 0.5 × 109 antibody molecules. It is noteworthy that the authors use the terminology “the observed antibody space” to describe the antibody diversity. 23
If only one, albeit complex, locus of the vertebrate genome can compute the whole diversity of antibodies and the corresponding diversity of antigens, the evolving vertebrate genome can probably compute the whole space of possibilities for the evolution of vertebrate morphological structures c and their adaptations to different environments.
Ohno 19 established the importance of the origin of evolutionarily novel genes for progressive evolution. d We suggest that the origin of evolutionarily novel genes and genome evolution constitute the original intrinsic computation of the information about new structures in the space of unrealized biological possibilities. e Due to DNA computation, the information about unrealized biological possibilities is generated and stored in DNA as genetic information.
Gene Competition, Compatibility Search, and Incompatibility Neutralization
During the individual and evolutionary development of multicellular organisms, the search for entities from the biological possibility space starts from the search in the genetic information space. f
In established ontogenies, the retrieval of genetic information is carried out by regulatory mechanisms. Many examples of such mechanisms may be found in the book “The Regulatory Genome.” 24 These mechanisms operate in the space of realized biological possibilities.
The peculiarity of the evolutionary search for new entities is that such entities do not exist as yet. In evolving ontogenies, additional search algorithms are necessary, which can search for information about evolutionary innovations and morphological novelties in the space of unrealized possibilities.
The molecular mechanisms, which exist before the origin of regulatory mechanisms, are gene competition 10 and stochastic gene expression. 25 These mechanisms could be used for evolutionary search in the genetic information space of unrealized possibilities.
The basic principle of evolutionary search algorithms is the compatibility of novel and older characters. In connection with genes, we studied this idea in Kozlov. 10 We have shown that in evolving organisms the products of evolutionarily novel genes must be compatible with the products of evolutionarily older genes. Incompatibility between novel genes, which is the consequence of gene competition, is resolved by spatio-temporal disconnection of their products. g In multicellular organisms, this is accomplished by cell differentiation and the origin of evolutionarily novel cell types, tissues and organs, and more complex ontogenies. As a result, the divergent functional organization of groups (networks) of compatible genes originated, with incompatible genes acting in different compatibility groups (see Tables 1 and 2 in Kozlov 10 ). h
Thus, the search for compatible novel genes and gene combinations is an important principle of the evolutionary search algorithm operation in unrealized genetic possibility space.6,7 The evolutionary search algorithm is similar to the binary search algorithm and produces functional genome organization of groups of compatible genes similar to the binary search tree or B-tree.
The evolution of immunoglobulin diversity demonstrates features of the evolutionary innovation search engine. V(D)J-recombination generates the diversity of immunoglobulins by random rearrangement of DNA segments. V(D)J-recombination results in the allelic exclusion of Ig genes, which determines the monospecificity of B lymphocytes and the possibility of clonal selection of antibody-producing cells. 26 Similarly, the “one neuron – one receptor” rule is observed in the olfactory system. 27 These processes could have evolved due to the search algorithm discussed above.
For the realization of the whole diversity of immunoglobulins encoded by immunoglobulin locus, the other levels or organization (cellular and multicellular) are necessary. The structure of immunoglobulin loci of amphibia and mammals are similar but the diversity of immunoglobulins in mammals is several orders of magnitude higher. This is because amphibia have a small number of cells in their immune organs (reviewed in Kozlov 7 ). So, the evolution of the diversity of immunoglobulins involves interplay between different levels of organization.
Autonomous Search Engines, Unfolding Possibility Spaces, and Unstable Transitory Forms
As we discussed earlier,6,10 processes at different levels of organization are relatively independent, and evolution proceeds due to the coincidence of relatively independent events at genomic (DNA), cellular, and multicellular levels. That is, the interplay between different levels of organization has stochastic nature and may generate relatively unstable transitory organisms. Only biologically meaningful coincidences were “frozen” in evolution (“frozen accidents” 28 ) and gave rise to species of organisms with higher complexity and new functions.
Cellular material for the evolutionary search engines in multicellular organisms could be hereditary tumor cells. We discussed this in our previous publications on the possible evolutionary role of heritable tumors (the theory of
Formulating the Increase in Complexity Principle in Terms of Biological Computation
Thus, the origin of novel organs, the development of which involves constellations of activity of thousands of genes and considerable cell masses, may be explained using the concept of biocomputational processes, as described above. Otherwise, the origin of such complex structures is difficult to imagine.
We suggest a new formulation of an old principle of the increase in complexity in living nature:
The complexity increase in progressive evolution is realized through biological computation of the maximum number of compatible structural entities in evolving lineages of multicellular organisms. Biological computation of complexity increase involves DNA computation in the space of unrealized possibilities; stochastic gene expression and gene competition; compatibility search and incompatibility neutralization at different levels of organization; autonomous search engines and unfolding possibility spaces; unstable transitionary forms and “freezing” of biologically meaningful constellations of entities, compatible within the ontogeny of multicellular organisms. The complexity of progressively evolving organisms tends to increase to a maximum i and can be measured as the number of structural entities from the biological possibility space realized within the ontogeny of the multicellular organism.
The above formulation is a description of progressive evolution and complexity increase as determined by law j and intrinsic causality, or as self-development of biological matter, k provided that the energy and the material source is available. Darwinian natural selection for adaptation starts when the environment is included in the compatibility search.
The above formulation is a biocomputational description of complexity increase during progressive evolution, which expands the computational notion of life and makes important predictions.
Predictions of the Biocomputational Concept of Complexity Increase
After the origin of the genetic code, the evolution of DNA determined the evolution of the living matter. That is why the existing informational network of DNA is not only the storage of information about the previous periods of evolutionary development. It also contains information about the future pathways of evolutionary development, which can be studied by bioinformatic approaches with the help of supercomputers.
We can perform in silico computing of future functions of evolutionarily novel genes (in silico evolution), which are in the stage of their origin (Popper’s reality in the making).
Tumor specifically expressed, evolutionarily novel genes (
After computing the future functions of human
Complexity Increase and Supremacy of Biocomputing
The purpose of this paper was to describe complexity increase in the course of progressive evolution in terms of biological computation. Nevertheless, we cannot miss the possible computer science application of the above considerations.
Similar to the idea of quantum computing supremacy, the notion of cellular supremacy in biocomputing is being considered. 5 In article, 5 the authors were looking for supremacy of biocomputing over traditional computers. They were looking for tasks where biocomputing supremacy is possible, and for features of biological systems that could be used for supremacy, focusing mostly on the cellular (bacterial) biocomputational processes. Our study extends the biocomputing supremacy considerations to complexity increase in the course of the progressive evolution of multicellular organisms. Progressive evolution resulted in the enormous increase of complexity and functional efficiency of progressively evolving organisms. The biological complexity increase demonstrates the features of biological computation which may be used to reach supremacy: DNA computation of the information about new structures; multilevel structural organization and interplay between different levels of organization; independent evolutionary development at different levels; stochastic mechanisms and “frozen accidents”; compatibility search and neutralization of incompatibilities at different levels of structural organization; autonomous search engines and unfolding possibility spaces; and unstable transitional structures. That is why our findings may be complementary to Grozinger et al 5 l and our approach may be useful in studying the supremacy of biocomputing problems.
Conclusion
The origin of novel organs during the progressive evolution of multicellular organisms may be explained using the concept of biocomputational processes in the biological possibility space. The increase in complexity principle formulated in terms of biological computation predicts that information about the future pathways of evolutionary development can be studied by bioinformatic approaches with the help of supercomputers. Biological complexity increase demonstrates features of biological computation that may be used to reach supremacy in biocomputing.
Footnotes
Acknowledgements
The author thanks anonymous reviewers for helpful comments.
Funding:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was funded by the Ministry of Science and Higher Education of the Russian Federation under the strategic academic leadership program ‘Priority 2030’ (Agreement 075-15-2021-1333 dated 30 September 2021).
Declaration of Conflicting Interests:
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Author Contributions
AP Kozlov is an only author.
