Abstract
Kindlin proteins represent a newly discovered family of evolutionarily conserved FERM domain-containing proteins. This family includes three highly conserved proteins: Kindlin-1, Kindlin-2 and Kindlin-3. All three Kindlin proteins are associated with focal adhesions and are involved in integrin activation. The FERM domain of each Kindlin is bipartite and plays a key role in integrin activation. We herein explore for the first time the evolutionary history of these proteins. The phylogeny of the Kindlins suggests that a single ancestral Kindlin protein was present in even the earliest metazoans, ie, hydra. This protein then underwent duplication events in insects and also experienced genome duplication in vertebrates, leading to the Kindlin family. A comparative study of the Kindlin paralogs showed that Kindlin-2 is the slowest evolving of the three family members. The analysis of synonymous and non-synonymous substitutions in orthologous Kindlin sequences from different species showed that all three Kindlins have been evolving under the influence of purifying selection. The expression pattern of Kindlins, together with the phylogenetic studies, supports the subfunctionalization model of gene duplication.
Introduction
The Kindlins represent a class of focal adhesion proteins implicated in integrin activation. They comprise three evolutionarily conserved members, Kindlin-1 (FERMT,
Loss-of-function mutations in Kindlin-1 and Kindlin-3 cause Kindler syndrome and leukocyte adhesion deficiency-III syndrome, respectively. Kindler syndrome was in fact the first human genetic disorder clinically associated with Kindlins. It is caused by mutation of Kindlin-1 and is characterized by skin blistering, severe periodontitis and poikiloderma.8,9 No human disease has yet been associated with Kindlin-2 gene pathology; however, Kindlin-2 knockout mice die at an early embryonic stage, indicating the essential role it plays in development.
The expression patterns of the three Kindlins are quite distinct. For instance, Kindlin-1 is predominantly expressed in the epidermis and only weakly expressed in the dermis, while Kindlin-3 expression is restricted to hematopoietic tissues, where it is the dominant Kindlin expressed. Kindlin-2, on the other hand, is expressed ubiquitously throughout most of the body. 10
These differential expression patterns may in part explain the distinctive phenotypes that result from the loss of different Kindlins. For instance, as noted above, the Kindlin-2 homolog in
Kindlin-1 shares 62% sequence similarity with Kindlin-2 and 49% with Kindlin-3. Until now, the evolutionary aspects of Kindlin structure and function have not been addressed. In this study, we explored the evolution and divergence of these proteins in vertebrates and invertebrates. We studied the natural forces shaping the evolution of the Kindlin family of proteins by comparing different evolutionary trends in different vertebrate clades. A phylogenetic analysis of three Kindlin family members illustrates the phylogenetic history of Kindlin paralogs and also documents the duplication events leading to the formation of these paralogs from one single ancestral Kindlin. We show that the original Kindlin arose at least as early as simple metazoans, such as hydra. We also explored the effect of functional constraints on the evolution of these three paralogs in vertebrates and found evidence that purifying selection is a major force shaping the evolution of Kindlins.
Materials and Methods
Relative levels of Kindlin-1, Kindlin-2 and Kindlin-3 transcripts were determined by real-time RT-PCR using SYBR Green. Human tissue cDNA panels (BD Biosciences) were used as templates. Triplicate samples of each PCR mixture, each containing 4.7 µl of POWER SYBR Green PCR master mixture (Applied Biosystems), 0.3 µl of a 10 pmol/µl primer mixture, 0.3 µl of cDNA, and water to a total volume of 10 µl, were transferred into a 96-well plate and run on an ABI 7500 Fast Real Time PCR System (Applied Biosystems). The samples were initially incubated at 95°C for 3 min, followed by 45 cycles of 95°C for 15 s and 60°C for 60 s. Dissociation curves were generated after each PCR run to ensure that a single, specific product was amplified. The results were analyzed with the comparative cycle threshold (Ct) method. For normalization, we used the expression level of β-actin (ACTB). The PCR primers are shown in Table 1.
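The comparative Ct calculation can be illustrated with a minimal Python sketch. It assumes the standard 2^-ΔΔCt form of the method with roughly 100% amplification efficiency; the function names and the Ct values in the usage note are hypothetical and are not taken from this study.

```python
from statistics import mean


def delta_ct(ct_target, ct_reference):
    """Mean Ct of the target gene minus mean Ct of the reference gene (e.g. ACTB)."""
    return mean(ct_target) - mean(ct_reference)


def relative_expression(ct_target, ct_reference):
    """Expression of the target relative to the reference gene: 2^-(delta Ct)."""
    return 2.0 ** -delta_ct(ct_target, ct_reference)


def fold_change(sample, calibrator):
    """Fold change of a sample versus a calibrator tissue: 2^-(delta delta Ct).

    Each argument is a pair (ct_target_replicates, ct_reference_replicates).
    """
    ddct = delta_ct(*sample) - delta_ct(*calibrator)
    return 2.0 ** -ddct
```

For example, with hypothetical triplicates, `relative_expression([25.0, 25.5, 24.5], [20.0, 20.0, 20.0])` corresponds to a ΔCt of 5, ie, a target transcript level 32-fold below that of the reference gene.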
Primers used for real-time PCR.
In order to explore the evolutionary history of Kindlins, sequences of the complete transcripts and the corresponding protein sequences of all three Kindlins (Kindlin-1, Kindlin-2 and Kindlin-3) from different species were extracted from the NCBI (http://www.ncbi.nlm.nih.gov) and Ensembl (http://www.ensembl.org) genome browsers (Table 2). After alignment using the CLUSTALW program, 15 all positions containing gaps and missing data were eliminated because of the possible ambiguity of the alignments. This stringent approach reduced the risk of misinterpretation. The evolutionary history was inferred using the maximum likelihood (ML) method as implemented in the TREEFINDER (TF) program package. 16 The TF support values indicate the reliability of internal branches. The analyses of amino acid sequences were performed using the WAG2000 model 17 applying eight classes of rate heterogeneity among sites (8F). The ML analysis involved 26 amino acid sequences. A Neighbor Joining (NJ) analysis 18 of the amino acid sequences was performed with the MEGA 4 program. 19 The reliability of the NJ tree was estimated by the bootstrap method, based on 1000 pseudo-replicates.
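The elimination of gapped positions described above can be sketched in a few lines of Python. This is only an illustration of the complete-deletion idea, not the procedure of any particular program; the function name is hypothetical, and treating `?` and `X` as missing data is an assumption.

```python
def strip_gap_columns(alignment, missing="-?X"):
    """Remove every alignment column in which any sequence has a gap or missing residue.

    alignment: dict mapping sequence name -> aligned sequence (all of equal length).
    missing:   characters treated as gaps or missing data (assumed set).
    """
    if not alignment:
        return {}
    names = list(alignment)
    seqs = [alignment[name] for name in names]
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    # Keep only the columns that are fully resolved in every sequence.
    keep = [i for i in range(len(seqs[0]))
            if all(s[i] not in missing for s in seqs)]
    return {name: "".join(s[i] for i in keep) for name, s in zip(names, seqs)}
```

Complete deletion discards any column with a single gap, so it trades alignment length for unambiguous homology across all sequences.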
Names and IDs of peptides and transcripts of Kindlin genes.
A variety of methods were employed to explore the functional constraints shaping the evolution of Kindlins. Pairwise comparison of the number of synonymous nucleotide substitutions per synonymous site (dS) and non-synonymous nucleotide substitutions per non-synonymous site (dN) was carried out using the Nei-Gojobori method. 20
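As a rough illustration of the final step of the Nei-Gojobori calculation, the Python sketch below converts observed proportions of synonymous and non-synonymous differences into dS and dN with the Jukes-Cantor correction and then takes their ratio. It assumes that the numbers of sites and differences have already been counted from a codon alignment; the function names and the example counts are hypothetical.

```python
import math


def jukes_cantor(p):
    """Jukes-Cantor corrected distance for an observed proportion of differences p."""
    if not 0 <= p < 0.75:
        raise ValueError("proportion of differences must lie in [0, 0.75)")
    if p == 0:
        return 0.0
    return -0.75 * math.log(1 - 4 * p / 3)


def dn_ds(syn_diffs, syn_sites, nonsyn_diffs, nonsyn_sites):
    """Return (dN, dS, omega) from pre-counted differences and sites."""
    d_s = jukes_cantor(syn_diffs / syn_sites)
    d_n = jukes_cantor(nonsyn_diffs / nonsyn_sites)
    # The ratio is undefined when no synonymous change is observed.
    omega = d_n / d_s if d_s > 0 else float("nan")
    return d_n, d_s, omega
```

For instance, `dn_ds(30, 300, 9, 900)` (hypothetical counts) gives a ratio well below 1, the pattern expected for a gene under purifying selection.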
In addition to pairwise methods, the dN/dS ratio in different branches of the maximum-likelihood tree was estimated using the codon-based genetic algorithm implemented in the GA-BRANCH program available at the Datamonkey server (http://www.datamonkey.org/help/GABranch.php). This approach assigns each branch to an incrementally estimated class of dN/dS ratios without requiring a specification of the branches
Evolutionary distance between all possible pairs of Kindlin paralogs was estimated by Tajima's relative rate test. 22 Each pair of paralogs was compared with the amphioxus protein sequence taken as an out-group. MEGA 4 software was used for this analysis. 19
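The logic of Tajima's relative rate test can be sketched as follows. The Python function below counts the sites at which only one of the two compared sequences differs from the other two and evaluates the resulting chi-square statistic with one degree of freedom. It is a minimal illustration rather than the MEGA implementation; the function name is hypothetical, and skipping positions containing `-`, `?` or `X` is an assumption.

```python
import math


def tajima_relative_rate(seq_a, seq_b, outgroup, gaps="-?X"):
    """Tajima's relative rate test on three aligned sequences.

    Returns (m_a, m_b, chi2, p_value), where m_a and m_b are the numbers of
    sites unique to seq_a and seq_b, respectively.
    """
    if not (len(seq_a) == len(seq_b) == len(outgroup)):
        raise ValueError("sequences must be aligned to the same length")
    m_a = m_b = 0
    for a, b, o in zip(seq_a, seq_b, outgroup):
        if a in gaps or b in gaps or o in gaps:
            continue  # ignore positions with gaps or missing data
        if a != b and b == o:
            m_a += 1  # change unique to the lineage of seq_a
        elif a != b and a == o:
            m_b += 1  # change unique to the lineage of seq_b
    total = m_a + m_b
    if total == 0:
        return m_a, m_b, 0.0, 1.0
    chi2 = (m_a - m_b) ** 2 / total
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return m_a, m_b, chi2, p_value
```

Under equal rates the two counts are expected to be the same, so a significantly larger count for one paralog indicates that it has diverged faster from the out-group than the other.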
Results and Discussion
Expression pattern of Kindlins
Real-time PCR analyses revealed distinct patterns of Kindlin expression. As expected, Kindlin-2 was expressed in all six of the tissues tested, while Kindlin-1 showed one hundred-fold lower expression in each of these tissues, with the exception of human kidney, where its expression level was low but significant. Kindlin-3 showed detectable expression only in leukocytes, where it was moderately expressed (Fig. 1). These results agree well with existing data on the expression patterns of Kindlin proteins, which also show the ubiquitous nature of Kindlin-2 expression and the tissue-specific expression of both Kindlin-1 and Kindlin-3. 10

The expression profile of all three Kindlin paralogs was quantified by real-time RT-PCR using SYBR Green. Data are presented as the relative expression of Kindlins (fold change) normalized by a housekeeping gene, β-actin.
Phylogeny
Phylogenetic analysis was carried out on the amino acid sequences of Kindlin proteins, with the phylogenetic tree rooted by orthologous genes from invertebrate species (Fig. 2). The phylogenetic tree was calculated using the maximum likelihood method. Notably, an NJ analysis of the same data predicted the same topology (Fig. 3). The resulting phylogenetic tree is very well supported for most branches, except for the deep divergences of Branchiostoma floridae and Strongylocentrotus purpuratus and the branch leading to the grouping of Xenopus laevis Kindlin-2 with the mammals. Otherwise, the vertebrate relationships for each Kindlin ortholog are resolved in a manner consistent with previous phylogenomic analyses, 23 indicating a reliable evolutionary reconstruction among the Kindlin families.

The evolutionary history of the Kindlin protein family inferred by using the Maximum likelihood method.

The evolutionary history of the Kindlin protein family inferred by using the neighbor joining method.
The resulting phylogenetic tree suggests an interesting evolutionary history for Kindlin family proteins. The phylogeny exhibits a topology of the form (A)(BC), ie, Kindlin-1 and Kindlin-3 form a cluster while Kindlin-2 forms an out-group. It appears that a single ancestral Kindlin, present in invertebrates before the divergence of arthropods, had its ancient origin in hydra. This gene then underwent a lineage-specific duplication event in insects, giving rise to two Kindlin paralogs. Although no study exists to date on the roles of Kindlins in insects, it is plausible to assume that, because of the diversity found in insects, the duplicated copies were maintained in response to selection pressures impacting this highly diverse group. One of the two Kindlins was lost in other higher phyla before the origin of vertebrates, ie, echinoderms (Strongylocentrotus purpuratus), urochordates (Ciona intestinalis), cephalochordates (Amphioxus) and hemichordates (Saccoglossus kowalevskii). However, the remaining single Kindlin gene copy underwent two duplication events in vertebrates that may have occurred together with two rounds of genome duplication. Regardless of the exact mechanism, these events gave rise to three Kindlin paralogs: Kindlin-1, Kindlin-2 and Kindlin-3. More data are still required to determine whether these Kindlin gene duplication events in fish were due to whole genome duplications or individual segmental gene duplications.
Whatever the cause of this duplication, it is evident from our analyses that the three vertebrate Kindlin paralogs originated after duplication events in the fish genome and were maintained in all subsequent vertebrate forms, probably due to the selection pressures that have promoted the diverse and complicated morphological and physiological properties of vertebrates.24–27 It is important to note that the branch lengths of Kindlin-2 are quite short compared with those of both Kindlin-1 and Kindlin-3. These very short branch lengths indicate that Kindlin-2 has experienced a slower evolutionary rate than its paralogs. This result is congruent with existing experimental studies on Kindlin genes.9,10,13,14 For instance, Kindlin-2 is a ubiquitously expressed gene playing structural and functional roles in a broad array of tissues. 10 Various studies have indicated that ubiquitously expressed genes tend to evolve more slowly than those with tissue-specific expression. 28 As noted previously, Kindlin-1 and Kindlin-3 are expressed in specific tissues (epithelial tissues and the hematopoietic system, respectively) (Fig. 1), and it is therefore consistent that the evolutionary rate of these paralogs is much higher than that of Kindlin-2. Further support comes from knockout studies of all three Kindlins, which show that Kindlin-2 knockout mice die during early embryogenesis. 29 Set against the milder phenotypes associated with loss of Kindlin-1 and Kindlin-3, these studies support the idea that Kindlin-2 is under tighter functional constraint than Kindlin-1 and Kindlin-3.
Estimation of the selective forces shaping the evolution of Kindlin proteins
Comparison between non-synonymous and synonymous substitutions in orthologous transcript sequences can reveal the selective pressure that shapes the evolution of these genes. The ratio of non-synonymous substitutions per non-synonymous site to synonymous substitutions per synonymous site, dn/ds or omega (ω), indicates whether a gene is evolving under adaptive selection, neutrally, or under purifying selection. A value of ω greater than 1 suggests that the gene is under positive selection. A value close to 1 suggests that the gene is experiencing neutral evolution. A value of less than 1 indicates that the gene is under the influence of negative or purifying selection. The Nei-Gojobori method we employed to estimate the dn/ds ratio clearly showed ω values below one (ie, dn/ds < 1) for all three Kindlin paralogs in all of the vertebrate species compared in this analysis (Table 3). Interestingly, the ω value for Kindlin-2 was much lower (at least ten-fold) than the ω values for either of the other two Kindlins. Similarly, the analysis within and between mammalian and non-mammalian groups showed the same pattern observed in the individual pairwise comparisons between species, ie, the absence of any positive selection in all groups. However, ω values were slightly higher in non-mammals than in mammals. This difference may result from the high substitution rate often seen in fish genomes, leading to higher dn/ds ratios. Kindlin-2 in both mammals and non-mammals showed values of ds similar to those of Kindlin-1 and Kindlin-3, but with much lower values of dn, thereby producing greatly lowered values of ω (Table 4).
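The within-group and between-group averages mentioned above can be computed from pairwise estimates as in the following Python sketch. The data layout, function name, species labels and numerical values are all hypothetical and serve only to show the bookkeeping.

```python
from itertools import combinations
from statistics import mean


def group_means(pairwise, groups):
    """Average a pairwise statistic (e.g. dn or ds) within and between groups.

    pairwise: dict mapping frozenset({species_1, species_2}) -> value.
    groups:   dict mapping species -> group label (e.g. "mammal").
    Returns a dict keyed by a sorted pair of group labels.
    """
    buckets = {}
    for a, b in combinations(sorted(groups), 2):
        key = frozenset((a, b))
        if key not in pairwise:
            continue  # no estimate available for this species pair
        label = tuple(sorted((groups[a], groups[b])))
        buckets.setdefault(label, []).append(pairwise[key])
    return {label: mean(values) for label, values in buckets.items()}
```

A key such as `("mammal", "mammal")` holds the within-group average, while `("mammal", "non-mammal")` holds the between-group average.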
Estimation of dn/ds values for Kindlin orthologs.
Average Ka (dn) and Ks (ds) values within and between mammalian and non-mammalian lineages for Kindlin orthologs.
To gain further insight into the lineage-specific nature of the selective pressures acting on each branch of the phylogenetic tree, we performed a genetic algorithm (GA)-branch analysis. The GA-branch method is an alternative to the branch-site method and, unlike it, does not require the manual selection of branches of interest to identify evidence for positive or negative selection. Because the GA-branch method neither requires the user to select branches of interest nor tests one branch at a time, it experiences reduced statistical instability while also offering improved interpretability for poorly supported models. It achieves this by mining the data for good-fitting models. In addition, inferences based on multiple models (as opposed to a null-alternative pair) are less vulnerable to model misspecification. In our study, GA-branch analysis selected a model with five classes of ω. In total, 72% of branches were assigned to a ω of 0.037, designated Class D, with the remaining 28% of branches assigned to four additional classes, designated A, B, C and E, with ω values of 0.102, 0.050, 0.037 and 0.026, respectively. None of the branches studied showed any trend toward positive selection, with the probability of positive selection being 0% for each branch of the tree. Notably, here again the very low value of ω (0.013) for Kindlin-2 indicates that it is evolving under the influence of much stronger negative selection than Kindlin-1 and Kindlin-3 (Fig. 4 and Table 5).
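One common way to draw inferences from multiple models, as described above, is to weight each candidate model by its information-criterion score and average the branch-wise ω estimates. The Python sketch below illustrates this with Akaike weights. Whether the GA-BRANCH program computes its estimates in exactly this way is an assumption here, and the function names and inputs are hypothetical.

```python
import math


def akaike_weights(aic_scores):
    """Convert AIC-type scores (lower is better) into normalized model weights."""
    best = min(aic_scores)
    raw = [math.exp(-(score - best) / 2) for score in aic_scores]
    total = sum(raw)
    return [r / total for r in raw]


def model_averaged_omega(aic_scores, omegas_per_model):
    """Model-averaged omega and probability of omega > 1 for each branch.

    omegas_per_model: one list per model, giving the omega assigned to each branch.
    Returns (averaged_omegas, prob_positive_selection), one entry per branch.
    """
    weights = akaike_weights(aic_scores)
    n_branches = len(omegas_per_model[0])
    averaged = [sum(w * model[b] for w, model in zip(weights, omegas_per_model))
                for b in range(n_branches)]
    # Summed weight of the models that place the branch in a class with omega > 1.
    prob_positive = [sum(w for w, model in zip(weights, omegas_per_model)
                         if model[b] > 1)
                     for b in range(n_branches)]
    return averaged, prob_positive
```

Under this scheme, a branch receives a 0% probability of positive selection when no credible model assigns it to a class with ω above 1.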

Lineage-specific analysis of selective pressure in vertebrate Kindlins. A cladogram is shown with maximum-likelihood estimates of lineage-specific
Lineage-specific dn/ds values for Kindlins.
In short, whether calculated by pairwise comparison or by lineage-specific analysis, dn/ds ratios consistently indicate that vertebrate Kindlins have been evolving under the influence of purifying selection, with Kindlin-2 under much stronger negative selection than Kindlin-1 and Kindlin-3. Functional studies on Kindlins also strongly support this trend. For instance, if adaptive selection rather than purifying selection were the main force behind Kindlin divergence, we would expect the functional roles of Kindlins to be diverse. However, such diversity is not evident from the data available on Kindlin function. In fact, the hallmark function of Kindlins is integrin activation, and all the other higher order functions associated with Kindlins, including cell migration, cell spreading, cell adhesion, cellular signaling and cancer promotion, are associated with the ability of Kindlins to activate integrins. Moreover, these higher order processes impacted by Kindlins through integrin activation are so essential for organism survival and viability that very strong functional constraints exist over Kindlin evolution. The more stringent functional constraint on Kindlin-2 compared to either of its two paralogs is likely due to the fact that it is expressed ubiquitously in the body, while Kindlin-1 and Kindlin-3 are tissue specific.
Divergence rate between paralogs
To study the evolutionary distances between the different Kindlin paralogs, Tajima's relative rate test was employed. This test involves pairwise comparison of protein sequences from the Kindlin paralogs of each species, using the orthologous Kindlin sequence from amphioxus (Branchiostoma floridae) as an out-group (Table 6). These tests produced intriguing results which, in agreement with the phylogenetic data, show that Kindlin-2 has undergone relatively limited divergence compared to both Kindlin-1 and Kindlin-3, of which Kindlin-3 appears to be the most divergent. Interestingly, it seems that Kindlin-2 is the representative of the original ancient Amphioxus Kindlin, which underwent two duplications in fish, giving rise to Kindlin-1 and Kindlin-3. There are two important reasons to believe that Kindlin-2 is the representative of the unduplicated ancestral Kindlin gene, whose duplication in vertebrates gave rise to the two other paralogs. First, the Tajima test clearly shows that Kindlin-2 is closest to Amphioxus Kindlin, with very low levels of divergence evident compared to Kindlin-1 and Kindlin-3. Second, Kindlin-2 is not only a ubiquitously expressed protein but also the only Kindlin protein expressed in embryonic stem cells. Although no study has been conducted to explore the expression pattern of Kindlins in Amphioxus,
Tajima's relative rate test for the comparison of evolutionary distance between Kindlin paralogs in different species using Amphioxus BRAFLDRAFT_285279 as an out-group.
The high divergence of Kindlin-1 and Kindlin-3 relative to Kindlin-2 can be explained by analyzing the expression patterns of the proteins. 10 During evolution from Amphioxus to higher vertebrates, the expression patterns of the two paralogs have diverged. Our analysis of Kindlin expression patterns in human tissue samples clearly shows that, unlike Kindlin-2, which is expressed ubiquitously, both Kindlin-1 and Kindlin-3 present tissue-specific expression patterns (Fig. 1). Similarly, Ussar et al have shown that Kindlin-1 and Kindlin-3 are expressed predominantly in epithelial tissues and the hematopoietic system, respectively. 10 The distinct expression patterns of all three Kindlins in vertebrates thus support the subfunctionalization model of gene duplication. 31 It seems likely that the function of the ancestral unduplicated Kindlin was subfunctionalized in vertebrates, in part due to the divergence of Kindlin expression location. Thus, while Kindlin-1 and Kindlin-3 are expressed exclusively in epithelial and hematopoietic tissues, respectively, Kindlin-2, being the representative of the ancestral Kindlin gene, is expressed in a variety of tissues, in a pattern less ubiquitous than that of the original unduplicated Kindlin gene. Ultimately, it seems that this subfunctionalization of Kindlin expression patterns may have provided a degree of selective advantage associated with the diversification of higher order functions performed by integrins in various tissues.
Conclusion
Kindlins represent a recently identified family of integrin-interacting proteins. They play an important role in cell migration, cell spreading and cancer progression through a core molecular mechanism of integrin activation. In this study we have shown that the ancestral Kindlin gene was an unduplicated single gene found in organisms as primitive as hydra. This gene then underwent a lineage-specific duplication in insects, giving rise to two Kindlin paralogs, while the three Kindlin paralogs found in vertebrates are the result of duplication events that occurred in fish. Of the three Kindlins, Kindlin-2 has undergone the least evolutionary divergence, probably due to the stringent functional constraints associated with its virtually ubiquitous expression in body tissues and especially in embryonic stem cells. On the other hand, both Kindlin-1 and Kindlin-3 showed significantly greater divergence as a result of significantly weaker functional constraints, possibly resulting from their subfunctionalization into very specific portions of the body. The comparison of synonymous to non-synonymous substitutions, by both a pairwise method and a lineage-specific method, also indicates that all three Kindlins have been evolving under strong negative selection.
Author Contributions
AAK conceived the idea. AAK, AJ, TS performed the analysis. AAK, AJ, TS and HZ analyzed the data. AAK, AJ and HZ wrote the paper.
Disclosure
This manuscript has been read and approved by all authors. This paper is unique and is not under consideration by any other publication and has not been published elsewhere. The authors and peer reviewers of this paper report no conflicts of interest. The authors confirm that they have permission to reproduce any copyrighted material.
Acknowledgments
This work was supported by grants to H.Z. from The Swedish Cancer Foundation, The Swedish Society of Medicine, The Swedish Research Council, The Karolinska Institute, The Nature Science Foundation of China grant 30830048, the Basic Research Project of the Ministry of Science and Technology of China (973) 2010CB912203 and 2010CB529402, and the Peking University 985 Project, to SS from The Center for Biosciences at Karolinska Institutet and to AAK from the Higher Education Commission of Pakistan. We are thankful to Beston Nore for his helpful suggestions and to John Lock for reading and editing the manuscript.
