Abstract
Letter to the Editor on Wang M, Wang Q, Wang Z, Zhang X, Pan Y. The molecular evolutionary patterns of the insulin/FOXO signaling pathway.
Wang et al have recently analyzed the pattern of selective pressures acting on genes along the insulin/FOXO pathway of vertebrates. 1 In this study, which is similar to those that we and other authors performed in the Insulin/TOR (IT) signal transduction pathway,2–5 the authors questioned some of our previous findings. Here we revise previous analyses of the pattern of selective constraints along the IT and other signal transduction pathways and discuss a number of issues that might account for the apparently different results obtained for the IT pathway by the different groups.
Alvarez-Ponce et al
2
analyzed the selective pressures acting on genes involved in the
In order to evaluate whether the correlation observed in
Genes analyzed by Alvarez-Ponce et al,2,3 Jovelin and Phillips, 5 and Wang et al. 1
Number of steps required to transduce the signal from the insulin or IGF1 receptors to each protein in the pathway. The same numbering as in ref. 2 is used. In the other works, a consistent numbering is used;
gene families not included in network-level analyses, as they do not directly participate in the main signaling cascade;
Wang et al 1 included in their analyses genes encoding pyruvate dehydrogenase kinases 1–4 and not the 3-phosphoinositide dependent protein kinase-1.
Noticeably, a similar distribution of selective constraint levels has been observed in the yeast HOG signal transduction pathway, whose downstream genes are the most selectively constrained. 15 Indeed, this appears to be the most common distribution of levels of selective constraint across signal transduction cascades: analysis of the global human signal transduction network has shown that, on average, genes acting at the downstream part of the network tend to be more selectively constrained than those occupying more highly hierarchical positions. 16
In their recent study in vertebrates, Wang et al 1 used a gene set that only partially overlaps with that used in our 2011 analysis. Their gene set encompasses a total of 27 genes (representative of 9 gene families) acting at the upper part of the IT pathway (the insulin/FOXO pathway), of which only 15 (representative of 5 gene families) were also used in our study (Table 1). 3 Consistent with previous observations,2–5 they also detected a negative significant correlation between ω estimates and pathway position, even when potentially confounding factors were controlled for. However, they noticed that statistical significance vanished when genes encoding insulin, IGF1, and IGFBP1–6 (genes not included in any of the previous analyses2–5) were removed from their analysis. From this observation, the authors conclude that the distribution of selective constraints that they observe across the pathway “is different from [our] previous similar reports.” They claim that their observation that the detected trend vanishes after removing the above-mentioned genes “demonstrates that the relaxed selective constraint at position 1 explains the purifying selection polarity along the IIF pathway,” adding that “the presented correlation [between pathway position and ω] comes from the relaxed selective constraint operated on genes of position 1, which give us an erroneous perception of truth.” 1 Wang et al's conclusions, however, are not supported by their observations, as none of the previous analyses included genes encoding insulin, IGF1, and IGFBP1–6 (genes at position 1 in their study) (Table 1).2–5 Therefore, the polarity that we had detected cannot be caused by the presence of these genes.
Furthermore, we have reanalyzed our published data sets to evaluate whether the correlation between ω and the IT pathway position previously detected vanishes when analyses are restricted to the gene families also analyzed by Wang et al (Table 1). Remarkably, when we reanalyzed our
Correlation between ω estimates and pathway position in the vertebrate insulin/TOR pathway.

Correlation between ω estimates and pathway position in the
Here we highlight the most important differences between the studies conducted by the different groups, which might account for the observed dissimilarities: quality of gene annotations, genes analyzed, and genes not involved in the main insulin signaling cascade.
Quality of Gene Annotations
In our previous analyses, we visually inspected all multiple sequence alignments to identify putative gene annotation errors (eg, unannotated exons or exons erroneously annotated in some species). Such annotation errors were, wherever possible, manually fixed using the information of the original genome sequences. After this manual improvement of gene annotations, alignment regions that remained problematic were removed from the analysis. Jovelin and Phillips 5 also manually annotated exon/intron structures. 5 Although Wang et al manually removed “unreliably aligned regions,” 1 they do not mention having corrected annotation errors through manual annotation. Because genome annotations of nonmodel organisms are mostly the output of automatic computational pipelines, they generally contain a high proportion of errors (See, for example, Tu et al 17 and Collins et al.). 18 Manual correction of such errors is very important, since ω estimates are highly sensitive to errors in the gene structure annotation (See, for example, Schneider et al.). 19
Genes Analyzed
Wang et al included only 27 genes in their analysis (representative of 9 gene families), all of which act at the upper part of the vertebrate IT pathway, whereas in our vertebrate analysis we used a total of 72 genes (belonging to 23 families), with only 15 (5 families) being shared between both datasets (Table 1).1,3 Their dataset includes genes encoding insulin, insulin-like growth factor 1 (IGF1), and IGF-binding proteins 1–6 (IGFBP1–6), which were not included in our previous analyses. Conversely, their dataset does not contain genes encoding proteins IGF1R, DOK, FRS, p110δ, PKC, Melted, GSK3, TSC1/2, Rheb, mTOR, S6K, RpS6, 4E-BP, eIF4E, Myc, eIF2B-ε, Glycogen Synthase, PTEN, and Cyh, which were included in our previous analysis (Table 1). The fact that they used a reduced dataset may have decreased the statistical power of their correlation analyses, particularly when genes encoding insulin, IGF1 and IGFBP1–6 (ie, 30% of their dataset) were removed from their analyses. Notice that trimming a small dataset often involves a dramatic reduction in the power of statistical tests, which can result in certain trends becoming statistically nonsignificant, despite their relevance.
Genes not Involved in the main Insulin Signaling Cascade
According to their report, Wang et al
1
included in their analyses 4 genes encoding 3-phosphoinositide dependent protein kinases (“position 5” in their Fig. 1, downstream of the PI3K complex and upstream of AKT1–3 proteins). However, the accession numbers listed in their additional files 1 and 2 correspond to genes encoding pyruvate dehydrogenase kinases 1–4. Although the activity of the latter enzymes is regulated by insulin, they do not act at “position 5” (in their Fig. 1); indeed, they are not directly involved in the main insulin signal transduction cascade. While insulin activates the 3-phosphoinositide dependent protein kinase-1, it inhibits pyruvate dehydrogenase kinases 1–4.20–23 The fact that the same acronym (PDK) is often used for both enzymes (even though the gene symbols are
Conclusions
In conclusion, the issues raised above might explain the observation by Wang et al 1 that the negative correlation between pathway position and ω vanishes after removing some genes from the analysis. These concerns, indeed, may have added some noise to their dataset, reducing the statistical power of their analyses. Therefore, there is no support for claiming that levels of selective constraint do not exhibit a polarity along the upstream/downstream axis of the insulin/TOR pathway.
Footnotes
Author Contributions
Conceived and designed the experiments: DA-P, MA, JR. Analyzed the data: DA-P. Wrote the first draft of the manuscript: DA-P. Contributed to the writing of the manuscript: DA-P, MA, JR. Agree with manuscript results and conclusions: DA-P, MA, JR. Jointly developed the structure and arguments for the paper: DA-P, MA, JR. Made critical revisions and approved final version: DA-P, MA, JR. All authors reviewed and approved of the final manuscript.
Funding
Work supported by grants from the Ministerio de Educación y Ciencia of Spain (BFU2010-15484 to JR and BFU2012-35168 to MA) and from the Comissió Interdepartamental de Recerca i Innovació Tecnològica of Spain (2009SGR-1287). JR was partially supported by ICREA Academia (Generalitat de Catalunya), and DA-P by a Juan de la Cierva post-doctoral fellowship from the Spanish Ministerio de Economía y Competitividad (JCI-2011-11089).
Competing Interests
Author(s) disclose no potential conflicts of interest.
Disclosures and Ethics
As a requirement of publication the authors have provided signed confirmation of their compliance with ethical and legal obligations including but not limited to compliance with ICMJE authorship and competing interests guidelines, that the article is neither under consideration for publication nor published elsewhere, of their compliance with legal and ethical guidelines concerning human and animal research participants (if applicable), and that permission has been obtained for reproduction of any copyrighted material. This article was subject to blind, independent, expert peer review. The reviewers reported no competing interests.
