Abstract
Numerous risk factors for heart disease or dementia harbor over 10% valine plus glycine content. Interestingly, TDP-43 contains 6.0% valine and 13.3% glycine, and the buildup of this protein in the brains of patients with limbic-predominant age-related TDP-43 encephalopathy has dire consequences. The two γ-methyl groups in valine enable hyperconjugation, which enhances the van der Waals interaction between its side group and the carbonyl carbon. This extends the C=O bond length, and this weakened C=O bond augments the secondary chemical bonding of the carbonyl oxygen atom to cations. This, in turn, promotes the formation and buildup of insoluble and rigid salts such as calcium oxalate, which is postulated to be a major cause of heart disease. Similarly, the long C=O bond length in glycine results in a weakened C=O bond with an enhanced affinity toward cations and the formation of insoluble salts. Further, several prion proteins possess a high glycine content of approximately 20%. The insoluble calcium salts produced may promote aggregate formation via secondary chemical bonding between calcium and glycine, as well as between calcium and valine. Chemical and biochemical insights will help us to better understand the etiology of disorders linked to protein aggregates.
Keywords
Introduction
The repeated failures in drug development for Alzheimer’s disease and dementia suggest that additional toolkits are needed in addition to the interrogation of biochemical pathways and disease pathology. Limbic-predominant age-related TDP-43 encephalopathy (LATE) is a common disease of the “oldest-old.” 1 Many of the aggregated proteins that are causal factors of heart disease or Alzheimer’s disease possess over 10% valine plus glycine (V + G) content. Methyl groups can be either electron-donating or electron-withdrawing, depending on the local milieu. The two γ-methyl groups in valine enable σ-σ hyperconjugation and electron delocalization. The increased electron density in Cβ-H enhances potent van der Waals interactions between the side group and carbonyl carbon. 2 This strengthened secondary bonding with the carbonyl carbon results in extension of the C=O bond length, 3 and this weakened C=O bond then augments the secondary chemical bonding of the carbonyl oxygen atom to cations, particularly divalent cations. 2 This attribute of valine residues enables its affinity toward calcium and enhances the formation of insoluble and rigid calcium salts such as calcium oxalate, which has been postulated as a major cause of heart disease. 2 Because ethanol and acetic acid are structurally similar to oxalate, red wines reduce the risk of heart disease and dementia and can extend one’s lifespan, possibly via the inhibition of oxalate generation (Figure 1). 4 In older people who have attenuated respiratory chain activities, the Krebs cycle and its shunt produce excess protons and organic acids such as oxalate (Figure 1). Alzheimer’s disease and cancer are mutually protective. This can be explained by calcium supplements substantially reducing cancer risk, whereas local strong acids help dissolve insoluble and rigid calcium oxalate and other insoluble salts. 5 The C=O bond length in glycine is slightly longer than its counterpart in valine, 3 and this weakened C=O bond enhances the formation of insoluble salts, triggering aggregation. Some prion proteins possess around 20% glycine content. Interestingly, TDP-43 contains 6.0% valine and 13.3% glycine, and the buildup of this protein in patients with LATE has negative effects on the brain. The primary structure of TDP-43 is bipartite. The C-terminal fragment of TDP-43 is particularly glycine-rich at 27.0% while being poor in basic amino acids at 2.8% (Table 1). The N-terminal fragment of TDP-43 possesses 15.0% basic amino acids and 15.4% V+G residues (Table 1). The higher-order structure of TDP-43 can bring positively charged basic amino acids, glycine, and valine together (Figures 2 and 3),6–8 enhancing the formation of calcium oxalate and, in turn, the formation of aggregates via secondary chemical bonding between calcium (in the form of insoluble salts) and glycine, and between calcium and valine. Other protein risk factors for LATE also possess relatively high V + G content, possibly contributing to the formation of insoluble and rigid salts such as calcium oxalate. Table 2 shows the glycine, valine, and basic amino acid content of the four causative factors of Alzheimer’s disease, indicating that the V + G percentage is also high in these proteins. Amino acid polymorphisms of causative factors promoting the traffic of either divalent cations or negatively charged oxalate can have a significant impact on neurodegenerative diseases. For example, the ApoE4 polymorphism confers high risk for Alzheimer’s disease and atherosclerosis, with homozygous carriers (arg/arg) having a higher risk than heterozygous carriers. 9 The positively charged arginine residues in the ApoE4 polymorphic site enhance the traffic of anions such as oxalate, consequently augmenting the formation of calcium oxalate.

Biochemical pathway leading to the generation of oxaloacetate and oxalate.
Amino acid content of the TDP-43 protein.

The distribution of valine and glycine residues on the TDP-43 protein. No complete protein structure is available for TDP-43; therefore, the protein structure of its three regions are shown separately. (a) Residues 1 to 77, PDB 2N4P; 6 (b) residues 102 to 269, PDB 4BS2; 7 (c) residues 286 to 331, PDB 6N3C; 8 which together cover most of TDP-43 polypeptide. The valine and glycine residues are depicted in blue and red, respectively. In (a) and (b), the protein regions were rendered in surface representation, with the front and back faces shown. In (c), the structure is shown in the ribbon representation and only one view angle is provided.

The distribution of the valine, glycine, and basic amino acid residues on the TDP-43 protein. The valine and glycine residues are depicted in blue and red, respectively, while the basic residues (histidine, arginine, and lysine) are represented in yellow.
Amino acid content of the causative protein factors of Alzheimer’s disease.
A previous study showed that the N-terminal domain of prion protein, PrPC, contains repeats of PHGGGWGQ in five to six sites that bind divalent Cu2+ via glycine chelation. 10 High glycine content has also been reported for the causative factors of amyotrophic lateral sclerosis, which include SOD1, TARDBP, and FUS, ranging from around 10% to 30%. Oxaloacetate is metabolized to phosphoenolpyruvate (PEP) by PEP carboxykinase (PEPCK), and the overexpression of PEPCK can confer an extended lifespan, 11 perhaps by channeling oxaloacetate toward PEP formation and thus minimizing oxalate buildup. Further, calcium oxalate is a primary component of renal stones. These phenomena corroborate the hypothesis that the formation of insoluble salts may be destructive to cells.
A high proportion of one or more particular amino acids in proteins is not rare. It is a hallmark of many risk factors for human diseases and numerous virulence factors in viruses. For instance, red meat is thought to be mutagenic, and it is marked by the presence of myoglobin which possesses around 20% to 21% basic amino acids, attracting anions such as Cl− and contributing to the local formation of HCl, which generates mutations. 12
A carbohydrate/vitamin diet taken at intervals and under the guidance of a physician does not provide essential amino acids13,14 and can be used to reduce a patient’s level of full-length, disease-causative factors and their fragments, thereby addressing symptoms caused by the overrepresentation of essential and/or non-essential amino acids. A plant-based diet also reduces the intake of essential amino acids. Reduced food consumption or occasional fasting may also be beneficial to patients carrying protein risk factors with high proportions of non-essential amino acids such as glycine. However, this should only be conducted under the supervision of medical professionals. Caution should be exercised in the treatment of diseases with food regimens, as these need to be carefully designed.
In summary, the secondary chemical bonding of the carbonyl oxygen in glycine residues is critical in the etiology of LATE, and this phenomenon may also contribute, at least in part, to the pathogenesis of other human disorders with glycine-rich causative factors, such as amyotrophic lateral sclerosis and prion diseases.
Footnotes
Acknowledgements
We thank Wei Xie, Tao Gan, Li Xu, and Weiguo Cao for discussions and Yan Shi for editing.
Declaration of conflicting interest
The authors declare that there is no conflict of interest.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by grants from the Science and Technology Transformation Program of Sun Yat-sen University of China (33000-18843234), the Guangzhou Science and Technology Program (201804010328) to Q. Liu, and the National Science and Technology Major Project of the Ministry of Science and Technology of China (2017ZX10103011) to Y. Zhang.
