Abstract
Epigenetic mechanisms such as DNA methylation have been implicated in a number of diseases including cancer, heart disease, autoimmune disorders, and neurodegenerative diseases. While it is recognized that DNA methylation is tissue-specific, a limitation for many studies is the ability to sample the tissue of interest, which is why there is a need for a proxy tissue such as blood, that is reflective of the methylation state of the target tissue. In the last decade, DNA methylation has been utilized in the design of epigenetic clocks, which aim to predict an individual’s biological age based on an algorithmically defined set of CpGs. A number of studies have found associations between disease and/or disease risk with increased biological age, adding weight to the theory of increased biological age being linked with disease processes. Hence, this review takes a closer look at the utility of DNA methylation as a biomarker in aging and disease, with a particular focus on Alzheimer’s disease.
INTRODUCTION
Dementia is an umbrella term that encompasses a wide variety of neurodegenerative disorders including vascular dementia, Lewy Body dementia, frontotemporal dementia, and Alzheimer’s disease (AD), which is the most common form and accounts for 60-80% of all dementia cases [1]. AD is characterized by cortical atrophy [2] and thinning [3], aggregation of hyperphosphorylated microtubule-associated tau protein (MAPT) resulting in intracellular neurofibrillary tangles, and accumulation of extracellular amyloid-β (Aβ) plaques [4]. Age is regarded as the most important risk factor for neurodegenerative diseases, including AD [5], and as the worldwide population ages, there will be an increase in the prevalence of dementia and AD. In addition to age, there are several accepted modifiable and non-modifiable risk factors for AD [6, 7]. The most common modifiable factors include physical inactivity, smoking, excessive alcohol consumption, air pollution, head injury, infrequent social contact, lower educational attainment, obesity, hypertension, diabetes, depression, and hearing impairment [8]. Non-modifiable risk factors include sex, family history, and genetics, most notable being variation in the Apolipoprotein E (APOE) gene [7]. Currently, several biomarkers for dementia are available that may aid in the diagnosis of possible/probable AD [9]. Cerebrospinal fluid (CSF) obtained via lumbar puncture and brain imaging measures, such as positron emission tomography (PET), computed tomography (CT), and magnetic resonance imaging (MRI), have been the focus in the search for robust and easily accessible biomarkers for AD [10]. These methods focus on the measurement of proteins important in the disease process including tau and Aβ. Notably, while CSF and brain imaging markers have shown good predictive accuracy, particularly in preclinical studies, they are not ideal for use as routine clinical biomarkers due to their cost, level of invasiveness, and need for interpretation of results by specialists [11]. Even though the aforementioned biomarkers are useful in assisting in the diagnosis of AD, a definitive diagnosis of AD can only be made on biopsy postmortem [2]. Currently, there is an urgent need for an easily obtainable, less invasive, cost-effective routine biomarker, that is specific and sensitive to predicting disease progression.
EPIGENETICS AND DNA METHYLATION
Epigenetic modifications are heritable phenotypic changes that do not alter the DNA sequence [12]. These modifications fall into one of three categories; histone modification, non-coding RNAs, DNA hydroxymethylation, and DNA methylation [12]. Epigenetic modifications regulate gene expression patterns by altering DNA accessibility and chromatin structure [13]. Epigenetic processes are a natural occurrence and have critical roles in cellular processes such as gene regulation, chromosome stability, X chromosome inactivation, and genomic imprinting [14]. This review will focus on the most well-studied epigenetic mechanism [15], DNA methylation. Approximately 28 million cytosine-phosphate-guanine (CpG) dinucleotides are distributed unevenly throughout the mammalian genome [16]. These CpG dinucleotides are the target of an epigenetic modification known as DNA methylation, which involves the transfer of a methyl group from S-adenosyl-L-methionine (SAM) onto the 5’ position of a cytosine ring, resulting in the formation of 5-methylcytosine (5mC). The covalent attachment of the methyl group is catalyzed by one of several members belonging to the enzyme family known as DNA methyltransferases (DNMT). DNA methylation is essential in several processes, including silencing retroviral elements, regulation of tissue-specific gene expression, genomic imprinting, and X chromosome inactivation. It has also been found that DNA methylation exerts different influences on gene activities in different genomic regions depending on the underlying genetic sequence. Within intergenic regions, one of the main functions of DNA methylation is to repress the expression of genetic elements that might be potentially unfavorable. These elements, when expressed, may have harmful consequences, with their replication and insertion potentially leading to gene disruption and DNA mutation [17]. Varying degrees of methylation at genes and regulatory sequences can determine the level and integrity of gene expression [16]. Importantly, unmethylated CpGs are not distributed randomly throughout the genome, but are clustered in formations called CpG islands (CGIs), usually located in the promoter regions of genes [18]. CGIs are usually unmethylated in normal cells regardless of their expression levels and gene silencing is promoted post-methylation [16]. Due to their location, they are able to regulate gene expression through silencing of transcription [19]. DNA transcription can be affected in two ways; firstly, methylation of CGIs can impair transcription factors binding and, secondly, methyl-CpG-binding proteins bind to the methylated DNA sequences [20].
Assessment of DNA methylation in neurodegenerative diseases
As stated above, DNA methylation regulates tissue-specific gene expression, complicating its exploration in neurodegenerative diseases, due to the requirement of brain tissue. Brain tissue is only available postmortem making it infeasible for large-scale or longitudinal studies, where repeated measurements are required. Additionally, the heterogeneity of brain tissue poses a barrier in terms of analysis as it is widely known that different cells possess different patterns of methylation [21]. Importantly, postmortem brain tissue only offers an insight into the end stage of processes associated with AD and may not reflect the processes involved in disease development.
To address these concerns, proxy tissues such as blood or saliva have been investigated to determine if their level of DNA methylation is reflective of the methylation state of the tissue of interest, i.e., the brain. Specifically, are the changes observed in the tissue of interest truly reflected in peripheral tissues. Early studies sought to clarify this, with a number demonstrating conservation of DNA methylation across different tissues, using several different methodologies for the assessment of DNA methylation, including the widely used Illumina 450k array, which offered the most comprehensive coverage of the methylome available at the time (now superseded by the Illumina EPIC 850k array). A study by Horvath et al. [22] showed evidence for conservation of DNA methylation patterns across four brain regions and blood tissue, with a correlation of r = 0.85-0.91, using publicly available genome-wide methylation data. This finding was supported in a study assessing 80 matched blood and postmortem brain tissues, which found that subset of probes on the Illumina 450K array were significantly correlated across tissues [21]. Walton et al. [23] have likewise published on the 450K array, reporting that 7.9% of CpGs were strongly correlated between tissues. Further, through the development of the BECon (Blood-Brain Epigenetic Concordance) tool, Edgar and colleagues [24] demonstrated that 9.7% of CpGs from the Illumina 450K array were correlated between tissues. When assessing data from methylated DNA immunoprecipitation sequencing (MeDIP-seq) a high correlation between blood and the cortex and cerebellum (r = 0.82 and 0.77, respectively) was observed in three individuals [25]. More recently, Braun [26] investigated correlations between brain, saliva, blood, and buccal samples obtained from patients undergoing neurosurgical resection for epilepsy, and found that DNA methylation in the brain was most highly correlated with saliva (r = 0.90), followed by blood (r = 0.86) and buccal cells (r = 0.85).
While evidence exists for conservation of DNA methylation between tissues, contradicting studies reporting no significant associations have been published. Specifically, a recent study reported no evidence of a correlation between DNA methylation in peripheral blood CD4 + lymphocytes and prefrontal cortex samples [27]. Currently, the evidence is unclear, and the relationship between DNA methylation in the brain and blood requires additional characterization. However, it has been suggested that a suitable peripheral biomarker does not necessarily need to mirror disease-associated changes observed in the brain and could independently reflect disease responses in the periphery.
DNA METHYLATION AND ALZHEIMER’S DISEASE
Global DNA methylation
DNA methylation can be classified as either global or gene-specific. Global methylation refers to the average percentage methylation across the entire genome, whereas gene-specific DNA methylation refers to the average percentage methylation within a specific gene. Global methylation or overall methylation of the entire genome provides an over-arching picture of methylation status in a sample [28]. However, this measurement should be used with caution as differences in global methylation can result from disease state, tissue collection site, sex, and age [29].
In a 2009 twin study, monozygotic twins discordant for AD were found to have differences in global methylation in the anterior temporal cortex and the superior front gyrus, where the twin with AD had significantly decreased levels of methylation in those two areas of the brain [30]. While both twins had similar levels of education and worked in similar fields, the twin diagnosed with AD worked extensively with pesticides, suggesting this environmental exposure may have contributed to epigenetic changes and the development of AD [30]. Several epidemiological studies have reported phenotypic discordance in monozygotic twins, with the older sibling typically reflecting more discordance for age-related diseases [31, 32]. This is hypothesized to be due to an increased rate of loss of epigenetic architecture, or epigenetic drift, which is described as an increase in DNA methylation errors across the genome during aging [33]. Similarly, several other studies have demonstrated that older monozygotic twins have greater global differences in DNA methylation patterns when compared to younger twin pairs [34–36].
Since it was recognized that DNA methylation could be cell and region-specific, researchers began utilizing immunohistochemical methods to assess brain regions typically affected by AD. In individuals with AD, significant decreases in global DNA methylation have been observed in regions of the brain typically affected by AD, such as the entorhinal cortex [37]. In contrast, within the cerebellum, a region known to be spared in AD, differences in those with AD were not observed [37]. However, in the frontal cortex, a significant site of synaptic loss in AD [38], individuals with AD have been reported to have higher levels of DNA methylation when compared to those classified as cognitively unimpaired. Additional studies have reported increased global DNA methylation levels in the middle frontal gyrus and frontal cortex, in individuals diagnosed with AD compared to those without [39, 40].
The hippocampus is involved in memory formation and is one of the first regions of brain to display atrophy in AD [41]. Similar to the results outlined above, studies assessing DNA methylation in the hippocampus have been inconclusive with higher [42] and lower [43] levels of DNA methylation observed in those diagnosed with AD. Further, DNA methylation is reported to differ depending on the hippocampal subregion and cell type [43]. Similarly, Phipps et al. [44] demonstrated that pyramidal neurons, a particularly vulnerable cell type in AD, but not calretinin interneurons or microglia, have decreased DNA methylation in AD cases compared to controls.
As well as interrogating DNA methylation within brain regions, peripheral blood cells are becoming more common due to the ease of sample access. In 2015, Di Francesco et al. [45] evaluated global DNA methylation in peripheral blood mononuclear cells in AD patients and cognitively unimpaired controls, finding a significant increase in the global methylation levels of AD patients [45]. Additionally, higher global DNA methylation was observed in the presence of the APOE ɛ4 allele, highlighting the importance of combining epigenetic and genetic markers of AD [45]. In a study by Bjornsson et al. [46], peripheral blood was sampled from community cohorts to assess time-dependent changes in global DNA methylation over 11-16 years. Changes in DNA methylation over time varied between individuals, with both increases and decreases observed [46]. These findings are consistent with those published in studies of cancer, where hypomethylation (decreased levels of methylation) and hypermethylation (increased levels of methylation) work in tandem to activate and suppress oncogenes and tumor suppressor genes, respectively and concurrently [47]. It is possible that a similar interplay of gene-specific hypo- and hyper-methylation may be employed during AD progression.
Gene-specific DNA methylation
Initially, studies of gene-specific DNA methylation in AD focused on genes that encode the key proteins implicated in AD pathology, such as Aβ precursor protein (APP), presenilin 1 (PSEN1), microtubule associated protein (MAPT), and apolipoprotein E (APOE). The APOE ɛ4 allele is the strongest genetic risk factor for AD. The ApoE protein functions as the primary cholesterol carrier for the maintenance, growth and repair of neurons and is typically highly expressed in the central nervous system [48]. Wang et al. [49] reported hypermethylation of the APOE promoter region in the prefrontal cortex of AD patients, which is in line with observations of lower levels of circulating ApoE in AD patients [50]. Similarly, Karlsson [51] found an association between dementia and AD, and increased methylation levels at the APOE promoter. However, a number of studies have reported no significant difference in the methylation of APOE as a result of AD [52–55].
The analysis of DNA methylation in genes encoding other important proteins in AD pathogenesis (APP, PSEN1, BACE1, and MAPT), has likewise returned varied results. The level of DNA methylation of these genes within brain [52, 56] and blood [55, 58] samples, was observed in several studies to be associated with AD diagnosis [52, 58], and in other studies not associated [52, 57]. Another commonly targeted gene is BDNF, which encodes the brain-derived neurotrophic factor (BDNF), a protein that is commonly found in decreased levels in AD and is involved in neuronal differentiation, plasticity, and survival [59]. Significantly higher levels of DNA methylation in the BDNF promoter in peripheral samples have been reported in patients with AD [60–62]. However, studies of peripheral blood samples have also found no significant differences in the methylation of BDNF between AD and controls [55, 63]. Neuroinflammation is known to occur during the development of AD [64], and as such the methylation of genes involved in this biological process have been investigated with regard to AD. Nicolia et al. [65] found evidence of hypomethylation of the promoter region of Interleukin-1β (IL-1β) in the early stages of AD, which increased in later stages of the disease, matching the DNA methylation of control participants. Conversely, the same study observed decreasing levels of Interleukin-6 (IL-6) DNA methylation with the progression of the disease [65]. Neuroplasticity and memory formation are also known to be compromised in AD, and a gene thought to be involved in these processes, Sorbin and SH3 Domain Containing 3 (SORBS3), has been found to be hypermethylated in the brains of AD patients [66] and mouse models of AD [67]. Finally, the assessment of genes involved in the process of DNA methylation (DNMT1, DNMT3A, DNMT3B, MTHFR) has revealed associations with AD diagnosis [49]; however, this finding was not replicated in a subsequent study [57].
The development of large-scale DNA methylation arrays has allowed for the unbiased assessment of site-specific DNA methylation across the epigenome. An epigenome-wide association analysis of longitudinal peripheral blood samples identified differentially methylated genetic loci, near known AD risk genes including BDNF, BIN1, and APOC1, to be associated with the diagnosis of mild cognitive impairment and AD in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort [62]. More recently, an additional study in the ADNI cohort identified a CpG site cg00386386, mapped to MED22, as associated with cognitive decline as measured by the Pre-Alzheimer’s Cognitive Composite (PACC) [68].
DNA METHYLATION DYNAMICS DURING AGING
Epigenetic factors, when compared to genetic factors, could be more appropriate in explaining the observed anomalies in LOAD, as DNA methylation patterns are altered during development. In particular, the epigenome is susceptible to deregulation during early embryonal and neonatal development, puberty and most notably, old age [69], the greatest risk factor for AD. Numerous studies have aimed to categorize the cellular and molecular characteristics of aging [70]. Epigenetic alterations represent one of the most crucial mechanisms driving the deterioration of cellular functions, which can be observed during aging and age-related diseases [71]. Specifically, changes in the methylome have been observed to occur over time and are thought to be one of the driving factors for the development of age-related diseases [72]. Epigenetic changes either occur randomly or are driven by internal and external influences [71]. This may explain why aging patterns can be vastly different between genetically identical individuals, such as identical twins, or between animals with similar genetic makeups [34, 73]. Studies in mammals and single cellular models (e.g., Saccharomyces cervevisiae) reveal that the epigenome suffers a progressive loss of architecture during aging, resulting in changes in chromosomal architecture, genomic integrity, and gene expression patterns [71].
Epigenetic drift
DNA methylation changes with age, regardless of the input of maintenance DNA methyltransferases. It has been hypothesized that a complex interplay of genetic, environmental (diet and lifestyle), and stochastic factors (defects in the transmission of epigenetic information through cell division) result in epigenetic drift over time, with the contribution of these factors increasing with age [74]. Epigenetic drift is non-directional, with both hyper- and hypomethylation being observed. Further, it is non-uniform across the genome and variable within individuals of the same age [75]. While epigenetic drift is known to result in unpredictable differences between the methylomes of aging individuals, some changes involve specific genetic regions and are age-associated. This indicates that DNA methylation changes are not purely random but have a role in the aging process.
Age-associated differentially methylated positions/regions
The development of high throughput arrays, such as the Illumina 27k, 450k, and EPIC arrays, has provided an impetus to studying age-specific DNA methylation patterns. These investigations reveal that epigenetic drift appears to be restricted to specific sites in the genome (referred to as age-associated differentially methylated positions/regions; a-DMPs/a-DMRs) and is not purely random [74]. Specific age-related regulatory mechanisms are thought to be involved in age-associated hyper- and hypomethylation of sites detected in a number of epigenome-wide association studies (EWAS) [74]. A study by Bormann et al., in human skin, found that methylation deteriorated with age, with hypermethylated and hypomethylated sites trending towards an intermediate (50%) level of methylation, with a smaller dynamic range [76]. While age-associated, tissue-specific changes in DNA methylation have been identified [77, 78], consistent tissue-type independent a-DMPs have been reported in multiple tissues and cell types [22, 79], and in stem cells [77]. Notably, the causes and functional consequences of age-related changes in DNA methylation, and their relationship to aging, longevity, and disease, are yet undermined [79]. It has been speculated that the identification of age-associated DNA methylation differences between diverse groups of individuals could aid in determining the epigenetic basis of aging and age-related health disparities [80]. Recently, these age-associated changes have been leveraged to aid in the development of DNA methylation clocks.
EPIGENETIC AGE AND DNA METHYLATION CLOCKS
There is great potential for the use of DNA methylation clocks as biomarkers for disease risk and predictors of life expectancy, morbidity, and mortality. Biological aging, like chronological aging, is associated with disease, morbidity and mortality, health, and cognitive and physical decline. Indeed, the varied progression in biological age may allow for the prediction of the aspects of aging outlined above. Measures of biological aging are in development for use in the prediction of disease onset, risk, and progression [81]. Several potential biomarkers of aging, including telomere length, metabolomic, transcriptomic and proteomic markers, and of interest here, DNA methylation, have been proposed [70, 82]. While many biomarkers only capture certain aspects of aging, an increasing body of literature suggests that manifestations of aging are due to epigenetic processes [83]. The “epigenetic clock” is a DNA methylation-based estimate of biological age and has been strongly correlated with chronological age. A measure of age acceleration can be calculated based on the difference between an individual’s biological age (estimated) and chronological age (actual) [84]. Accelerated aging has been linked to an increase in the pathological hallmarks of AD, reduced cognitive and physical fitness and an increase in all-cause mortality.
Approximately 28 million CpG sites exist within the human genome, usually located in clusters of hundreds or thousands, known as CpG islands. Epigenetic clocks have been developed using a select number of these CpG sites, widely distributed throughout the genome. An epigenetic clock, or a DNA methylation clock, is built from DNA methylation markers strongly correlated with chronological age (r ≥ 0.8) [82]. The process of building a DNA methylation clock utilizes supervised machine learning methods such as lasso (least absolute shrinkage and selection operator), elastic net or ridge and training against chronological age. During this process, an informative and predictive set of CpGs is identified [81]. Newer generation epigenetic clocks are specifically trained on age-related biological and health-related measures.
Broadly, epigenetic clocks have been classified as first-generation chronological DNA methylation (DNAm) clocks or second-generation biological DNAm clocks. Chronological DNAm clocks reflect age-related DNAm changes that are common among individuals and reflect the intrinsic aging process. In a pioneering study, Bocklandt et al. [85] demonstrated that DNA methylation levels from saliva could generate an accurate age prediction. Since then, numerous DNAm clocks have been developed that accurately estimate the chronological age of a sample based on DNA methylation levels within the blood and other tissue and cell types (Table 1) [83]. The DNAm clocks developed by Horvath [83] and Hannum [100] were built using similar regression models, are correlated with age and have the ability to predict all-cause mortality. The most well validated of the DNAm clocks is the Horvath clock, which is predictive in all cell types, and encompasses 353 CpGs all weakly correlated with age but when combined are highly predictive of chronological age [83]. The Hannum clock is also well correlated with age, comprising of 71 CpGs sites; however, while works in blood, it requires further calibration for use in other tissues. More recently, the Zhang DNAm clock was developed correcting for cellular composition so it can be accurately used in blood and non-blood samples [86]. Currently there is a movement towards the development of clocks encompassing a smaller number of CpG sites to minimize cost/increase time efficiency [87]. However, these estimations have been demonstrated to be unreliable when used for multiple tissues [82], and have high average errors in their age predictions when compared to clocks with a greater number of CpG sites [74, 88–90].
Summary of DNA methylation clocks
Biological DNAm clocks, considered “second generation” clocks, are reflective of DNAm changes that vary between individuals and may highlight age-related phenotypes and external factors that may influence age-related DNAm [91]. Specifically, these clocks have been developed against age-related biological processes such as mitotic activity (e.g., epiTOC [92]; MiAge [93]), clinical measures of phenotypic aging (PhenoAge [94]), telomere length (DNAmTL [95]), plasma biomarkers and physiological and stress factors (GrimAge [96]), and all-cause mortality (Zhang [97]). Finally, the Dunedin Pace of Ageing clock, has incorporated the rate of change of 18 blood chemistry and organ system function measurements into the calculation of its score to improve predictive utility [98].
Age acceleration and health and disease
As outlined above, and summarized in Table 1, DNAm clocks have been developed using several different training models and as such reflect different aspects of disease and the aging process. It has been hypothesized that DNA methylation age (DNAm age) measures the cumulative effect of an epigenetic maintenance system [83]. Studies have demonstrated that DNAm age of blood predicts all-cause mortality in later life, which suggests that methylation age is a process that reflects aging [99]. However, it is more probable that the epigenetic clock is a result of the emergent properties of the epigenome [83]. Several studies have proven the usefulness of using age acceleration as a measure for healthy biological aging. For example, semi-supercentenarians and their offspring have a lower biological age compared to chronological age, or age deceleration, when compared to age-matched controls [100]. Similarly, women have shown to have less significant age acceleration compared to men, which is in line with their representative longer lifespans [101, 102].
Moreover, a number of studies have demonstrated the utility of age acceleration as a predictor of disease risk and mortality. A study assessing four longitudinal cohorts identified an association between all-cause mortality and increased epigenetic age in blood [103]. Further, a number of genetic syndromes have been reported to accelerate biological aging, such as Down syndrome [104] and Werner’s syndrome [105]. There is ample evidence of premature biological aging in those with viral infections such as HIV [106, 107], individuals with decreased mental and physical fitness [108], body mass index and metabolic disease [109], non-alcoholic steatohepatitis [110], obesity [111], insomnia [112], and extreme stress [113]. Additionally, individuals with neurodegenerative diseases such as Parkinson’s disease [114], Huntington’s disease [115], and AD [116] show accelerated biological aging.
METHYLATION AGE AND AD PHENOTYPES
The limited research which has been undertaken to explore the relationship between DNAm age and AD related phenotypes has yielded inconsistent findings. The association between DNAm age and AD phenotypes has primarily been characterized by measures of DNA methylation within the blood [108, 117–125], while only one study investigated brain samples [116].
Further, research has focused largely on “first generation” estimates of DNAm age, specifically the Horvath and Hannum clocks, which have been trained to approximate chronological age. Age acceleration, as estimated using the Horvath DNAm age clock in blood, has been associated with lower cognition cross-sectionally [108, 116], longitudinal cognitive decline [122, 123], and reduced white matter integrity. Additionally, Horvath age acceleration in prefrontal cortex samples has been associated with diffuse and neuritic Aβ plaques and higher global Aβ burden [116]. Despite these findings, a number of studies have also reported no significant association between Horvath DNAm age acceleration and cognitive functioning or brain volume, both cross-sectionally [117, 121] and longitudinally [108, 121]. Likewise, inconsistent findings have been reported when assessing DNAm age using the Hannum estimation. Hannum age acceleration has been associated with smaller hippocampal volume [125], but conversely, increased brain connectivity and decreased mean diffusivity (improved axonal integrity) in MRI quantified studies [117]. Additionally, several studies have failed to identify any significant associations between Hannum age acceleration and cognitive functioning [117, 121] or longitudinal cognitive decline [121, 124].
The association between “second generation” biological DNAm age estimates and AD phenotypes has also been explored, although less broadly. GrimAge acceleration has been associated with lower cognitive ability, accelerated cognitive decline, lower intelligence test scores, lower white matter and grey matter volumes, total brain volume, and higher volume of white matter hyperintensities [119]. However, both DNAm telomere length and the Pace of Ageing score were not associated with cognitive decline [120]. Inconsistency in the current literature underscores the need for further investigation of DNA methylation before it can be considered as a robust biomarker for aging-related disease such as AD.
CONSIDERATIONS WHEN UTILIZING DNA METHYLATION
The “tissue issue”
The studies outlined above, and summarized in Table 2, have presented DNA methylation findings from a range of tissue and cell types. As is evident in Table 2, there has been a trend towards large-scale discovery studies (EWAS) using peripheral blood and a shift away from studies exploring global methylation. When exploring these findings, it is important to consider that DNA methylation levels at specific CpG sites typically cannot be compared across different tissue types and DNA methylation at many CpG sites is only weakly correlated across tissue types such as the brain (cortex and cerebellum) and blood [21]. Tissue specificity is one of the most common uncertainties that arise when using peripheral tissues in association studies of DNA methylation. When considering the utility of DNA methylation markers in either the target organ, the brain, or the periphery, the blood, it is important to be aware of the challenges posed by sample types (Table 3).
Summary of DNA methylation studies in AD
Advantages and Disadvantages of commonly used cell types
When assessing brain tissues in epigenetic clock analysis of neurodegenerative or neuropsychiatric disorders, limited information is available regarding the specific brain regions and cell types present, which has been reflected in differences in epigenetic patterns and gene function [25]. However, with regard to cell-type specific methylation, computational methods, such as the Houseman [126] (cellular proportion estimates in blood) and Guintivano [127] (neuronal and non-neuronal proportion estimates) methods have been developed to address the issues resulting from varying cell-type proportions between individuals [127, 128]. The gold standard for controlling for cell proportions is using sorted cell populations such as laser capture microdissection or fluorescence-activated cell sorting; however, these are relatively expensive options and hard to achieve logistically in large scale studies. Importantly, brain tissue is typically obtained postmortem, which needs to be considered when interpreting data [129]. Specifically, are epigenetic patterns observed post-mortem consequence of the disease process itself, rather than causal? Additionally, detected epigenetic signals could be due to the cause of death, tissue pH and the premortem agonal state [130]. Careful study design via targeted validation employing cell sorting within the tissue of interest can help isolate cell type-specific changes, resulting in a clearer explanation of observed biological effects, such as which epigenetic markers mediate risk for disease or associate with a phenotype. Characterizing and exploring the effects of cellular heterogeneity is a crucial step in the analytical pathway of methylome-wide DNAm data in heterogenous tissues, especially peripheral blood, thus we recommend that it not be overlooked. Moving forward, more research focus needs to be placed on cross-tissue study designs to identify DMPs/DMRs common across two or more cell types such as brain tissue and peripheral blood to uncover common correlations. If the aforementioned limitations can be overcome, the next major issue is sample availability and the capacity to undertake appropriately powered studies to detect associations, which may besubtle.
Conversely, sampling of peripheral tissue offers many advantages over target tissues, such as ease of access, large sample sizes, replication opportunities and longitudinal sampling, both prior to disease onset and during treatments and/or interventions. However, as with brain samples, several limitations need to be considered when performing and interpreting analyses utilizing DNA methylation results from peripheral blood. In addition to tissue specificity, cellular heterogeneity is a significant potential confounder of quantifying DNA methylation levels in the blood and should be considered when interpreting results. Whole blood is composed of many different cell types, including neutrophils, lymphocytes, monocytes, eosinophils and others, and the proportions of these cell types can be influenced by numerous factors such as bacterial or viral infections, inflammation, diet, stress, medication, and environmental exposures [131]. There is an increasing number of studies utilizing peripheral DNA methylation; however, very few take into consideration the proportions of different cell types, and thus do not control for them in analyses, which likely drives some observed associations or masks potential differences between groups [132].
There is currently little evidence to suggest that AD-related epigenetic modifications in the brain are reflected in the periphery [133]. Several studies have measured DNA methylation in specific brain regions and in blood; however, only a few studies reported changes in both tissues, which are speculated to be driven by underlying genetic repeats within the C9orf72 gene [134–136]. While a peripheral biomarker, such as one identifiable in blood, has been proposed as a useful surrogate marker to monitor disease status in major organs or systems, it could independently reflect disease processes in the periphery and be important in disease tracking and prognosis.
Underestimation of age in epigenetic clocks
Several studies have now reported an underestimation of age [137–140], with deviations in biological age from chronological age particularly noticeable in the higher age brackets. This observation is likely due to several contributing factors. One hypothesized factor contributing to an underestimation of age is the saturation of methylation sites, where large proportions of sites are approaching the lower (0%) and upper (100%) limits of methylation, caused by epigenetic drift in aging individuals [141]. Dhingra et al. [139] speculate that differences in results between epigenetic clocks within the same cohort are likely caused by missing probes between arrays, irrespective of probe imputation procedures leading to an underestimation of age. Additionally, survivorship bias, a type of selection bias, may be driving the underestimation of age, particularly in cohorts with relatively high mean age. Individuals with a negative age acceleration are maintained within the sample, and those with a positive age acceleration, are not present within the sample, leading to a skewed cohort. The underlying populations on which the clocks are trained are also an important factor to consider when interpreting results. For example, the Hannum and Horvath clocks were trained on chronological age and given their overall high accuracy in estimating age, they are likely not ideal estimators of biological age. A clock that is capable of predicting chronological age perfectly would contain no information on variation in biological age at an individual level [16]. For this reason, clocks such as the PhenoAge clock, which were trained on age-related and disease phenotypes in combination with chronological age, are preferred when investigating biological age more directly. Further, DNAm age estimates may be susceptible to a ceiling effect, where a plateau is observable. The Horvath clock requires an age transformation past a certain age threshold, indicating that the relationship between chronological age and estimated age is not linear and, past a certain age, estimates need to be treated differently [83]. Recently machine learning and deep learning approaches have been employed in an attempt to mitigate shortfalls such as the ones mentioned here, and have observed more accurate age predictions for older individuals than the original clocks [142].
CONCLUSION
DNA methylation has been implicated in a number of diseases in the last decade including, cancer [143], heart disease [144], autoimmune disorders [145], and neurodegenerative diseases [146]. Many studies, leveraging both global and site specific CpG DNA methylation, have provided evidence for age-associated changes in DNA methylation. However, more recently, clusters of CpG sites whose DNA methylation status provide an accurate measure of chronological age have been identified. Together these sites provide the foundation for epigenetic clocks. Although initially, the clock’s main purpose was to serve as a biomarker of chronological age, there is growing evidence suggesting that epigenetic clocks may have value as biomarkers of biological age. Emerging research supports the idea that age and age-associated diseases or phenotypes are characterized by increased age acceleration in comparison to healthy individuals. Consequently, research focus has been placed on epigenetic clocks as biomarkers to detect aging and aging-associated diseases. The utility of peripheral blood as a biomarker of diseases in specific organs is often questioned, given cells in the periphery may not accurately capture and reflect more central disease processes. However, a good peripheral biomarker does not have to mirror disease-associated changes in the brain. Rather, it could represent a co-occurring peripheral response to central pathology. Despite its limitations, peripheral blood remains the most obvious surrogate tissue for epigenetic studies. DNA methylation is a highly promising biomarker for AD disease-associated pathology. Further investigation of its association with disease trajectories will be essential in identifying individuals at risk of age-related diseases, thus optimizing treatment and intervention strategies.
Footnotes
ACKNOWLEDGMENTS
The authors have no acknowledgments to report.
FUNDING
LM is supported by an Edith Cowan University Higher Degree by Research scholarship and a scholarship top-up from the Australian Alzheimer’s Disease Research Foundation. TP is supported by an Edith Cowan University Strategic Research Fellowship. The authors have no other funding to report that is relevant to this manuscript.
CONFLICT OF INTEREST
The authors have no conflict of interest to report.
