Abstract
Alzheimer’s disease (AD) and frontotemporal dementia (FTD) represent the first cause of dementia in senile and pre-senile population, respectively. A percentage of cases have a genetic cause, inherited with an autosomal dominant pattern of transmission. The majority of cases, however, derive from complex interactions between a number of genetic and environmental factors. Gene variants may act as risk or protective factors. Their combination with a variety of environmental exposures may result in increased susceptibility to these diseases or may influence their course. The scenario is even more complicated considering the effect of epigenetics, which encompasses mechanisms able to alter the expression of genes without altering the DNA sequence. In this review, an overview of the current genetic and epigenetic progresses in AD and FTD will be provided, with particular focus on 1) causative genes, 2) genetic risk factors and disease modifiers, and 3) epigenetics, including methylation, non-coding RNAs and chromatin remodeling.
INTRODUCTION
Most neurological disorders, including Alzheimer’s disease (AD) and frontotemporal dementia (FTD), are multifactorial diseases. Despite a small percentage of these diseases occurring in families with an autosomal dominant pattern of transmission, the majority of cases are sporadic, and derive from complex interactions between a number of genetic and environmental factors. Therefore, these diseases are defined as “multifactorial” or “complex” [1]. The familial clustering can be explained by recognizing that family members share a greater proportion of their genetic information and environmental exposures than do individuals chosen randomly in the population. Thus, the relatives of an affected individual are more likely to experience the same gene-gene and gene-environment interactions that led to disease in the first place than are individuals who are unrelated to the patient. The multifactorial inheritance pattern represents an interaction between the collective effect of the genotype at one or, more commonly, multiple loci (polygenic or multigenic effects) either to increase or to decrease the susceptibility to the disease, combined with a variety of environmental exposures that may trigger, accelerate, or protect against the disease altered mechanisms.
The gene-gene interactions in polygenic inheritance may be simply additive or more complicated. Gene-environment interactions, including systematic exposures or chance encounters with environmental factors in one’s surroundings, add even more complexity to individual disease risk and the pattern of disease inheritance.
Herein, main genetic variations, either causative or conferring risk for AD and FTD will be described, together with epigenetic factors.
GENETICS: BASIC CONCEPTS AND METHODOLOGIES
Genetic background differs from thousands to millions of genetic variants that are the differences in DNA sequences within the genome of individuals in the population. These variations can take many forms, including single nucleotide polymorphisms (SNPs), tandem repeats (short and variable), small insertions and deletions, duplications or deletions that change the copy number of a large segment of a DNA sequence (≥1 kb), i.e., copy number variations (CNVs), and other chromosomal rearrangements such as inversions and translocations (also known as copy-neutral variations) [1–3].
Common variants are synonymous with polymorphisms, defined as genetic variants with a minor allele frequency (MAF) of at least one percent in the population, whereas rare variants have a MAF of less than 1% [1]. The large majority of genetic variants are hypothesized to be neutral [4], as they do not contribute to phenotypic variation.
Four strategies have been applied so far in genetic studies: genetic linkage analysis, candidate gene studies, genome wide association studies (GWAS), and next generation sequencing (NGS) technology based studies: whole genome sequencing (WGS) and whole exome sequencing (WES).
Linkage analyses were the first kind of strategy used to unravel the genetic basis of Mendelian traits, involving families presenting autosomal dominant inheritance. Genetic linkage studies led to the identification of chromosomal regions associated with the disease segregation, but does not identify the causal gene associated, which requires fine mapping [5].
The candidate gene approach aims to determinate whether frequencies of genetic variants of people with a specific disease differ significantly from a control population. Susceptibility genes are defined when cases and controls showed significant differences in occurring genetic variants frequencies. Candidate gene approach led to the identification of the Apolipoprotein E gene (
The advent of microarray technology era revolutionized genetics research, allowing the contemporaneous determination of millions of SNPs in thousands of samples. GWAS are based on the testing common genetic variants in a hypothesis-free manner. Thus, it provides information on how common genetic variability confers risk for common diseases [7]. Several susceptibility genes for common neurodegenerative disorders have been revealed by GWAS studies, although the odds ratios associated with these risk alleles are relatively low [8].
Recent advances, collectively referred to as NGS, allowed for high-throughput sequencing, giving massive data results, that need to be analyzed by specific bioinformatics software. Moreover, in opposition to the first generation sequencing, NGS can produce the same genome sequence within a few weeks and with reduced costs. This allows for simultaneous investigation of multiple genes and has been demonstrated to be an effective alternative for establishing the genetic base for Mendelian diseases in the research setting [9, 10] and recently also in clinical settings [11, 12]. NGS relies upon multiple, short, overlapping reads of fragmented DNA that can be aligned against a reference genome or assembled “de novo” if no information on the reference genome is available. If just the protein-coding regions are amplified when sequencing all the genes, the method is referred to as WES, whereas when the target is the whole genome, it is known as WGS.
Genetics of Alzheimer’s disease
AD is a multifactorial and complex neurodegenerative disorder and the leading cause of dementia among elderly people. Genetically, AD can be subdivided into a rare familial form, accounting for 2-3% of all patients presenting with autosomal dominant inheritance, and a multifactorial sporadic form in which specific environmental exposures in combination with genetic susceptibility contribute to the exacerbation of the disease [1]. Genetically inherited AD usually develops before 65 years of age early onset AD (EOAD), whereas the sporadic type of disease often occurs later in life in individuals older than 65 years and is referred as LOAD [13].
Three genes, discovered thanks to linkage analysis, are responsible for familial AD: Presenilin 1 (
The proportion of cases of autosomal dominant AD explained by mutations in these genes is high but vary widely from 12% to 77% [16, 17], suggesting that there are additional genetic factors involved in the pathogenesis of EOAD. Recently, thanks to the NGS approach, some new genetic variants were found in small families with unexplained EOAD. Guerreiro et al. [18] identified a missense mutation in
Another study [19] identified mutations in the Sortilin related receptor 1 (
By using a NGS WES based approach, an association between
A recent study [26] identified, by GWAS, a novel missense mutation in phospholipase D family member 3 gene (
Recent GWAS studies [28, 29] confirmed that
Causal genes and genetic risk factors for AD
A rare variant is the Nicastrin gene, recently identified by NGS as risk factor for LOAD in a Greek population [31].
Lastly, Kohli and colleagues, using WES on 11 affected individuals in a large kindred with apparent autosomal dominant LOAD, found damaging missense mutations in the Tetratricopeptide repeat domain 3 gene (
GENETICS OF FRONTOTEMPORAL DEMENTIA: MAJOR CAUSAL GENES
The majority of FTD cases are sporadic and likely caused by the interaction between genetic and environmental factors. A number of cases, however, present familial aggregation and are inherited in an autosomal dominant fashion, suggesting a genetic cause [32–34]. Up to 40% of patients have a positive family history, with a diagnosis of dementia in at least one extra family member [33, 35]. At present, three major causal genes have been identified: Microtubule Associated Protein Tau (
Causal genes and genetic risk factors for FTD
MAPT
The first evidence of a genetic cause for familial FTD came from the demonstration of a linkage with chromosome 17q21.2 in autosomal dominantly inherited form of FTD with parkinsonism [36], named FTDP-17. The gene responsible for such association,
The pathology of all
To date, more than 40 pathogenic
The pathogenic mechanism of each different mutation depends on the type and location of the genetic defect, and affects the normal function of tau, i.e., the stabilization of microtubules promoting their assembly by binding tubulin. Some mutations increase the free cytoplasmic portion of the protein promoting tau aggregation, while others lead to an aberrant phosphorylation of tau protein, which damages microtubule stabilization [41]. Regarding mutations localized in the donor splicing site following exon 10, it was shown that these intronic mutations increase the inclusion of
Alternatively, other mutations affect the alternative splicing, thus producing altered ratios of the different isoforms (3R/4R tau). Most of missense mutations, such as the p.P301L mutation, reduce the ability of tau to bind microtubules leading to a decreased tau capacity to promote microtubules assembly [43]. Moreover, it was observed in
The clinical presentation in
GRN
After the discovery of
Since the original identification of null-mutations in FTLD, more than 70 different mutations have been described so far. Most of the known pathogenic
At neuropathological examination,
A collaborative study [53] analyzing
From the clinical point of view, mutations in GRN are associated with extremely heterogeneous phenotypes, but the main clinical diagnosis is FTD following by diagnosis of primary progressive aphasia [56]. Language impairment seems to be more relevant as the disease progresses. About 40% of patients have parkinsonism, and episodic memory impairment is frequently observed, leading to a clinical diagnosis of AD in some cases [57]. Although rarely, an overlap between psychiatric disorders and genetically determined FTD can occur, as shown by Rainero et al. [58], who described a patient with heterosexual pedophilia who was a carrier of a
The penetrance for
A major contribution to achieve a correct diagnosis independent of the phenotypic presentation is the demonstration that progranulin plasma levels are extremely low in
Regarding the function of progranulin, Pickford et al. [64] demonstrated, in an
Abnormalities of several cytokines and chemokines has been observed in cerebrospinal fluid (CSF) of
C9ORF72
One of the most intriguing discoveries in the genetics of FTD has been the investigation of FTD/motor neuron disease (MND) families linked to a locus on chromosome 9q21-22. The first evidence of linkage with this locus comes from a study carried out in families with autosomal dominant FTD-MND [67]. Additional data confirmed the linkage to chr9q21-22 in FTD-MND families [68], until, in 2011, two international groups identified the gene responsible for the disease in this locus,
In healthy subjects, most individuals carry between 2 and 20 repeats, but FTD and ALS patients had from 100 to also 1000 s of copies of repeats. The minimum repeat length to confer risk of disease is unknown, probably due to the presence of somatic mosaicism. In fact, the length of repeats is different between tissues even in the same individual and this phenomenon complicates genotype-phenotype correlation studies [71].
Clinical phenotypes are very variable [73] as well as the age at onset and disease duration; in fact, age at onset can range between 27 and 83 years and disease duration from 1 to 22 years. The most common clinical presentation is FTD, ALS, or both. As mentioned above, in families where FTD-ALS is the clinical phenotype, the
From a neuropathological point of view, postmortem examination showed that
Reddy et al. [83] demonstrated that the r(GGGGCC)n RNA forms extremely stable G-quadruplex structures, which theoretically may affect promoter activity, genetic instability, RNA splicing, translation and neurite mRNA localization.
Moreover, several studies, conducted in derived cells and tissue of patients, demonstrated that these foci are able to sequester RNA binding protein, including hnRNP h, hnRNP A1, and SC35, affecting the mRNA nuclear transport system [84]. However, the clear mechanism linking RNA foci and sequestered proteins to neurodegeneration has not been fully understood. Together with the formation of RNA foci and DPR, another suggested pathological mechanism of the
In cultured cells and primary neurons, Poly-GA overexpression led to the generation of p62-positive inclusions and neurotoxicity attributed to impaired ubiquitin proteasome function [86]. On the other hand, arginine-rich dipeptide (poly-GR and poly-PR) led to the formation of nucleolar inclusions in fly models [87]. Since the clinical utility as well as the significance and the temporal course of DPRs in the pathogenesis of the disease is still unclear, Lehmer et al. [88] established a poly-GP immunoassay from CSF in order to identify and characterize
GENETICS OF FRONTOTEMPORAL DEMENTIA: RARE CAUSAL GENES
CHMP2B
Few FTLD families display mutations in the charged multivesicular body protein 2B gene (
All mutations described (missense and truncation mutations) show a common mechanism of action: the deletion of the C-terminus of the protein with the loss of the Vsp-4 binding domain [90]. This causes the accumulation of mutated CHMP2B on the endosomal membrane and prevent the recruitment of other proteins necessary for endosomal fusion with lysosomal. This phenomenon leads to the impairment of the late endosomal trafficking and contributes to neurodegenerative processes in FTD [91]. This can be observed as enlarged and abnormal endosomal structures in postmortem brain tissue from patients [92]. From a histological point of view, patients with
VCP-1 and SQSTM1
Mutations in the Valosin Containg Protein gene (
Another gene involved in the mechanism of protein degradation as well as in FTLD pathogenesis is Sequestosome 1 gene (
CHCHD10
A coiled-coil-helix-coiled-coil-helix domain containing 10 (
TBK1
In 2015, a large exome sequencing case-control study identified mutations in the TANK binding kinase 1 gene (
TARDBP
FUS
Similar to TDP-43, Fused in sarcoma (
UBQLN2
TUBA4A
GENETICS OF FRONTOTEMPORAL DEMENTIA: GENETIC MODIFIERS
In addition to genes mentioned above and generally involved in familial autosomal dominant transmission, several genetic risk factors have been studied. The most important and replicated is the transmembrane protein 106b gene (
Common SNPs in the major causal genes have been studied to determine their association as FTD risk factors. For example, rs5848, located in 3’UTR of the
EPIGENETICS
Epigenetics is focused on the investigation of mechanisms able to influence the expression of genes without altering the DNA sequence. DNA methylation, chromatin remodeling, and non-coding RNAs (ncRNAs) are the three most investigated epigenetic modifications [116]. Epigenetic processes are able to regulate DNA replication and repair, RNA transcription, and chromatin conformation, that influence in turn transcriptional regulation and protein translation.
Methylation
DNA methylation is the best characterized epigenetic modification that involves the addition of a methyl group to the carbon-5 of a cytosine residue in DNA and is carried out by one of the several DNA methyltranferase (DNMT) enzymes. DNMT1 is the enzyme responsible for the maintenance of DNA methylation patterns during DNA replication. It localizes to the DNA replication fork, where it methylates nascent DNA strands at the same locations as in the template strand [117]. DNMT3a and DNMT3b are involved in the
Early epigenetic investigations related to AD focused on DNA methylation, finding non AD specific hypomethylation of the
More recent studies support an overall reduction in DNA methylation in AD patients thus highlighting the importance of DNA methylation in AD [123]. Interestingly Aβ has also been implicated as a trigger of epigenetic changes as it was found that Aβ induces global DNA hypomethylation [124]. Moreover, a DNA methylome paper found genes with altered methylation in AD brains [125].
Tau gene expression is also subject to complex epigenetic regulation, involving differentially methylated binding sites for transcription factors [126].
Recently, Bollati and colleagues investigated the methylation status of repetitive elements in blood, including
Regarding FTD, two studies analyzed the
A recent GWAS on DNA methylation pattern in peripheral blood of patients with FTD and progressive supranuclear palsy compared to healthy subjects found a specific methylation signature associated pathologically with tauopathy, suggesting this signature as a risk factor for neurodegeneration [130].
Regarding the
ncRNAs
It was widely believed in the past that most of the human genome consisted in “non-functional” DNA. It was later discovered that almost the whole genome is transcribed, but that just about 2% in translated into proteins [134].
It is now instead ascertained that most of this “junk” is functional and composed by ncRNA, whose signaling and editing is able to play a crucial role in chromatin and nuclear structure. In particular, ncRNAs are involved in epigenetic regulation by recruiting chromatin-modifying complexes. ncRNAs operate through repressive control but have also the potential to act as gene activators [134].
ncRNAs comprise small RNAs (sRNAs) of less than 200 nucleotides and long non coding RNA (lncRNAs) of more than 200 nucleotides. sRNAs are further subdivided as micro (mi)RNAs, short interfering (si)RNAs, and PIWI-associated (pi)RNAs, whereas lncRNA are categorized according to their direction and position of their transcription in: antisense, intergenic, exonic, intronic, overlapping [135].
miRNAs are single stranded, non-coding small RNAs that are abundant in plants and animals, and are conserved across species [136]. The raw transcripts undergo several nuclear and cytoplasmic post-translation processing steps to generate mature, functional miRNAs. In the cytoplasm, mature miRNAs associate with other proteins to form the RNA-Induced Silencing Complex (RISC), enabling the miRNA to imperfectly pair with cognate miRNA transcripts. The target mRNA is then degraded by the RISC, preventing its translation into protein [137, 138]. miRNA-mediated repression of translation is involved in many cellular processes, such as differentiation, proliferation, and apoptosis, as well as other key cellular mechanisms [139, 140].
It is now well established that altered RNA processing could act as a contributing factor to several neurological conditions including aging-related neurodegenerative diseases such as AD, FTD, ALS, and Parkinson’s disease [141–143].
In AD, the implication of miRNAs in Aβ production, via BACE1 modulation, and in tau phosphorylation, that leads to hyperphosphorylated neurofibrillary tangle formation, has been demonstrated [142].
Altered miRNA signatures were also identified in AD and FTD. In particular, several miRNAs have identified differentially expressed in postmortem tissue, blood, and CSF that also differ by disease stage [145, 146].
Regarding lncRNAs, they also have been involved in neurodegenerative diseases [146].
These ncRNAs are involved in different functions; they act as scaffolds for chromatin modifiers and nuclear paraspeckles, as transcriptional co-regulators, and even as decoys for other RNAs [145]. Dysregulations in lncRNAs can influence any one of these processes, thus contributing to neurodegeneration. lncRNAs associated with disease condition can post-transcriptionally increase gene expression, as it happens with the lncRNA BACE1-antisense whose expression is selectively increased in AD brains and competes with miR-545-5p binding to stabilize BACE1 mRNA. This will finally result in increased expression of BACE1 that contribute to the formation of the toxic Aβ peptides that is a major hallmark for AD [146].
Another lncRNA, BC200, likely plays a role in AD as increased levels were found in specific brain regions mostly affected by AD, such as the Brodmann’s area 9 [147]. MALAT1 and NEAT1 are other two lncRNAs very important for splicing and synapse formation [148, 149].
Chromatin remodeling
In mammalian cells, histone proteins interact with DNA to form chromatin, the packaged form of DNA. Histones are octamer consisting of two copies of each of the four histone proteins: H2A, H2B, H3, and H4. Each histone octamer constitutes in 146 bp of the DNA stand wound around it to make up one nucleosome, which is the basic unit of chromatin. Histone proteins can be modified by post translational changes, including: acetylation, methylation, phosphorylation, ubiquitination, and citrullination. These histone modifications induce changes to the structure of chromatin and thereby affect the accessibility of the DNA strand to transcriptional enzymes, resulting in activation or repression of genes associated with the modified histone [150]. The best-understood histone modification is acetylation, which is mediated by histone acetyltransferases and deacetylases [151]. Acetylation of histones is usually associated with upregulated transcriptional activity of the associated gene, whereas deacetylation of histones to transcriptional silencing [152].
Histone acetylation was found to be largely decreased in the temporal lobe of AD patients compared to controls and in mouse models of AD [153]. Moreover, increased H3 acetylation at the promoter region of the BACE1 gene in AD patients was found [153].
Besides acetylation, different forms of histone methylation exist [154], and may be linked to neurodegenerative diseases.
CONCLUSIONS
Herein, we provided an overview of the current genetic and epigenetic progresses in AD and FTD. We reviewed current knowledge on causative genes and altered mechanisms leading to the two diseases, genetic risk factors and disease modifiers shown to influence the age at onset and clinical course of the diseases, and the role of epigenetics, including methylation, non-coding RNAs, and chromatin remodeling, in influencing gene expression. Data obtained so far suggest a crucial role of microglia and immunity in AD and a role of autophagy and proteasomal degradation in FTD. Future challenges will be a better understanding of the interplay among genetic and epigenetic factors in order to correlate pathogenic mechanisms with clinical phenotypes and pave the way for novel therapeutic approaches such as miRNA mimics or miRNA antagonists (antagomirs), specifically designed to either reverse the downregulation or upregulation of disease-associated miRNAs.
DISCLOSURE STATEMENT
Authors’ disclosures available online (https://www.j-alz.com/manuscript-disclosures/17-0702r2).
