Abstract
Background:
Dementia due to Alzheimer’s disease (AD) is a complex neurodegenerative disorder, which much of heritability remains unexplained. At the clinical level, one of the most common physiological alterations is the slowing of oscillatory brain activity, measurable by electroencephalography (EEG). Relative power (RP) at the conventional frequency bands (i.e., delta, theta, alpha, beta-1, and beta-2) can be considered as AD endophenotypes.
Objective:
The aim of this work is to analyze the association between sixteen genes previously related with AD:
Methods:
An Iberian cohort of 45 elderly controls, 45 individuals with mild cognitive impairment, and 109 AD patients in the three stages of the disease was considered. Genomic information and brain activity of each subject were analyzed.
Results:
The slowing of brain activity was observed in carriers of risk alleles in
Conclusion:
Endophenotypes reduce the complexity of the general phenotype and genetic variants with a major effect on those specific traits may be then identified. The found associations in this work are novel and may contribute to the comprehension of AD pathogenesis, each with a different biological role, and influencing multiple factors involved in brain physiology.
INTRODUCTION
Dementia due to Alzheimer’s disease (AD) is a common, age-related, neurodegenerative disorder leading to progressive memory loss and impairments in speech and behavior. The prevalence of AD is higher among females than males, even when individuals at the same age are compared [1–4]. Indeed, AD diagnosis is not straightforward in early stages, and symptoms are frequently dismissed and confused with normal aging. AD progression is often characterized in three main stages: mild (or early-stage), moderate, and severe (or late-stage) AD, depending on the level of functionality of the patient [5]. Mild cognitive impairment (MCI) is characterized by a slight but perceptible decline in cognitive abilities, including memory and thinking skills, and plays an important role in early diagnosis. Individuals with MCI do not necessarily develop AD, but are in greater risk, especially those with amnestic MCI, which is considered a prodromal stage of the disease [6, 7].
Heritability plays an important role in AD [8],
Electroencephalography (EEG) measures the electrical activity of the brain, acquiring voltage fluctuations (derived from synaptic potentials) by electrodes placed on the scalp. In previous works, physiological alterations caused by neurodegeneration have been studied by means of EEG analyses [18–20]. AD is known to be strongly correlated with a general slowing of the EEG, measured through relative power (RP) calculations. Specifically, a progressive increase of RP in low frequency bands (i.e., delta and theta), along with a progressive decrease at high frequency bands (i.e., alpha and beta), have been consistently associated with AD progression [21–23]. These alterations seem to be related with neurophysiological and anatomical disturbances, such as hippocampal atrophy [24], cortical disinhibition or hyper excitability [25, 26]. For comprehensive reviews on this topic see [27–30].
RP calculations at each frequency band are objective and reliable measurements, and thus effective as AD endophenotypes [31–34]. Genetic variants with a major effect on those specific traits may be then identified, as analyzed in previous studies considering four AD risk variants in
The main goal of this work is to analyze the correlation between: (a) the EEG relative power in the conventional frequency bands (i.e., delta, theta, alpha, beta-1, and beta-2) measured in resting state, and (b) genetic variants in sixteen candidate genes previously associated with AD in an Iberian cohort composed by AD patients, individuals with MCI and controls. To achieve this, the correlation between EEG measurements and both (c) the strongest risk factors for AD: age, sex, and
MATERIAL AND METHODS
Subjects
We studied an Iberian cohort of: 1) LOAD patients in different stages of the disease: mild (MIL), moderate (MOD), and severe (SEV) AD; 2) individuals with MCI; and 3) cognitively healthy elderly controls (CON). AD patients and MCI individuals were clinically diagnosed following the criteria of the National Institute on Aging and Alzheimer’s Association (NIA-AA) [39]. AD staging was based mainly on the Mini-Mental State Examination (MMSE) [40]. The control group was composed by individuals over 68 years old, with no signs of dementia or history of neurological disease. Neither patients nor controls were diagnosed with any other neurologic and psychiatric diseases (other than AD and MCI), and were not using any drugs that might affect EEG signal.
At the time of sample collection, subjects were residents of the autonomous community of Castile and Leon, northwestern Spain, or of the northern region of Portugal. Saliva and buccal swabs were selected as sources of biological sampling to maintain the process as noninvasive as possible. Biological samples and EEG data were collected from 253 individuals, mostly equally distributed by population: Portugal (PT) and Spain (SP), and subgroup: controls (25 PT + 26 SP), individuals with MCI (26 PT + 25 SP), and patients with mild (25 PT + 26 SP), moderate (25 PT + 25 SP), and severe LOAD (25 PT + 25 SP). This project was approved by the Ethics Committee of the University of Porto (report # 38/CEUP/2018), Portugal, and written informed consent was obtained from all participants, family and/or legal representatives.
After DNA quality control assessment, prior to genotyping (with a minimum quantity of 10 ng/μl in 45μl minimum volume and integrity of 90%of gDNA greater than 10 Kb in size; see below for more details), 54 subjects were removed before microarray processing since their biological samples did not fulfill the minimum requirements. The distribution of the 199 subjects which biological samples passed the quality control procedures are presented in Table 1, along with information on sex, mean age, and MMSE scores. Situation of the cohort regarding the strongest risk factors for AD: sex, age, and
Number of subjects that qualified for further analyses after genotyping quality control, and respective mean age and MMSE scores
CON, controls; MCI, individuals with mild cognitive impairment; MIL, patients with mild Alzheimer’s disease; MOD, patients with moderate Alzheimer’s disease; SEV, patients with severe Alzheimer’s disease; PT, subjects residents of northern Portugal; SP, subjects residents of Castile and Leon, Spain.
Genotyping
A sample of saliva or three buccal swabs were collected from each of the initial 253 participants of the study. Preference was given for collecting a saliva sample of 2 ml with the Oragene DNA 500 collection kit (DNAgenotek). Buccal swabs were only used for patients at a more advanced stage of the disease, unable to voluntarily spit. After DNA extraction and quality control assessment, samples were genotyped using the genome wide Axiom Spain Biobank Array (Thermo Fisher Scientific) at the Spanish National Center for Genotyping (CEGEN, Santiago de Compostela, Spain, CEGEN-PRB3-ISCIII; supported by grant PT17/0019, of the PE I + D+i 2013-2016, funded by ISCIII and ERDF).
Variant calling quality control (QC) was performed in accordance with the Affymetrix’s best practices guide, and a widely used protocol was followed for both individual and marker analysis [41]. All the analyses were computed with Affymetrix Power Tools and PLINK [42]. In the variant calling QC, individuals with dish QC or QC call rates below the defined thresholds were not considered for further analysis, as well as those with heterozygosity rate greater than the defined acceptance threshold. Finally, the probes belonging to the recommended calling categories were selected and the corresponding variants annotated according to the Genome Reference Consortium Human Build 37 (GRCh37) single nucleotide polymorphism (SNP) assembly. In per-individual QC analysis, the sex of the individuals was confirmed through the sex chromosomes’ homozygosity rate, and duplicates or related individuals were identified through the estimation of identity by descent (IBD) rates. For the pairwise cases where IBD estimates revealed to be greater than the established, one of the two individuals was removed (preference for remaining in the study was given first to patients, then to females and finally to individuals with higher call rates). Finally, individuals with divergent ancestry were disregarded. For this, principal component (PC) analyses were computed using the software EIGENSOFT [43, 44] and merging the dataset with one, publicly available, from the 1000 Genomes Project (1KGP) [45], containing 12 different populations from four ancestry groups: East Asian, African, European and Admixed American. In per-marker QC analysis, SNPs with a significant deviation (α= 1E–06) from Hardy-Weinberg equilibrium in control samples were eliminated, as well as those with missingness rate greater than 5%. Significant differences in missing genotype rates between cases and controls (α= 1E–05) were also considered, and SNPs with no variance in the sample were disregarded.
Gene selection
Among the plethora of markers associated with AD, sixteen candidate genes with functional relevance in the brain were selected for further analyses:
The set of sixteen selected candidate genes and the QC procedures described above, resulted in the analysis of 796 common variants (minimum allele frequency ≥5%) in 199 subjects (Supplementary Table 3). PC analysis showed no population substructure, the gain to the understanding of data with each additional PC revealing to be approximately linear (Supplementary Figure 1). At this point is noteworthy that one of the two variants that determine the
Electroencephalographic recordings
Five minutes of resting-state EEG activity was acquired for each subject using a 19-channel Nihon Kohden Neurofax JE-921A System at the electrodes: Fp1, Fp2, Fz, F3, F4, F7, F8, Cz, C3, C4, T3, T4, T5, T6, Pz, P3, P4, O1, and O2, following the International System 10–20. Signals were recorded at a sampling frequency of 500 Hz with common average reference. Subjects were asked to remain awake with closed eyes during acquisition. EEG data were preprocessed according to the following steps [18, 71]: i) mean removal; ii) digital filtering using a Hamming window bandpass finite impulse response (FIR) filter in the band of interest (i.e., 1–30 Hz); iii) independent component analysis (ICA) to remove oculographic and cardiographic artefacts; iv) segmentation into 5 s epochs; and v) visual rejection of epochs with artefacts. In this study, conventional EEG frequency bands were considered: delta (1–4 Hz), theta (4–8 Hz), alpha (8–13 Hz), beta-1 (13–19 Hz), and beta-2 (19–30 Hz). Gamma band was not included in the analyses due to the possible interference of muscle artefacts in its frequency range [72, 73]. In order to quantify the relative contribution of the previous frequency bands to the global power spectrum, RP was computed. Specifically, RP was calculated from the normalized power spectral density by summing the contribution of each spectral component in a specific frequency band [74].
Statistical analysis
Power analysis
Considering the set of 796 variants and the cohort of 199 subjects, the statistical power of the study for identifying variants explaining a range of phenotypic variance was quantified in the non-centrality parameter [75], taking into account the interplay between experimental sample size, allele frequency and effect size (Supplementary Table 4). For detecting a SNP explaining 10%of trait variance, the statistical power reached 75.8%, increasing this figure to 97.3%considering a proportion of variance of 15%(Bonferroni-corrected significance level α= 6.28E – 05).
The variance in phenotype Y: Var(Y), explained by genetic variant X, can be decomposed into two components:
Correlation between EEG data and covariates: age, sex, APOE ɛ4 presence, and AD status
The normality of the EEG data was assessed (Shapiro-Wilk test), as well as the correlation between EEG data and the strongest risk factors for AD: sex (Kruskal-Wallis’ test), age (Pearson’s correlation test), and
Before assessing correlation with the genetic data, RP values of each frequency band were corrected for the covariates significantly correlated with each one of them. This was computed by adding them as covariates in the linear model:
Correlation between genetics and EEG measures
Correlation between (corrected) RP values at each EEG frequency band and the genetic variants exhibited by the subjects was assessed through Kruskal-Wallis testing. Information of linkage disequilibrium (LD) between pairs of genetic variants was obtained from LDlink [78], accessed on July 15, 2020, for “Iberian Populations in Spain”. For each one of the 796 variants where differences reached the significance level α= 0.005, allele analysis were performed to assess the possibility of identifying a ‘risk allele’, consistently associated with the slowing of brain activity. Indeed, the results concerning the relationship between the set of 796 genetic variants and the EEG measurements were presented considering α= 0.005 and α= 0.05/796 = 6.28E–05. Aware of the modest size of the sample compared to the number of variants analyzed, the first significance level was considered to identify candidate variants to be further analyzed in the future.
RESULTS
Correlation between EEG data and covariates: age, sex, APOE ɛ4 presence, and AD status
Evidence to refute the null hypothesis regarding normality of the EEG data was found for all brainwaves (delta
RP values showed no association neither with the sex of the subjects nor with the presence of
Statistically significant correlations between EEG-based measurements and sampling groups: AD patients, individuals with MCI and controls, were found for the five frequency bands (α= 3.3E-03, Supplementary Table 6) and were in accordance with previous studies. Particularly, when controls were compared with AD patients, statistically significant differences were found for all frequency bands, with patients showing higher RP values for low frequency bands (i.e., delta and theta), while lower for alpha, beta-1, and beta-2. Noteworthy, theta rhythms differentiated between controls and both MCI and AD groups, at a statistically significant level, which may point to the suitability of RP in theta band as case-control biomarker. We also explored the RP spatial patterns for AD, MCI, and control groups. Supplementary Figure 2 shows the previously mentioned slowing process of brain activity associated to AD and MCI. The statistically significant differences between groups mainly appear at parietal and right frontal areas in delta band, at central and parietal areas in theta band, at temporal areas in alpha band, and across all the scalp in beta-1 band. Beta-2 band only showed statistically significant differences at the P3 channel.
When AD patients were analyzed in subgroups according to the disease stage, delta and alpha RP values showed statistically significant differences within AD subgroups (α=1.0E-03, Supplementary Table 7, Supplementary Figure 3).
For further analyses, theta RP values were then corrected for the age of the subjects, RP values of all frequency bands were corrected for the status of the subject: AD patient, individual with MCI and control, and RP values of delta and alpha were also corrected for the stage of the disease. Correlation analyses considering separately the sampling subgroups were also computed.
Correlation between genetics and EEG measures
Some of the analyzed 796 common variants showed to be correlated with the (corrected) EEG-based measures. Considering a significance level α= 0.005, six variants within
Genetic variants showing statistically significant differences in RP values of at least one frequency band (Kruskal-Wallis’ test, significance level α= 0.005). RP values corrected for: 1AD, MCI, and CON status, 2MIL, MOD, and SEV stage, or 3 the age of the subjects. The risk allele is the one associated with the EEG slowing, reflected in an increase of RP in delta and theta bands, and a decrease of RP in alpha and beta bands. Proportions of variance in phenotype explained by the genetic variants (PVE) are presented in parentheses
*Significance level reached after Bonferroni’s correction (α= 6.28e–5).
When the sampling groups: AD patients, individuals with MCI, and controls were analyzed separately, statistically Bonferroni corrected significant differences were found for the three
1RP values corrected for the age of the individuals; *Significance level reached after Bonferroni’s correction (α= 6.28e–05).
For all the six variants where the significance level α= 0.005 was reached, allele correlation was analyzed and in all the cases a risk allele was associated with the slowing of brain activity (higher delta and theta, and lower alpha and beta mean RP values). It is noteworthy that no evidence of differentiation were found between the frequency of the risk allele among AD patients and controls (Supplementary Table 8). The distribution of genotypes according with AD status is presented in Supplementary Table 9.
Concerning the four variants previously associated with RP values in the context of AD [35–38], only
DISCUSSION
Correlation between EEG data and covariates: age, sex, APOE ɛ4 presence, and AD status
When analyzed independently, EEG data provided results in accordance with the literature [80]. As expected, AD patients showed a significant slowing of brain oscillatory activity compared with controls, and within AD subgroup a progressive slowing of EEG was observed along with an increasing of the severity of the disease. Also, theta RP values showed to be correlated with the age of the subjects. From the obtained results is noteworthy that theta rhythms can differentiate between controls and both MCI and AD groups, at a statistically significant level. The suitability of theta band to reflect dementia conditions was already reported in previous studies [81–83]. This agrees with our results, appointing EEG theta band alterations as a potential biomarker to detect hints of neural deterioration, even in pre-clinical states. Indeed, our results supports a likely association between EEG power in theta band and cognitive impairment. Previous studies reported correlation between healthy cognition and reduced tonic theta power [84–86], indicating higher values during resting state as a potential consequence of cognitive impairment. In addition, negative correlation between theta power and hippocampal volume was obtained by means of magnetic resonance imaging [87, 88], which could be related with loss of CA1 hippocampal pyramidal neurons [89]. Another well-known negative correlation in AD is delta and theta power with MMSE [90], which may provide diagnostic value to EEG.
Correlation between genetics and EEG measures
After correlating the genetic information of 796 variants from 16 genes previously associated with LOAD and the RP at each EEG frequency (corrected accordingly for the different covariates), three genes harbored the highest number of variants correlated with EEG-related measures:
Overall, theta RP values showed the highest correlation with the genetic variants tested. Previous studies considered alterations in theta-related EEG activity to be associated with amyloid plaque deposition, which is strongly correlated with AD [92]. Also, correlations between theta band disturbances and known AD biomarkers, such as total-tau (
Relationship between IL1RAP and RP in delta and theta frequency
A genome-wide association study of longitudinal amyloid accumulation in AD patients implicated the microglial activation of
Relationship between UNC5C and RP in alpha frequency band
Relationship between NAV2 and RP in theta and beta frequency bands
Limitations and future research lines
Some limitations were faced in this work and should be examined in future research. Since this study aimed to ascertain relations between RP values and a reasonable quantity of genetic features, the reliability of the results may be sensitive to database size. This was particularly noticeable when the analyses were computed considering separately the sampling subgroups: AD patients, MCI individuals, and controls. The genetic associations found in this work are novel and future research is important to ensure the correlation with the disease. In order to improve statistical power, enlarging the sample of study should be taken into account. It is also noteworthy that RP was the only feature considered in this study to characterize neurodynamic alterations. Although RP is reliable describing the deterioration process of dementia, connectivity measures (amplitude envelope correlation, weighted phase lag index, etc.) or network parameters (clustering coefficient, characteristic path length and betweenness centrality, among others), could provide additional insights in this regard. In future research, we aim to apply alternative metrics that allow to obtain further information on physiological deterioration associated to the progression of AD. In addition, whole scalp EEG activity was considered, losing spatial-relative data. Several studies pointed particular brain areas to be associated with more marked EEG alterations in AD [125–128], and hence the average activity from all electrodes may diminish the statistical significance of the associations, which in fact may be stronger.
CONCLUSIONS
RP values at the conventional EEG frequency bands (i.e., delta, theta, alpha, beta-1, and beta-2) showed to be correlated with some genetic variants in genes previously associated with AD. Globally, theta frequency band showed the greatest correlation with the analyzed genetic variants and is noteworthy its potential as case-control biomarker.
Novel associations between variants in
These associations may contribute to the comprehension of AD pathogenesis, each with a different biological role, and influencing multiple factors that contribute to brain physiology.
Footnotes
ACKNOWLEDGMENTS
We deeply thank the three anonymous reviewers, whose comments and suggestions deeply contributed to the improvement of this work.
We deeply thank all participants, families and institutions involved, namely: Asociación de Familiares de Enfermos de Alzheimer de Ávila, Ávila; Associação de Pensionistas e Reformados de Viana do Castelo, Viana do Castelo;Casa do Povo de Alvito S.Pedro, Barcelos; Santa Casa da Misericórdia de Vila Nova de Gaia; Obra Social Nossa Senhora da Boa Viagem, Porto; Gero Vida, Villaralbo (Zamora); Asociación de Familiares de Alzheimer de León; Residencia San Raimundo en Coreses; Centro de Dia S. João de Deus, da Santa Casa da Misericórdia do Porto; Lar Santa Rita, da Santa Casa da Misericórdia de Caminha; Centro Social e Cultural de Vila Praia de Âncora; Lar Casa de Magalhães; Freixo, Ponte de Lima; Centro de Dia Memória de Mim; Lavra; Armonía Centro de Día, Zamora.
This work was supported by ‘European Commission’ and ‘European Regional Development Fund’ under the project “Análisis y correlación entre el genoma completo y la actividad cerebral para la ayuda en el diagnóstico de la enfermedad de Alzheimer” (Project 0378_AD_EEGWA_2_P), (Co-operation Programme INTERREG V-A Spain-Portugal POCTEP 2014-2020) and the COMPETE 2020-Operacional Programme for Competitiveness and Internationalisation (POCI), Portugal 2020. Portuguese funds are supporting this work through FCT-Fundação para a Ciência e a Tecnologia/Ministério da Ciência, Tecnologia e Inovação in the framework of the project “Institute for Research and Innovation in Health Sciences” (POCI-01-0145-FEDER-007274). SM, AML, IG, and NP are funded by FCT: CEECIND/00684/2017, IF/01262/2014, and CEECIND/02609/2017, and through the Decreto-Lei nº 57/2016 de 29 de Agosto, respectively. Spanish funds are supporting this work through ‘Ministerio de Ciencia e Innovación – Agencia Estatal de Investigación’ and ‘European Regional Development Fund’ under project PGC2018-098214-A-I00 and by ‘CIBER en Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN)’ through ‘Instituto de Salud Carlos III’ co-funded with ‘European Regional Development Fund’ funds. MA is funded by the Grant RYC-2015-18241 from the Spanish Government.
