Abstract
Background
Genetic and clinical observations have indicated T cells are involved in MS pathology. There is little insight in how T cells are involved and whether or not these can be used as markers for MS.
Objectives
Analysis of the gene expression profiles of circulating CD8+ T cells of MS patients compared to healthy controls.
Methods
RNA from purified CD8+ T cells was sequenced and analyzed for differential gene expression. Pathway analyses of genes at several p-value cutoffs were performed to identify putative pathways involved.
Results
We identified 36 genes with significant differential gene expression in MS patients. Four genes reached at least 2-fold differences in expression. The majority of differentially expressed genes was higher expressed in MS patients. Genes associated to MS in GWAS showed enrichment amongst the differentially expressed genes. We did not identify enrichment of specific pathways amongst the differentially expressed genes in MS patients.
Conclusions
CD8+ T cells of MS patients show differential gene expression, with predominantly higher activity of genes in MS patients. We do not identify specific biological pathways in our study. More detailed analysis of CD8+ T cells and subtypes of these may increase understanding of how T cells are involved in MS.
Introduction
Multiple sclerosis (MS) is characterized by immune cell infiltration and inflammation of the central nervous system (CNS) leading to loss of neurological function. Recent advances in genetics also indicate a large role of the immune system in the genetic risk factors, 1 in addition to epigenetic2,3 and environmental factors. 4 T-helper cell specific pathways have been shown to be involved through pathway analysis of the genetic risk variants. 5 Further corroboration for T cell involvement is provided by biological pathway and gene-set enrichment analyses of MS genetic and genomic studies. 6 Whilst CD4+ T cells have been considered to be major contributors in the pathological processes observed in MS,7–10 histological examinations of MS lesions show that CD8+ T cells outnumber CD4+ T cells in active lesions.11,12 Further studies on the role of CD8+ T cells in MS indicate that these cells may act as disease effectors.13,14
Earlier studies of gene expression in MS performed in whole blood and peripheral blood mononuclear cells (PBMCs) have not shown large scale differences between MS patients and healthy controls.15–17 A study of whole blood gene expression by Parnell
Whilst genome-wide association studies (GWAS) are not affected by a putative patient treatment, the gene expression profiles are affected by immunosuppressive and immunomodulatory drugs in use for treatment of MS. In our study, we therefore included untreated MS patients only. We performed RNA sequencing of peripheral CD8+ T cells from 20 MS patients and 20 healthy controls to identify genes that are differentially expressed by these cells in MS patients.
Materials and methods
Patients and healthy controls
Female MS patients with relapsing remitting MS and no other autoimmune disorders (N = 20) were recruited from the MS clinic at the Oslo University Hospital. 18 of these patients never received immunomodulatory treatment for MS, whereas two patients underwent interferon-beta treatment that was stopped respectively two and four years prior to inclusion in this study (Supplementary Table S2). None of the patients was treated with any medication at the time of inclusion in this study. All patients met the 2010 McDonald diagnostic criteria. 19 Unrelated female, age-matched (N = 20) healthy controls were recruited from the patients’ social network or hospital employees. All patients and healthy controls provided informed consent for this study, which was approved by the local medical ethical committee (REK2011/1846).
Purification of CD8+ T cells
Whole blood was drawn into EDTA coated vacuum tubes (Med-Kjemi AS, Norway). PBMCs were purified within two hours of venipuncture using lymphoprep (Axis Shield, Scotland). The PBMCs were washed with PBS and resuspended at a density of 1*108 cells per ml in purification buffer (1 mM EDTA and 2% FCS in PBS).
CD8+ T cells were isolated by positive selection using an Automacs cell separation column (Milteny, Israel) and the CD8+ positive selection kit (Milteny, kit #130-045-201). Flow cytometry analysis using mouse anti-human CD8 (clone RTF-8, Southern Biotech) was performed to confirm that at least 95% of the purified cells were CD8 positive, mouse IgG1 (clone 15H6, Southern Biotech) was used as isotype control. Cells were pelleted and resuspended in 350 μl RNAprotect cell reagent (Qiagen, The Netherlands).
RNA library preparation and sequencing
Cells were lysed using QIAshredder columns (Qiagen) and RNA was isolated using RNAeasy mini columns (Qiagen) according to the manufacturers protocol. The RNA concentration was measured on the Nanodrop ND-1000 Spectrophotometer (Thermo Fisher Scientific Inc., Norway). A random selection of RNA samples was checked for integrity using an Agilent 2100 Bioanalyser (Agilent, UK) yielding RIN-values above 7.0. 100 ng of RNA was processed using the TruSeq stranded mRNA library preparation kit # RS-122-2001 (Illumina, United States of America) according to the manufacturers protocol. Indexed libraries were sequenced by multiplexing 12 barcoded libraries per lane on an Illumina NextSeq using a 75 bp paired-end sequencing run. In total four sequencing runs were performed, with 4 samples undergoing library preparation and sequencing in duplicate.
Data preprocessing
Using “kallisto” 20 the fastQ-files were aligned to the “HomoSapiens.GRCh38.cDNA” transcriptome which was downloaded from the Ensembl FTP-server. Quality controls of the resulting mapped reads and differential gene expression analysis were performed in R3.4.2. Per-sample-per-transcript read counts were loaded into the DESeq2 package 21 using the “tximport” function of the “tximport” package. 22 To identify putative outlier samples, a Euclidean distance matric was calculated for all transcripts with at least 50 observations in every sample. Surrogate variable analysis was performed using the “svaseq” package 23 in R to account for hidden confounders in the data. Only genes with at least one read in more than half the analyzed samples were kept for differential expression analysis. Furthermore, genes in the HLA region were excluded to avoid potential mapping issues given the hypermorphic nature of these genes. 24
Differential expression analysis
The “DeSeq2” package in R was used for differential expression analysis. The design matrix included surrogate variables identified through the SVA-package in addition to the case-control status. In order to account for multiple testing, we applied a false discovery rate correction using the option “Benjamini and Hochberg” in the DeSeq2 package. Adjusted p-values below 0.05 were considered significant.
Testing for enrichment of nominally significant genes in GWAS, CD4+ T cell gene expression and CD8+ T cell protein expression studies
From the annotation provided for the 200 non-HLA genetic variants associated to MS, 1 272 gene IDs were extracted. From the genes expressed in the current CD8+ T cell data set, the MS-associated proportion of those that were not significantly (nominal p-value >0.05) differentially expressed was compared against those that were significantly (nominal p-value ≤0.05) differentially expressed using Fishers’ exact test.
We compared the genes that were nominally differentially expressed in our earlier gene expression study of CD4+ T cells 18 against those that were nominally significantly differentially expressed in the current CD8+ T cell gene expression data. We only considered genes that were expressed in both cell types. Fishers’ exact test was used to test for significant enrichment or depletion of genes that were significant or not significant in both datasets.
Similarly, we compared genes encoding proteins that were nominally significant differentially expressed in an independent proteomics analysis that included CD8+ T cells from 13 MS patients and 14 healthy controls 25 against the genes nominally significant in the current analysis. Fishers’ exact test was used to test for significant enrichment or depletion of genes that were significant or not significant in both datasets.
Pathway analysis
Genes with adjusted p-values below 0.05; 0.1 and 0.4 in the differential expression analysis were imported into QIAGEN’s Ingenuity® pathway Analysis software (IPA®, QIAGEN, Redwood City, CA, USA, version 45868156, build version: 4,84,108 M). The input for these three pathway analyses were the differential expression ratios and associated p-values for the respective genes. Default analysis settings were used with the following confidence for species and tissues and cells, “
Targeted pathway analysis for pathway dysregulation scores 26 were performed for 18 pathways involved in T cell homeostasis (Supplementary Table S1), these pathways were selected from the gene set enrichment analysis database. 27 Pathways that contained the term “GO_T_CELL*” were analysed for differences between MS patients and controls using the R package “pathifier”. 26
Results
A summary of the characteristics of the patients and healthy controls in this study is provided in Table 1. Detailed characteristics are provided in Supplementary Table S2. The mean number of mapped reads was 32.3 million (range 13.1–62.8 million). Duplicate samples showed highly similar profiles based on the Euclidean sample distance (Supplementary Figure S1). Duplicated samples were merged into one pseudo-sample in further analyses using the “collapseReplicates” function of the DeSeq2 R package.
Basic characteristics of MS patients and healthy controls.
EDSS: expanded disability status scale.
We performed differential gene expression analysis with multiple testing correction according to Benjamini and Hochberg (results of this analysis are provided in Supplementary Table S3). 36 genes reached an adjusted p-value below 0.05 (Table 2). Four genes displayed absolute fold-changes above 2 (
Genes reaching a significant Benjamini-Hochberg corrected p-value in the differential gene expression analysis of CD8+ T cells from MS patients and healthy controls.
aBenjamini and Hochberg corrected p-value.
bFold-change of gene expression for MS patients compared to healthy controls.
cFold-change of gene expression between MS patients and controls, irrespective of the direction of the change.
drs-ID of SNP that was identified at this gene locus in the latest IMSGC association study. 1

Boxplots of significantly differentially expressed genes with absolute fold-changes above 2 in CD8+ T cells from MS patients compared to healthy controls, ordered by the most significant genes. Individual expression levels are presented for healthy controls (squares) and MS patients (triangles). The boxes delimit 25% and 75% of the values; the horizontal bars represent the median value. The whiskers represent values that do not exceed a distance of 1.5 times the interquartile range from the middle 50% of the data. (a) The
Gene pathway analyses
In order to investigate whether differentially expressed genes were enriched in specific pathways, Ingenuity Pathway Analysis was used for analyses for the 36 genes that were significant, as well as for genes that reached an adjusted p-value below 0.1 (83 genes) and 0.4 (530 genes). These genes and their adjusted p-values are provided in Supplementary Table S3. We did not observe significant enrichment in specific biological pathways (Supplementary Table S5-S7). After correction for multiple testing, pathways in the GSEA database that contain the terms “GO_T_CELL_*”, including “GO_T_CELL_ACTIVATION” did not show significant pathway dysregulation scores when comparing the MS patient scores against the controls (Supplementary Table S8).
Analysis of MS susceptibility genes
The IMSGC recently published a genomic map for MS, which identified 200 non-HLA single nucleotide polymorphisms (SNPs) with association to MS. 1 We investigated whether genes most proximal to these MS-associated SNPs were overrepresented among the differentially expressed genes in the CD8+ T cell dataset. Both for the comparison of the nominally significant genes (25 genes with MS-associated SNP, Supplementary Table S3) as well as for the multiple testing corrected significant genes (3 genes with MS-associated SNP, Table 2), we observed a higher number of MS-susceptibility genes than expected by chance (comparison provided in Table 3).
Two-by-two table of genes reaching a multiple testing corrected p-value below 0.05 (or nominal p-values below 0.05 between brackets) in the differential gene expression analysis of CD8+ T cells from MS patients and healthy controls, divided into genes that are located in MS risk loci according to the latest GWAS study by the IMSGC. 1 Statistical testing of significance was performed according to Fishers’ exact test.
aGenes annotated to the IMSGC GWAS significant SNPs. 1
bGenes with Benjamini-Hochberg corrected p-values ≤0.05 in the differential expression analysis (nominal p-values ≤0.05).
Analysis of CD4+ T cell differential gene expression analysis data
We previously analyzed RNA sequencing data of CD4+ T cells of MS patients and controls. 18 In order to identify possible overlap between differentially expressed genes we compared these datasets. For this, we only considered genes that were expressed by both cell types (indicated in Supplementary Table S3). We analyzed whether there was more overlap than expected by chance for genes reaching a nominally significant differential expression. Amongst the 11,902 genes expressed in both CD4+ and CD8+ T cell we observed no significant overlap of genes that reach nominal significance in either dataset (Fishers’ exact test p-value 0.26, Supplementary Table S9A).
Analysis of CD8+ T cell protein differential expression analysis data
We previously analyzed the protein expression profile of CD8+ T cells from MS patients and healthy controls. 25 We compared the genes encoding the proteins in this differential expression analysis against the differential expression of genes in the current study. We only considered genes detected in both datasets (2,113 genes). No significant enrichment of nominally differentially expressed genes/proteins was observed in this comparison (Fishers’ exact test p-value 0.07, Supplementary Table S9B).
Discussion & conclusion
We show that gene expression of CD8+ T cells is significantly different between MS patients and healthy controls. We identified 36 genes that are significantly differentially expressed upon multiple testing correction. Four of these genes have a more than 2-fold difference in expression between MS patients and healthy controls. A striking majority of the differentially expressed genes have higher levels of expression in MS patients compared to the healthy controls. The predominant higher expression of differentially expressed genes may be attributable to a more active state of the CD8+ T cells. However, we did not identify specific T cell activation pathway signatures amongst the differentially expressed genes that can confirm this hypothesis. Further investigation of the activation status and potential of CD8+ T cells obtained from MS patients and healthy controls should be performed in order to conclude whether or not activation state underlies the observed higher expression of the majority of differentially expressed genes in CD8+ T cells from MS patients.
Amongst the differentially expressed genes that are nominally significant and those that remain significant after multiple testing correction, there was a significant enrichment of genes that were implicated in the latest large-scale GWAS. 1 It should be noted that the annotation of GWAS results to the underlying genes is an ongoing process and it is likely that not all the annotated genes at these loci are relevant for MS. This would however only lower our power to detect a significant enrichment. We therefore conclude that genes that are differentially expressed in CD8+ T cells of MS patients are more likely to be at MS-associated genetic loci.
Although patients were not treated for MS at the time of inclusion in this study, we sampled these patients after disease onset. Therefore, we cannot exclude that the observed differences are an indirect consequence of the disease activity. Whether the gene expression of CD8+ T cells plays a role in the onset of the disease remains to be investigated in a different study design. Furthermore, a larger sample size of healthy controls and patients with different disease durations, or a collection of patients that have samples available at multiple timepoints is needed in order to better evaluate the impact of disease duration on the gene expression differences. Similarly, this study is not sufficiently powered to assess whether specific clinical parameters explain outlier samples observed in some of the significantly differentially expressed genes (e.g. outlier samples are observed in
Pathway analyses of differentially expressed genes with different p-value cutoffs, did not point out specific biological pathways that are affected in the MS patients. A targeted pathway analysis for T cell pathways did not reveal differences in the dysregulation scores 26 for these between the patients and controls. It should be noted that biological pathways that are not currently annotated in the databases used are not included in the pathway analyses performed, therefore the possibility remains that there are biological pathways in CD8+ T cells that are affected by MS.
Comparison of this analysis against an earlier differential gene expression study of CD4+ T cells from 20 MS patients (17 included in the current study) and 19 healthy controls (all included in the current study) did not show a significant overlap of results, indicating that the observed changes in gene expression may not be observed in other circulating T cells. Furthermore, comparison of the differential gene expression analysis of CD8+ T cells against an earlier differential protein expression analysis of CD8+ T cells did not show overlap in nominally significant genes in these studies. It should be noted that both the current gene expression study and the earlier protein expression study have relatively low power. Furthermore, due to additional regulatory steps between the transcription into RNA and translation to protein, the correlation between gene expression and protein expression is not linear and influenced by many additional factors for a large number of genes. 28 It remains to be further investigated how differences in gene expression in CD8+ T cells from MS patients relate to putative differences in protein expression.
The most significant differentially expressed gene
This study was performed on purified CD8+ T cells, which provides a detailed insight in the gene expression of these specific cells. It should however be noted that CD8+ T cells can be divided further into subclasses. This leaves the possibility open that some or all of the observed differential gene expression is attributable to different proportions of CD8+ T cell subsets. This could be overcome by future experiments that include flow cytometry analysis of proteins of interest, combined with cell type markers or experiments that apply single cell RNA sequencing, in which each individual cell can be traced back from the sequencing data and subdivisions can be made based on signature genes of subsets of larger cell populations. The patients in this study were untreated at the time of inclusion, which overcomes potential immune system changes induced by MS medications. Whilst some patients were recently diagnosed, others had been diagnosed years before the inclusion. In general, these patients were benign based on their expanded disability status scale (EDSS) scores at the time of inclusion. These patients were included in this study during remission; it remains possible that sampling during a relapse will provide additional insights in differential gene expression of MS patients’ immune cells compared to healthy controls.
In conclusion, our study shows that CD8+ T cell gene expression is different in MS patients as compared to controls. Whether this is the result of different active processes in the T cells of MS patients, or the result of different subsets of CD8+ T cells for MS patients remains to be investigated.
Supplemental Material
sj-pdf-1-mso-10.1177_2055217320978511 - Supplemental material for CD8+ T cell gene expression analysis identifies differentially expressed genes between multiple sclerosis patients and healthy controls
Supplemental material, sj-pdf-1-mso-10.1177_2055217320978511 for CD8+ T cell gene expression analysis identifies differentially expressed genes between multiple sclerosis patients and healthy controls by IS Brorson, AM Eriksson, IS Leikfoss, V Vitelli, EG Celius, T Lüders, T Berge, HF Harbo, H Nilsen and SD Bos in Multiple Sclerosis Journal—Experimental, Translational and Clinical
Supplemental Material
sj-xlsx-2-mso-10.1177_2055217320978511 - Supplemental material for CD8+ T cell gene expression analysis identifies differentially expressed genes between multiple sclerosis patients and healthy controls
Supplemental material, sj-xlsx-2-mso-10.1177_2055217320978511 for CD8+ T cell gene expression analysis identifies differentially expressed genes between multiple sclerosis patients and healthy controls by IS Brorson, AM Eriksson, IS Leikfoss, V Vitelli, EG Celius, T Lüders, T Berge, HF Harbo, H Nilsen and SD Bos in Multiple Sclerosis Journal—Experimental, Translational and Clinical
Footnotes
Acknowledgements
We acknowledge the contributions from the patients and healthy controls in our studies and the health personnel involved in drawing blood samples.
Conflicts of Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Data access
For the purpose of MS-related research, requests to access raw data in this study can be addressed to grants@ous-hf.no.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: Funding was provided by the Norwegian Research Council grant ID 240102; the South-Eastern Norway Health Authorities grant IDs 276940/2017114 and the MultipleMS consortium in the Horizon 2020 Framework Programme for Research and Innovation. Further funding was provided by unrestricted research grants from the Odd Fellow society Norway and Sanofi Genzyme.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
