Abstract
INTRODUCTION
GNE myopathy (MIM # 605820) is a rare auto-somal recessive myopathy caused by mutations inthe gene encoding a key enzyme in sialic acid biosynthesis, UDP-N-acetylglucosamine-2-epimerase/N-acetylmannosamine kinase (GNE) [1, 2]. This disorder was initially described as a distal myopathy in Japanese [3], and as a quadriceps sparing myopathy in Iranian Jews [4], associated with rimmed vacuoles on muscle biopsy. However, the phenotype and histopathological features seem to be more heterogeneous than initially expected, as limb-girdle presentation [5] as well as cases with histological inflammation [6, 7] or without rimmed vacuoles [8] associated to GNE myopathy have been reported.
A recent overview of the GNE mutational spectrum published by Celeste et al. [9] added 7 novel pathogenic variants to the 147 different pathogenic variants associated with GNE myopathy already reported in the literature. Here, we describe 13 novel pathogenic GNE variants associated with GNE myopathy identified in the first large French cohort analyzed to date, raising the total number of different pathogenic variants associated with GNE myopathy to 167. We also analyzed the possible deleterious effects on splicing of the newly discovered as well as all previously published missense variants. Bioinformatics analysis indicates that 6 previously published missense variants were likely to have medium to severe deleterious effects on splicing, allowing possible re-interpretation of the cause of pathogenicity of these variants.
MATERIALS AND METHODS
Patients and genetic analysis
We analyzed mutational data from 32 GNE myopathy index patients diagnosed genetically in the Department of Medical Genetics in Marseille, the reference center for genetic diagnosis of this disorder in France. Following informed consent, genomic DNA was extracted from blood samples. Mutational analysis was done using PCR amplification of GNE coding exons and flanking intronic boundaries (primer sequences available on request) followed by direct Sanger sequencing on a 3500XL Genetic Analyzer ® (Applied Biosystems-Life Technologies, Carlsbad, CA). We used the new hGNE2 reference nomenclature (GenBank accession numbers: Protein: NP_001121699.1; mRNA: NM_001128227.2), as recommended by the latest publication on GNE myopathy nomenclature [9, 10].
Pathogenicity predictions
We used the following predictive algorithms to analyze the pathogenicity of the novel variants associated with GNE myopathy: UMD-Predictor (http://www.umd-hts.eu) [11], SIFT (Sort Intolerant From Tolerant human Protein; http://sift.jcvi.org) [12] and PolyPhen-2 (Polymorphism Phenotyping v2; http://genetics.bwh.harvard.edu/pph2) [13]. Possible deleterious effects of novel and published missense variants on splicing were assessed using the latest version of Human Splicing Finder (HSF 3.0, http://www.umd.be/HSF3), a predictive tool based on a combination of different matrices for auxiliary sequence prediction, and Position Weight Matrices to assess the strength of 5’ and 3’ splice sites and branch points [14]. HSF has been shown previously to precisely predict the effect of mutations affecting 5’ and 3’ splice sites and branch points, and/or activating cryptic splice sites [14, 15]. Correct prediction of activating or inactivating effects of mutations on exonic splicing enhancers (ESE) and exonic splicing enhancers (ESS) has also been validated [14], though it remains critical to exactly determine the precise effects generated by affecting these cis-acting elements, such as identifying the motif/protein couple implicated.
Based on these prediction algorithms (UMD-Predictor, SIFT, PolyPhen-2 and HSF) we classified the overall pathogenicity of the variants into 3 groups of severity (severe, medium and mild) as detailed in Table 1. Allele frequencies were evaluated using the 1000G (1000 Genomes, http://www.1000genomes.org[genomes.org]) and ESP (Exome Sequencing Project, https://esp.gs.washington.edu/drupal) database.
In vitro minigene splicing assay
Functional analysis of possibly abnormal splicing caused by c.717T>G was done using an in vitro minigene splicing assay [16]. The variant c.717T>G was selected for specific functional analysis using the in vitro minigene splicing assay because of the discrepancy between a predicted “benign” missense change at the amino-acid level versus a predicted deleterious effect on splicing. According to previously described methods [15], GNE Exon 5 and approximately 150 bp of flanking intronic sequences were amplified from a control DNA sample and then cloned in the pCAS2 vector. The c.717T>G substitution was subsequently introduced using site-directed mutagenesis and analyzed for specific splicing abnormalities following transfection in HEK 293 cells. Forty-eight hours after transfection, transcriptional analysis was performed and the effect on splicing identified using RT-PCR and further confirmed using direct Sanger sequencing [15].
RESULTS
13 novel pathogenic GNE variants were identified in patients with GNE myopathy
Among the 32 patients with genetic diagnosis of GNE myopathy, 11 presented with homozygous and 21 with compound heterozygous pathogenic variants (Supplementary Table). Of these, 13 pathogenic GNE variants (12 missense and 1 nonsense variants) were novel and were not present in 15190 alleles from exome sequence databases. Using different pathogenicity prediction programs (UMD-predictor, SIFT, PolyPhen-2, and HSF) combined, 10 out of the 12 novel missense GNE variants were found to have severe predicted pathogenic effects (Table 1). Absence of these variants in control datasets (1000G and ESP databases, data not shown), as well as predicted pathogenicity effects, suggest that they are indeed the likely cause of GNE myopathy in patients analyzed in this study.
Possible effects on splicing of GNE missense variants
Prediction algorithms used for pathogenicity assessment of variants based on their effect on protein sequence do not take into account possible effects of these variants on splicing of the GNE gene. Celeste et al. have recently reported possible splice effects for 5 exonic and 7 intronic variants. However, since only missense variants within 5 basepairs of a splice junction were examined in this study, 119 other missense variants remained unexplored [9]. As splice-disrupting variants can be found deeper in exons, we expanded these results by analyzing all published pathogenic missense variants as well as the 13 novel variants from our cohort (Table 1). Using the HSF prediction tool [14], we classified GNE variants in 4 groups based on their effects on splicing (probable, possible, uncertain, not affected). Interestingly, we found that 6 of the published pathogenic variants (NM_001128227.2: c.98A>G, c.179T>G, c.271A>G, c.731A>T, c.1394A>G, c.2098G>A), and 5 out of the 13 novel pathogenic variants reported here, were likely to have a previously unreported potential splicing effect (Table 1).
Moreover, 2 pathogenic variants (c.271A>G and c.715G>A) [9] with mild predicted effects on protein, and even a supposed polymorphism (c.717T>G) [9] were predicted to have a possible effect on splicing, suggesting that they might be more deleterious than previously thought. Since we encountered the c.717T>G variant in one patient (P15), we analyzed in detail its possible deleterious effect on splicing using an in vitro minigene splicing assay [15, 16]. For this variant, the corresponding amino-acid change (p.Asp239Glu) is classified as “benign” (PolyPhen-2 [13] pathogenicity score 0.245), while in silico analysis using Human Splicing Finder [14] predicts the activation of a cryptic donor site and the inactivation of an exonic splicing enhancer (ESE). In vitro analysis using the minigene splicing assay confirms that the c.717T>G variant activates a cryptic donor site resulting in the deletion of 145 bp in the 3’ extremity of exon 5. The abnormally spliced transcript (Fig. 1) is probably deleterious as it leads to a frameshift mutation introducing a premature translation termination codon in the amino-acid sequence (Asp239GlufsX9).
DISCUSSION
In this study, we describe 13 novel pathogenic variants associated with GNE myopathy, raising the total number of published pathogenic GNE variants to date to 167. We also examined all currently known pathogenic GNE missense variants for predicted deleterious effects on splicing to get a better understanding of the pathogenicity range of the GNE mutational spectrum. Taken together, a total of 9.7% (13 out of 134) among all pathogenic GNE missense variants associated to GNE myopathy are predicted to have a deleterious (possible or probable) effect on splicing. This result is in accordance with previous reports underlining the importance of deleterious effects on splicing caused by exonic substitutions [17], but has not been pointed out to date in the context of GNE myopathy. Splice effect prediction should be taken into account for mutational data interpretation in genetic diagnosis, as efficient correlation between predictive data and confirmation at the transcriptional level has been shown [14]. However, it is important to keep in mind that even though in silico tools for splicing defect prediction are useful for genetic diagnosis, they have inherent limitations depending on the underlying algorithms [18] and warrant functional confirmation if possible. Importantly, in silico tools for splicing defect prediction may be of interest for selecting variants for further functional transcriptional analyzes. In particular, substitutions predicted to cause benign or moderately severe missense changes at the amino-acid level (as evaluated in example using PolyPhen-2 [13]), might have a severe predicted deleterious effect on splicing. In our cohort, this was the case for the variant c.717T>G, predicted to cause a “benign” amino-acid change while possibly activating a cryptic splice-donor site, a fact that was further confirmed using a functional minigene splicing assay.
Most of the reported pathogenic GNE missense variants are considered to have a qualitative functional, and not quantitative, deleterious effect [9]. However, several mutations including, as expected, nonsense or frameshifting mutations, have been shown to cause GNE mRNA [19] or protein instability [20] resulting in severely decreased levels of expression and therefore implicated as causative of quantitative pathogenic effects. We emphasize that, as for numerous other genetic diseases [17], a proportion of substitutions predicted to cause deleterious missense changes at the amino-acid level in fact cause deleterious changes at the mRNA level due to abnormal splicing, affecting quantitatively or qualitatively the GNE enzyme function. Moreover, variants predicted as “possibly pathogenic” or even apparently “benign”, based on missense pathogenicity prediction algorithms such as PolyPhen-2 [13] could likewise have deleterious effects due to abnormal splicing. Given the variability of the onset and the evolution of GNE myopathy [21, 22], intra-individual variability in splicing efficiency and of nonsense-mediated mRNA decay [23, 24] could be a possible explanation for the observed clinical heterogeneity. Another possible component of this heterogeneity could be the association of disease-causing mutations in trans. Indeed, the potential pathogenicity of different variants in the GNE gene may implicate variant combinations modulating the clinical presentation and evolution of GNE myopathy.
Altogether, our study contributes novel GNE mutational data from the first large French cohort analyzed to date, in a context of recent advances in therapeutic trials for this disease [25], for which the importance of providing an accurate genetic diagnosis for inclusion of patients becomes even more important.
CONFLICT OF INTEREST
The authors have no conflict of interest to report.
Footnotes
ACKNOWLEDGMENTS
We sincerely thank Christophe Pécheux, Ghadi Raï, Christophe Béroud and the referring neurologists, in particular Pascal Laforêt, Bruno Eymard, Françoise Darcel, Laila Bastaki, Satish Khadilkar, Meena A. Kannan, Shahram Attarian and Jean Pouget, for their contribution to this work. We sincerely thank the patients for their implication.
We sincerely thank the APHM, INSERM and Aix Marseille University for supporting this work.
