Abstract
Next-generation sequencing (NGS) panels are used widely in clinical diagnostics to identify genetic causes of various monogenic disease groups including neurometabolic disorders and, more recently, lysosomal storage disorders (LSDs). Many new challenges have been introduced through these new technologies, both at the laboratory level and at the bioinformatics level, with consequences including new requirements for interpretation of results, and for genetic counseling. We review some recent examples of the application of NGS technologies, with purely diagnostic and with both diagnostic and research aims, for establishing a rapid genetic diagnosis in LSDs. Given that NGS can be applied in a way that takes into account the many issues raised by international consensus guidelines, it can have a significant role even early in the course of the diagnostic process, in combination with biochemical and clinical data. Besides decreasing the delay in diagnosis for many patients, a precise molecular diagnosis is extremely important as new therapies are becoming available within the LSD spectrum for patients who share specific types of mutations. A genetic diagnosis is also the prerequisite for genetic counseling, family planning, and the individual choice of reproductive options in affected families.
Introduction
Next-generation sequencing (NGS), also referred to as massively parallel sequencing, is a high-throughput DNA sequencing technology that enables the fast generation of data on thousands to millions of base pairs of DNA from an individual patient by sequencing large numbers of genes in a single reaction. 1 The concept behind NGS technology is similar to traditional capillary electrophoresis sequencing. The critical difference lies in the fact that instead of sequencing a single DNA fragment, NGS can sequence millions of fragments in a massively parallel fashion. The general workflow of NGS includes 4 main steps—library preparation, cluster generation, sequencing, and data analysis. Sequence reads are produced from fragment libraries, a pool of adaptor-ligated and enriched DNA fragments. One advantage is that relatively little patient DNA is needed to produce a library. In step 1, patient DNA is randomly fragmented by a variety of methods and then prepared for sequencing by ligating specific adaptor oligonucleotides to both ends of each DNA fragment. Adapter-ligated fragments are further enriched with specific oligonucleotides designed for the target genes included in the NGS panel and are then polymerase chain reaction (PCR) amplified. The prepared library is loaded into a flow cell for cluster generation and subsequent sequencing. During sequencing, short read lengths (35-250 bp, depending on the platform) sequences that are produced are then aligned to a reference genome with bioinformatics software. 1 During data analysis, variant calling can be achieved by various standard and in-house analysis pipelines. All detected variants are checked against standard databases (eg, dbSNP137, 1000 Genomes Project, Exome Variant Server, ExAC Browser, OMIM catalog, ClinVar, Human Gene Mutation Database) to enable interpretation of the pathogenicity of a given variant. For exclusion of technical artifacts and segregation testing of all likely disease-causing variants, PCR and Sanger sequencing are usually performed according to standard protocols.
The rapid emergence and huge success of these technologies in the research field paved the way for a new era in genetic diagnostics. Next-generation sequencing panels are used widely in clinical diagnostics to identify genetic causes of various monogenic disease groups, such as epilepsy, 2 intellectual disability, 3,4 neurodevelopmental disorders, 5 and neurometabolic disorders, 6 among others. However, many new challenges have been introduced through these new methods, both at the technical level and at the bioinformatic level, with consequences including new requirements for interpretation of results, and for genetic counseling. In order to define the precise role of NGS in diagnostics, consensus guidelines have been proposed, which mostly deal with NGS testing for rare and monogenic diseases. 7
Lysosomal storage disorders (LSDs) are a group of rare diseases in which a genetic mutation leads to the accumulation of specific macromolecules that impair the function of the lysosome often causing secondary disturbances in other cellular and metabolic processes. With a combined incidence of ∼1 in 1500 to 7000 live births, they encompass >50 different entities. 8 Unfortunately, there is considerable clinical variability within each disease phenotype, and also overlapping symptomatology among LSDs which hampers the precise diagnosis of these disorders. 9 This makes a diagnosis solely based on the clinical presentation difficult in many cases.
In the past, many methods have been used to approach a diagnosis in the LSDs. 10 Laboratory assays based on identification or quantification of the storage products have been one of the most favored methods. Tests for elevated secreted substrate material are routinely used to screen for glycosaminoglycans and oligosaccharides in patients suspected of having mucopolysaccharidoses and disorders that present with oligosacchariduria. The urine screens are usually very sensitive; however, affected individuals with normal urine screens and healthy individuals with elevated excretions have been reported. 11 The second main pillar of establishing a diagnosis in LSDs is enzyme activity measurements. Enzyme activity detected in dried blood spots is very useful in the diagnosis of a small number of LSDs; however, this method usually requires verification with a second type of assay. Measurement of enzyme activity in leukocytes, plasma, or fibroblasts is often a logistic and technical problem, with particular storage and transport requirements as well as laboratory expertise. Enzyme activities are usually not reliable in detecting heterozygous carriers of a disease. While biochemical methods are still the gold standard of diagnosis, they are often very laborious, time consuming, and require a preselection by clinical phenotype to reduce the number of biochemical tests used for each patient suspected of LSDs. 12 Diagnosis of LSDs is still a clinical challenge and can often take several years.
Even having achieved a diagnosis with traditional assays, establishing a molecular genetic diagnosis is still important for several reasons. The precise molecular diagnosis is of great importance in the LSDs as new therapies are becoming available for specific types of LSDs and for patients who share specific types of mutations. 9,13 The complexity underlying the pathogenesis of LSDs and the small population of patients, who are many times children, make the development of therapies for these diseases challenging. Current treatments are only available for a small subset of LSDs and have, for the most part, not been effective at treating or preventing neurological symptoms. Clinical trials have demonstrated some clinical benefit of enzyme replacement therapy in Gaucher disease, Fabry disease, mucopolysaccharidoses types I, II, IV, VI, VII, and Pompe disease. However, the usefulness of enzyme replacement therapy is limited due to the fact that a given enzyme preparation does not have beneficial effects on all aspects of a disorder to the same degree. The effectiveness of each therapy is determined by whether the symptoms it affects are those which are most important to the patients’ overall health and quality of life. Additionally, clinical studies have shown that the pathologic changes underlying many symptoms of a LSD are not reversible, even with long-term treatment. 9 Thus, a precise molecular diagnosis early in the course of the disease can be of great significance. A definitive diagnosis is also the prerequisite for patients to have access to studies of newly developed therapeutic strategies, such as small molecules for substrate reduction or for enzyme-enhancing therapy. 9,13
The Application of NGS in the Diagnosis of LSDs: Overview of the Literature
Studies of Purely Diagnostic Targeted NGS Panels
As outlined earlier, there are many factors hampering the diagnosis of LSDs, among others the phenotypic and penetrance variability, common signs and symptoms between certain disease groups, the genetic heterogeneity in some forms, and the difficulties of biochemical diagnostics. The application of a broad diagnostic tool could enhance sometimes very long diagnostic process for the affected families and thereby grant quicker access to current and investigated therapeutic options while providing the basis for more appropriate genetic counseling. There is only scarce published data on the application of NGS in the diagnostics of LSDs (see Table 1).
Aspects of Selected Studies for NGS Analysis of LSDs.
Abbreviations: LSDs, lysosomal storage disorders; NGS, next-generation sequencing; WES, whole exome sequencing.
The recent study by Fernandez-Marmiesse et al 12 reports for the first time the results of a pilot project to evaluate the application of NGS to mutation screening in the diagnosis of LSDs. With the help of a NGS-based approach for the simultaneous analysis of 57 lysosomal genes, they were able to correctly diagnose 18 positive blinded controls and provide genetic diagnosis to 25 patients with a high to moderate clinical suspicion index for LSDs from Spain and Portugal. In their study, they analyzed 84 probands, including 18 positive controls in whom biochemical testing and Sanger sequencing had already ascertained a diagnosis of LSD and 66 patients with a suspected LSD. They used a custom SureSelect oligonucleotide probe library to capture all coding exons and exon–intron boundaries of 57 genes known to be associated with LSDs according to GeneReviews. 12 Sequencing was performed on SOLiD4 and HiSeq2000 Illumina platforms, with the latter yielding better coverage (99.97% of bases covered by at least 20 reads and only 0.03% of target bases with a coverage of less than 20 reads). As to the technical difficulties encountered, the authors emphasize that 2 factors limit enrichment. The local sequence architecture has a strong effect on the efficiency of DNA enrichment for individual exons, 14 so that exons close to repetitive regions are not fully covered. For exons located in CpG islands (high frequency of CpG sites—regions of DNA where a cytosine nucleotide is followed by a guanine nucleotide in the linear sequence of bases along its 5′→3′ direction), coverage also decreased dramatically, producing gaps in coverage of certain exons of LSD genes, mostly in IDUA, GBA, and GAA genes. The problem was overcome, however, with a greater overall coverage achieved on the HiSeq platform. To establish an NGS–LSD assay as a diagnostic tool for routine use, it is essential to eliminate sequence gaps in order not to miss mutations in certain exons. 12
The authors also point out the difficulties encountered with data filtering and interpretation in the NGS assay. One of the main challenges of using NGS for diagnostic applications is the interpretation of a massive number of genomic variants detected by sequencing platforms. Coverage irregularities contributed to false-positive variants. Thus, by improving enrichment efficacy or increased coverage, false-positive variants can be largely avoided. Efficient and reliable identification of causative variants is crucial to the implementation of this technology in routine diagnostics. The authors conclude that they would not propose their assay as the sole diagnostic tool but as a useful adjunct to diagnosis for specialists in the clinical management of patients with LSD especially given its advantage to provide accurate information in a short time. 12
Another example for the clinical utility of gene panels in the diagnosis of LSDs is the diagnosis of late-onset Pompe disease by a NGS approach published by Lévesque et al in 2016. 15 The authors developed a gene panel to analyze the coding sequences and splice site junctions of the GAA gene along with 77 other genes causing muscle disorders with overlapping phenotypes. They achieved a median coverage of 200× with all GAA exons successfully covered with >20× and only 0.3% of exons across all genes in the panel were <20×. Using known variants in patients with Pompe disease and controls, the authors could demonstrate an excellent sensitivity (100%) and specificity (98%) across all selected genes. Analyzing 34 patients with suspected muscle disorders of unknown etiology, the study could show a detection rate of 32%. Especially in patients with an atypical presentation, including 1 late-onset Pompe disease case, the gene panel was instrumental in reaching a diagnosis. In all diagnosed patients biochemical testing (acid α-glucosidase activity) confirmed the molecular results. The NGS panel was shown to facilitate the diagnosis in patients showing nonspecific muscle weakness or atypical phenotypes. 15
Diagnostic and Research NGS Panels to Detect LSDs
According to the “guidelines for diagnostic NGS” on behalf of the EuroGentest and the European Society of Human Genetics, 7 “for diagnostic purpose, only genes with a known (ie, published and confirmed) relationship between the aberrant genotype and the pathology should be included in the analysis” of NGS assays. However, several applications with a research focus have ventured to apply NGS technologies to identify sequence variations in known, but also in potentially new, candidate genes for LSDs. Among the first publications of an extensive research panel for the identification of LSDs was the study by Di Fruscio et al in 2015 aiming to detect DNA sequence variation in the autophagy–lysosomal pathway. 16 The autophagy–lysosomal pathway regulates cell homeostasis and plays an important role in LSDs and common neurodegenerative diseases. 17 The authors developed Lysoplex, a targeted NGS approach enabling the parallel analysis of 891 genes involved in lysosomal (194 genes), endocytic (627 genes), and autophagic (106 genes) pathways. Lysoplex was successfully validated on 14 different types of LSDs and used to analyze 48 patients with unknown mutations and a clinical phenotype of neuronal ceroid lipofuscinosis (NCL). The average coverage depth for 95% of the target sequences was 40× and for 80% of the target sequences 100× was achieved. With a training set of 16 genomic DNAs from patients affected by 14 different forms of LSDs (including MPSI, II, IIB, VI, Fabry disease, NPC1, Metachromatic Leukodystrophy(MLD) Pompe disease among others), whose molecular diagnosis was already known except for 2 cases, Di Fruscio and coworkers were able to detect all of the previously known mutations and to identify disease-causing mutations in the 2 patients with undiagnosed LSD. 16
The authors also demonstrate the efficacy of Lysoplex in the analysis of patients with unknown molecular defect with a clinical diagnosis of NCL. The NCL is a family of autosomal recessive neurodegenerative disorders that annually affect 1:100 000 live births worldwide. This family of diseases results from mutations in 1 of the 14 different genes that share common clinical and pathological etiologies. Although the disease phenotypes may vary in their age and order of presentation, all typically include progressive visual deterioration and blindness, cognitive impairment, motor deficits, and seizures. Advances have been made in genetic diagnosis and counseling for families. However, comprehensive treatment programs that delay or halt disease progression have been elusive. Current disease management is primarily targeted at controlling the symptoms rather than “curing” the disease. 18 A definite molecular diagnosis in such patients is the prerequisite for the enrollment in ongoing and future therapeutic trials. Di Fruscio and coworkers analyzed 48 individuals with the clinical diagnosis of NCL and detected in 29 of 48 patients homozygous or compound heterozygous variants in known NCL genes. In 3 additional cases, only a single heterozygous variant in known NCL genes was found, postulating that the second variant was not detectable because of a deep intronic or a large copy number variation (overall detection rate of 67% for causative mutations in NCL genes). In addition, the research-based NGS approach allowed the identification of recessive mutations in 3 potential novel NCL candidate genes. However, further functional analyses and in vivo studies in animal models are needed to confirm the causative nature of those variants. 16
Clinical Exome Sequencing to Detect LSDs
In the last few years, whole exome sequencing (WES) or whole genome sequencing (WGS) approaches have gained unlimited consideration as universal tests for the identification of most Mendelian disorders 19 -21 with the exceptions of those caused by complex structural variations. As a more focused alternative to WGS, WES has been demonstrated to be a cost-effective solution with a higher throughput on coding sequences that favors the identification of novel disease genes.
A recent publication demonstrates the complexity and clinical variability of LSDs as well as the potential role of WES in exposing gaps in clinical knowledge due to the rarity of many of these diseases. Three siblings, between 40 and 60 years of age, developed severe osteolysis of bones in the hands and feet. Known skeletal dysplasia genes were sequenced, but no mutation was found. Whole exome sequencing eventually revealed deleterious mutations in the ASAH1 gene, causing acid ceramidase deficiency or Farber disease, which was confirmed by enzymatic testing. 22 The patients did have “typical” symptoms of the disease in childhood, but they were so mild that an LSD was not suspected. These became the oldest patients with Farber disease ever diagnosed (by 40 years!). In addition, the ASAH1 gene can now be added to those tested in patients with osteolysis.
Discussion
Recent reports of the application of NGS technologies in the diagnosis of LSDs highlight the high clinical utility of gene panels in patients with clinical suspicion of LSDs and the potential to reduce diagnostic delay in a group of patients where a definitive diagnosis can be delayed for years. With decreasing sequencing costs and increasing availability of tests fulfilling guidelines for diagnostic NGS testing, there are several advantages of using gene panels early in the investigation of patients with LSDs.
However, there are several issues raised by the application of high-throughput sequencing that needs to be considered and harmonized across different laboratories.
Since the technology and applications change constantly and rapidly, the available NGS platforms are not yet stable enough to ensure that all results obtained are equivalent. However, the technical variability cannot hamper the implementation of NGS assays, only the implementation of insufficiently validated and poor quality tests should be prevented. 7 Before implementing a diagnostic test, the aim of the assay should be clearly stated. It should be clear whether the test may be used to exclude a diagnosis or to confirm a diagnosis. The “diagnostic yield” of a test has to be certain before introducing an assay in the routine clinical diagnostic setting. The diagnostic yield refers to the chance that a disease-causing variant is identified and a definitive molecular diagnosis can be made. 7 The value should always refer to a patient cohort. This defines the performance of NGS from clinical point of view and indicates the efficiency of the test. Currently, most laboratories offer their own selection of genes in the frame of a diagnostic gene panel. From the standpoint of the patients and medical practitioners, it would be important to harmonize genetic testing. For the diagnosis of LSDs, it would make sense to have “core disease gene lists” established by clinical and laboratory experts of the field. Adding additional genes to standard core lists will increase the diagnostic yield, but it should not compromise the detection rate and the identification of disease-causing mutations. A simple rating system on the basis of coverage and diagnostic yield has already been proposed on behalf of EuroGentest and the European Society for Human Genetics (ESHG), 7 with a type A test offering >99% variant calls of the coding region and flanking intron sequences and fills all the gaps with Sanger sequencing or another complementary analysis. Type B tests would clearly state which regions are sequenced at >99% reliable reference and fill some of the gaps with resequencing. Type C NGS tests only rely on the quality of NGS sequencing without offering additional resequencing. The ordering physician has to be fully informed not only about the limitations of the NGS assay but also about the potential unanticipated effects of a particular type of genetic testing. Thus, the laboratory should specify the diseases not relevant to the clinical phenotype of the patient that could be caused by mutations in the tested genes. Especially in a broad panel of LSDs, only a part of the analyzed genes will be relevant to the disease phenotype of the patient. However, heterozygous mutations in recessive conditions might be detected, leading to the unsought detection of disease carriers. 7 This, in turn, can have consequences for counseling and reproductive choices. Laboratories, therefore, should provide information of the chance of unsolicited findings.
A pitfall of using gene panels as first-tier analysis in the place of biochemical assays for identification of LSDs include the identification of variants of unknown significance (VUS) or unclassified variants (or class 3 pathogenicity variants). 7,15 The VUS rates will greatly vary from gene to gene, but large collaborative efforts to share variant data, such as ClinVar, disease-specific databases, and other public databases are attempting to minimize this limitation. 23 In patients harboring a VUS, biochemical testing is essential to attempt to define the functional consequence of the sequence variant; however, this is not possible for pathologies that have no biochemical marker and in which confirmation will need to be obtained by other methods. 12 Another important policy for laboratories is to set up a local variant database to manage disease variants and to collect data on VUS with the aim to eventually classify these variants definitively. 7
The detection of a single heterozygous mutation in a gene for an autosomal recessive disorder (as in most LSDs) and the limited performance of bioinformatic tools to detect deletions and duplications from targeted NGS data 15,24 are other limitations encountered in the application of NGS assays in the diagnosis of LSDs. Currently, bioinformatic identification of deletions and duplications in many targeted NGS-based approaches is of limited sensitivity, especially for fewer than 3 exons. 24
Due to the characteristics of the technology, NGS also creates uncertainties and limitations of an order never encountered before in genetic testing. Uncertainties in genetic testing have always been existed, and this prompts us to constantly develop new tools to deal with those issues. Targeted NGS assays can have a role as support genetic tools in the diagnosis of LSDs always in combination with biochemical and clinical data. The NGS assays have proven to be very powerful for making diagnoses that are particularly challenging, for example, due to an atypical or late-onset presentation. Another strength of the method is to bring diagnostic odysseys to a more rapid conclusion than with approaches used previously.
The pilot study of Fernandez-Marmiesse et al 12 ended with 40 patients of the 84 analyzed probands still undiagnosed, although it must be stated that 64% of these patients had a low or moderate index of suspicion of LSD. The considerable phenotypic overlap between certain LSDs and non-LSD neurometabolic or neurodevelopmental conditions poses a great limitation to the application of targeted NGS panels with genes specific only for LSDs. To address this problem, several approaches are being used. The application of broad-range genetic panels that encompass most known neurometabolic disorders 12 or the application of the Mendeliome, an NGS panel encompassing all known genes for Mendelian disorders 25 might be a solution to enhance the detection rate in those patient groups. On the other hand, many reports already underline the diagnostic use of WES as an ultimate test and the subsequent application of virtual panels in the stepwise diagnostic process of monogenic disorders with phenotypic and genetic heterogeneity. 19,20,21
Conclusion
Genetic analysis has not been the primary screening tool used in the diagnosis of LSDs due to the cost and time requirements of the sequential genetic tests necessary to diagnose most of the disorders. 10,12 However, with the wide availability of NGS technologies, a genetically based diagnosis can be established in 4 to 6 weeks, while providing cost reductions compared to the sequential Sanger sequencing of several single genes. Although the advantages of applying NGS in the diagnosis of LSDs are numerous, validation of a genetic diagnosis often still needs biochemical testing to confirm the functional consequences of a sequence variant.
Given that NGS can be applied in a way that takes into account the many issues raised by international consensus guidelines, it can have an important role in the diagnostic process of LSDs. We emphasize the importance of a precise molecular genetic diagnosis early in the disease course of patients affected by LSDs, particularly as new therapies are becoming available for patients who share specific types of mutations. Further significance of NGS technologies in establishing a genetic diagnosis in patients is due to the benefit derived by the opportunity for genetic counseling, family planning, and the individual choice of reproductive options (including preimplantation genetic diagnosis) for affected families. It will be imperative to continually optimize this powerful technology with the potential benefits of patients and their families in mind.
Footnotes
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Alexander Solyom is Consultant/Employee of Enzyvant Sciences Inc., which is developing therapies for lysosomal storage disorders.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Janos Bolyai Research Scholarship of the Hungarian Academy of Sciences to KK.
