Abstract
Duchenne muscular dystrophy (DMD) is a severe genetic disorder caused by loss of function of the dystrophin gene on the X chromosome. Gene augmentation of dystrophin is challenging due to the large size of the dystrophin cDNA. Emerging genome editing technologies, such as TALEN and CRISPR-Cas9 systems, open a new erain the restoration of functional dystrophin and are a hallmark of bona fide gene therapy. In this review, we summarize current genome editing approaches, properties of target cell types for
INTRODUCTION
Duchenne muscular dystrophy (DMD)
Duchenne muscular dystrophy (DMD) is one of the most common and severe forms of genetic muscle disorders and is described by the progressive degeneration of proximal muscular tissue with the characteristic of calf muscle enlargement. Muscle weakness becomes obvious around 3 to 5 years of age, resulting in compromised running, climbing, or standing (known as Gowers’ sign). Patients at infant stage show a massive elevation of creatine kinase (CK) or creatine phosphokinase (CPK) levels (>10,000 IU/L in comparison to normal levels of <200 IU/L) in the blood, which is an indication of muscle or heart damage, since CK is a muscle specific kinase mediating the production of energy storage material phosphocreatine in muscle. Between ages 7 and 12, DMD patients gradually start to rely on wheel chairs due to difficulties in walking and a waddling gait. By their teenage years, they may require assistance in activities that involve the use of arms and legs. With time, muscle weakness affects not only proximal muscle tissues, but also cardiac and respiratory function. The development of artificial ventilators in the 1980s have helped improve life expectancy, however, cardiac or respiratory defects remain the main cause of mortality in DMD patients, with a life expectancy of around the age of 30.
Dystrophin gene
The genetic cause of DMD was first identified by Dr. Louis M. Kunkel’s group in 1986 [1], and the identified cDNA was named “Dystrophin” [2]. The dystrophin gene spans about 2.2 Mbp on the X chromosome and consists of 79 exons for the Dp427 m isoform, whichis mainly expressed in skeletal muscle cells. Dystrophin protein has an actin-binding domain on the N’-terminus and a dystrogly can complex-binding domain on the C’-terminus, and the two are connected by a rod domain which consists of spectrin-like repeats (Fig. 1). The function of dystrophin protein is to physically anchor the cellular skeletal actin fibers and sarcolemma membrane in muscle fibers [3]. Muscle cells without functional dystrophin become vulnerable during physical muscle movements, resulting in spontaneous cellular death which induces inflammation and fibrosis [4].
DMD mutation patterns
Due to the large size of the dystrophin gene, mutation patterns found in DMD patients largely vary. Approximately 60–70% of DMD patients harbor a large deletion of one or more exons, with many occurring with in exons 44 to 54 [5–9].
Notably, a milder form of DMD, Becker muscular dystrophy (BMD), is also caused by mutations in the same dystrophin gene. Complete loss of functional dystrophin protein due to out-of-frame mutations is classified as DMD, while partial loss of functionality (such as smaller protein size) or decreased level of dystrophin protein expression due to in-frame deletions, insertions, or small amino acid substitutions are classified as BMD.
BMD patients suffer from weakened muscles and mobility in their old age, although the disease phenotype is highly heterogeneous. Depending on the type of genetic mutation, functionality of the resulting dystrophin protein ranges from barely functional (similar to DMD) to almost normal [3, 10]. Therefore, the symptoms of BMD patients vary, and some patients have nearly normal life spans.
Considering the function of dystrophin as a structural protein that connects cytoskeletal actin fibers and the sarco lemma membrane, minor amino acid substitutions or truncation at the rod domain is not critical for its function. This is a unique and important property when it comes to designing strategies on how to restore the function of dystrophinprotein for treating DMD.
Antisense oligonucleotide-mediated exon skipping
The deletion mutations found in roughly two thirds of exons are not in multiples of three base pairs, thus resulting shifts of one or two base pairs in the protein reading frame. Notably, if an exon with the sizeof (3n+1) bp is deleted, further translational skipping of another adjacentor neighboring exon with the size of(3n+2) bp would restore the out-of-frame mutation. This approach is known as exon skipping. Although exon skipping causes some amino acid deletion, in general truncations in the rod domain are tolerated, as seen in BMD patients.
To induce exon skipping, several methods have been investigated, including the use of anti sense oligonucleotides to bind to a targeted exon. Several different types of anti sense oligonucleotides have been tested and demonstrated as effective and functional in various animal models [11–16]. Encouragingly, anti sense oligonucleotide analogues are now being investigated in clinical trials, with some showing promising results [17–20]. However, due to the transient nature of antisense oligonucleotide analogues, DMD patients are required to be repeatedly treated throughout their lifetime (Fig. 2).
Gene augmentation therapy
The concept of gene therapy, which is to treat or cure congenital disorders, emerged through the development of viral vectors in the early 1970s. Since then, the field of gene therapy has significantly expanded with the concept of supplying a healthy gene to replace the function of the mutated gene [21–24]. So far, several genedelivery vectors based on viruses have been developed, such as retroviral vector [25–27], lentiviral vector [28–30], herpes simplex virus (HSV)-based vector [31,32, 31,32], adenoviral vector (HDAdv) [33,34, 33,34], adeno-retroviral hybrid vector [35], and adeno-associated virus (AAV) vector [36–40]. Each vector has its own advantages and disadvantages, as listed in Table 1.
In the case of gene therapy for DMD, there are three major challenges: the large size of dystrophin cDNA (14 kbp for the Dp427 m isoform); the transduction of muscle tissues that are surrounded by layers of connective tissues; and the abundance of affected tissue, which consists of 30∼40% of body weight. In general, oversized viral vectors have significantly lower production efficiency, therefore leading to poor transduction efficiency. HSV vectors [31,32, 31,32] and HDADv vectors [33, 34] have been demonstrated to deliver full-length dystrophin cDNA. However, other vectors require the dystrophin cDNA to be cut down in size by trimming the rod domain (as seen in mild BMD patients), resulting in mini-dystrophin (6.2 kb) or micro-dystrophin genes (3.6 to 4.2 kb) [41]. As an alter native,the utrophin gene (official gene symbol: UTRN), which shares both structural and functional similarities with the dystrophin gene, was shown to functionally replace defective dystrophin [42, 43].
Apart from viral vectors, non-viral vectors have also been investigated for DMD gene therapy.
Genome editing technologies for DMD
Due to difficulties in delivering a functional dystrophin gene into myogenic cells, alternative approaches were anticipated. In this regard, the development of engineered nucleases opened a new era of gene therapy for DMD. Artificial nucleases, which are engineered to cut genomic DNA once at the designated target sequence, can induce DNA repair pathways, such as non-homologous end joining (NHEJ) and homologous directed repair (HDR). The NHEJ pathway results in the generation of small deletions or sometimes insertions, whereas the HDR pathway uses a repair template called the donor DNA.
Meganucleases for DMD
The first demonstration of genome editing technology for the dystrophin gene was reported in 2010 by Dr. Jacques Tremblay’s group in collaboration with the biotechnology company Cellectis [47]. They used meganucleases, which are homing end on ucleases that contain a large DNA recognition domain to target 12- to 40-bp DNA sequences. For example, I-SceI mega nuclease derived from the mitochondrial DNA of
ZFN mediated indels for DMD
Zinc finger domain is one of the most abundant DNA binding domains in mammals, and most of itspossible combinations of DNA binding motifs have been identified. Custom conjugation of several zinc finger domain swith a nuclease domain allows scientists to target specific sequences of interest. To target one target site, typically three or four zing finger domains are conjugated with FokI nuclease domain to form zinc finger nuclease (ZFN). FokI domain cleaves DNA only when it is dimerized, so a pair of zinc finger nucleases (ZFNs) is used to cleavage the target sequence and also to increase specificity.
Dr. Jacques Tremblay’s group investigated the utility of ZFNs that target endo genous the human dystrophin gene together with several mega nucleases and found that ZFNs can induce small deletions (typically 1–20 bp in size) or small insertions (typically 1–5 bp in size) at the target site of the human dystrophin gene [49]. These results demonstrate that ZFNs are able to restore the reading frame of endo genous dystrophin simply by introducing a small deletion or insertion (indel).
ZFN-mediated knock-in for DMD
Site-specific DNA cleavage not only resulted in NHEJ mediated indels, but also enhances the efficiency of HDR. In human DMD myoblasts carrying a deletion of exons 45–52, Dr. George Dickson’s group demonstrated that mega nucleases targeting intron 44 of the dystrophin gene facilitated the insertion of cDNA corresponding to the exon 45–52 region in a bulk cell population [50]. This is a promising gene therapy approach for patients who carry deletions of multiple exons. Unfortunately, restored dystrophin protein expression was not detected, presumably due to the low incidence of cDNA knock-in events or the insertion of an artificial exon altering the expression pattern of dystrophin protein. Further enhancement of knock-in events or selection of a collected cell pool is required.
While ZFN is a powerful tool to edit a target site of interest, it does have drawbacks. Firstly, construction of an active ZFN pair is challenging, as simple assembly of the corresponding zinc finger domain does not guarantee full DNA binding activity due to cross interactions between adjacent zinc finger domains (known as context dependency). Secondly, even though zinc finger domains that recognize all possible combinations of trinucleotide motifs (43 = 64) have been identified, there are some biases or preferences to certain sequences, such as “GNN”. Therefore, designable target sequences are limited (Fig. 3).
TALEN for DMD
To this end, a more flexible DNA targeting domain is preferred. TALE (Transcriptional Activator-Like Effector) domain, which was originally derived from the plant pathogenic bacteria
Thanks to its greater flexibility in target sites, the TALEN system has high probability at targeting exonic regions, which are typically less than 200 bp in size. By targeting the protein-coding region in exons, the dystrophin coding frame can be modulated when the correct size of a deletion or insertion is introduced. Deletion is solely mediated by programmable nucleases via the NHEJ repair pathway and since a donor template is not required, the gene delivery process can be simplified. This NHEJ-mediated approach is unique in its targeting the rod domain region of the dystrophin gene, as small truncations or amino acid changes can be tolerated or have only minor effects on dystrophin protein function, as seen in BMD patients. Dr. Charles Gersbach’s group utilized the TALEN system to target exon 51 in fibroblasts or an immortalized myoblast cell line, which was originally derived from a DMD patient with deletion of exons 48–50 [51]. Successful introduction of a small deletion (i.e. 5 bp) in exon 51 resulted in a frame shift of the protein reading frame and restored dystrophin protein expression in differentiated myoblasts. We tested a similar approach in induced pluripotent stem (iPS) cells derived from a DMD patient who lacks exon 44 [52]. By targeting exon 45 with TALEN, we successfully introduced a small deletion or insertion (i.e. 1 bp insertion), which restored the proper protein reading frame to express dystrophin protein after myogenic differentiation. However, the frame shift approach is highly stochastic, as the size of the indelby programmable nucleases cannot be controlled. Therefore, out of all the indelclones analyzed, only one-third of cells showed a proper protein reading frame.
To enhance the likelihood of reading frame correction, we also tested the possibility of skipping exon 45 by targeting a splicing acceptor site. As consensus sequences of splicing acceptor sites mainly consist of a 2-bp “AG” sequence, we initially thought the disruption of a splicing acceptor site should be more efficient than the frame shifting approach. However, it was difficult to design TALENs or other programmable nuclease swith high specificity, due to the low complexity of the polypyrimidine tract (which is a “C and T” rich region) around the splicing acceptor site. Nevertheless, by targeting around 10 bp downstream from the splicing acceptor site, we managed to show that some NHEJ-mediated deletions can reach the splicing acceptor site and succeeded in skipping exon 45 to restore the proper reading frame of dystrophin protein [52]. A similar approach is also reported in skipping exon 51 by using ZFNs [53].
For NHEJ-mediated deletion approaches, the control or prediction of the deletion pattern is important. Interestingly, deep sequencing analysis of deletion patterns identified apredominant deletion pattern (up to 30–40% of the deletion events) harboring microhomology sequences (typically 3–5 bp in size) on both sides of the cleavage site [52, 54]. By using web tools to predict microhomology-mediated end joining patterns [52, 55], we sought to enhance the success rate of the frame shift approach.
Thus, by taking advantage of iPS cells and their propensity to self-renew indefinitely, we examined a knock-in approach of missing exon 44 by conjugating antibiotic selection followed by subsequent removal of the selection cassette by the Cre-loxP system. We conducted TALEN-mediated targeting experiments and observed high knock-in events amongst antibiotic resistant clones. Successful expression of full-length dystrophin protein was detected by immunostaining and western blotting upon differentiation into myoblasts. Two rounds of sub cloning is demanding and time consuming, but homologous recombination (HR)-mediated targeted knock-in of a missing exon might be a feasible approach in DMD patient-derived iPS cells.
CRISPR-Cas9 for DMD
The TALEN system is versatile enough to target almost any DNA sequence, however, construction of TALEN is laborious, especially for newcomers to genome editing experiments. To construct one TALEN, a library of RVD domains is required to conjugate 14 to 20 RVD domains into an expression vector [56–59]. The assembly process normally takes two to three steps of cloning cycles (i.e. two to three weeks) depending on the assembly systems.
Mean while, bacterial adaptive immunity systems [60], namely CRISPR (clustered regularly interspaced short palindromic repeats) and Cas (CRISPR associated) proteins [61], have been identified as a DNA cleavage machinery against invasive DNA elements, such as bacteriophages [62–64]. Soon after this finding, Drs. Jennifer Dundna and Emmanuelle Charpentier’s groups recognized that the combination of Cas9 protein and two small RNAs, crRNA and tracrRNA, can induce a DNA double strand break at a site complementary to the crRNA sequence [65]. Furthermore, crRNA and tracrRNA can be conjugated into a single guide RNA (sgRNA) by a tetranucleotide linker loop to simplify the system. This finding has had significant impact on the genome editing field, as only the construction of ansgRNA (∼100 bp in size) is required to determine the target sequence, and the Cas9 protein component is common for any target sequences. Construction of sgRNA is simple and straight forward, as target sequences can be synthesized and inserted into an expression vector driven by a PolIII promoter.
One major limitation of the CRISPR system as a genome editing tool is the requirement of a “proto spacer adjacent motif (PAM)” sequence for the target site, as this sequence is essential for the initial step of the target sequence recognition [66]. For example, the most widely used CRISPR-Cas9 system derived from
Strikingly, it is reported that DNA cleavage activity with the
Another concern about the CRISPR system is its targeting specificity. There are a number of reports on off-target mutagenesis even with several base pair mismatches, especially in immortalized cancer cell lines [69–73]. On the other hand, several other groups reported high specificity and minimum off-target mutation load by whole genome sequencing in human pluripotent stem cells [74, 75]. We tested the risk of off-target mutagenesis by karyotyping, CNV analysis, and exome sequencing, and found minimum incidents of undesired mutagenesis in DMD-iPS cells. Therefore, at least in human pluripotent stem cells, the CRISPR system can be regarded as a precise recognition machinery. Further study is required to shed light on whether immortalized cell lines have abnormal DNA repair pathways similar to cancer, or whether pluripotent stem cells equip a particular DNA repair mechanism to maintain genomic integrity, such as Zscan4 [76] or Filia genes [77].
Strategies to correct DMD mutations must considerer the type of mutation in the patient, as genome editing approach is highly custom tailored. Therefore, pathogenic mutation patterns must be carefully examined before deciding on the appropriate therapeutic strategy. To cover as many mutation types as possible, Dr. Gersbach’s group took advantage of the CRISPR-Cas9 system for its ability to target multiple exons simultaneously [78]. It is difficult to target all the mutations known to cause DMD, but this approach greatly expands the range of treatable DMD patients. Targeting other types of mutations, especially large bulk deletions or duplication of multiple exons, is a future challenge.
Ex vivo gene therapy
Considering the stochastic nature of the deletion patterns as well as the risk of off-target mutagenesis by custom nuclease treatment, it is advantageous to predetermine the edited DNA sequence in isolated cells from patients in advance and then, only after proving the gene correction was successful, transplant the cells back to the patient. This “
As another cell source to be expanded
In comparison to most somatic cells, except for neoplastic cells, ES cells have unlimited self-renewal capacity and grow rapidly while maintaining normal karyotyping. In addition, ES cells remain pluripotent for the differentiation into different cell types even after massive expansion. Establishment of ES cells requires fertilized embryos, however, the invention of iPS cell technology now allows us to establish cells almost identical to ES cells from the patient’s own somatic cells by the transient expression of transcriptional and transforming factors [82, 83]. So far, hundreds of patient-derived iPS cell lines have been established, including DMD patient-derived iPS cells, which were first reported in 2008 [84].
Due to its unlimited self-renewal capacity and differentiation ability into myoblasts, iPS cell holds great promise as a cell source for
To materialize iPS cell-mediated
Genome editing in zygotes
Apart from
Muscle is one of the most abundant tissues in the human body, and DMD affects the entire muscle mass, therefore, correction of the systematic phenotype is a great challenge. Due to the lack of effective DNA delivery methods into adult tissues, one might consider introducing gene therapy for DMD at the early developmental stage. Dr. Eric Olson’s group reported “prevention” of the muscular dystrophy phenotype in
Will germ line editing be necessary to treat DMD patients? To correct the dystrophin mutation, it is first necessary to diagnose where the genetic mutation is located. Recent deep sequencing technologies require only small amounts of DNA template for the identification of genomic mutations from biopsies of pre-implantation embryos. However, the inheritance of DMD follows an X-linked recessive fashion, therefore all female embryos and half of male embryos are not affected by the mutations. This begs the question, will it be necessary to select the affected male embryos and correct the mutated dystrophin gene by genome editing despite the various technical risks described above? Ethically, it would be more appropriate to select the non-affected population of embryos than genetically manipulate the affected one. Of course, the selection of embryos brings up another ethical issue, which is related to the rights of early human embryos. Considering the severity, incurability and lethality of DMD, pre-implantation diagnosis could be justified, and most parents will probably choose to give up on the embryo if DMD is detected. Further discussion will be necessary among scientists, patients, and broadersociety.
Genome editing in vivo
One of the most exciting approaches for DMD gene therapy is
For conventional gene augmentation therapy, consistent and high-level expression of a therapeutic transgene is essential. However, for genome editing technologies, a simple “hit-and-away” strategy is feasible, as transient expression of the genome editing nuclease is sufficient to induce a stable alteration of the genomic DNA sequence.
Cas9 protein from
Future challenges
The development of easy and efficient genome editing tools, such as TALENs and CRISPR-Cas9 systems, have opened a new era for DMD gene therapy. The collection of disease mutations in cultured cell or mouse models is the beginning of a long journey for clinical applications.
CONFLICT OF INTEREST
The author has no conflict of interest to report.
Footnotes
ACKNOWLEDGMENTS
The author would like to thank Hongmei Lisa Li for her technical contributions, and Drs. Mandy Siu Yu Lung and Peter Karagiannis for their critical reading of the manuscript. A. H. is supported in part by JSPS KAKENHI grant (15H05581) and AMED Research Center Network for Realization of Regenerative Medicine grants.
