Abstract
The
Before the introduction of next generation sequencing (NGS) methods, the molecular analysis of
The main aim of this review is to describe the wide spectrum of muscle diseases caused by
Keywords
INTRODUCTION
With its 363 coding exons and a full-length transcript of more than 100 kb [1]
Titin acts as a scaffold protein aiding in myofibrillar assembly during myogenesis [2], as a molecular spring determining the passive elasticity of the muscle [3, 4], and as a mechanosensor serving various signaling functions [5, 6].
Late-onset autosomal dominant tibial muscular dystrophy (TMD) (MIM #600334); Young or early adult onset recessive distal titinopathy; Limb-girdle muscular dystrophy type 2J (LGMD2J; MIM #608807); Congenital centronuclear myopathy (CNM; MIM #255200); Early-onset myopathy with fatal cardiomyopathy, EOMFC (MIM #611705); Multi-minicore disease with heart disease (MmDHD) including clinical variations; Childhood-juvenile onset Emery-Dreifuss-like phenotype without cardiomyopathy; Hereditary myopathy with early respiratory failure (HMERF; MIM #603689); Adult onset recessive proximal muscular dystrophy.
Mutations in titin will probably prove to be the cause of many additional phenotypes of muscular disorders in the coming years.
Due to its huge size, it has not been possible to sequence the entire
Here, we focus on the current understanding of the titin gene and protein from a human disease perspective. In particular, we provide an overview of the different neuromuscular disorders caused by mutations in the
THE TITIN GENE, ISOFORMS AND PROTEIN
The titin gene (MIM #188840), is located on the short arm of chromosome 2 (chromosomal band q31.2). It contains 363 coding exons and an additional first non-coding exon [1]. The longest theoretical transcript (variant IC, NM_001267550.2), virtually obtained by the transcription of all the coding exons (excluding the alternative C-terminal Novex-3 exon) and called “meta isoform”, has been adopted as the gold standard for describing
The titin protein spans from the Z-disk to the M-band [12]. Its modular structure is composed of four main parts (Fig. 1): the amino-terminal Z-disc region, the I-band and A-band regions, and the carboxyl-terminal part spanning the M-band. Titin is composed of repeated immunoglobulin-like (Ig) and fibronectin type 3 -like (FN3) domains, interspersed by unique sequence regions [1]. It also contains the repetitive PEVK region, rich in proline (P), glutamate (E), valine (V), and lysine (K) residues, in the I band, and a serine/threonine kinase (TK) domain in the M-band.
More than 1 million splice variants could be generated theoretically by the
The major isoform classes are represented in the NCBI RefSeq database by the entries NM_133378 (N2A; NP_596869:3,680 kDa and 33,423 aa), NM_001256850.1 (N2BA; NP_001243779:3,780 kDa and 34,350 aa), and NM_003319 (N2B; NP_003310:2,960 kDa and 26,926 aa) [1, 15].
The Novex-1 (NM_133432; NP_597676) and Novex-2 (NM_133437; NP_597681) isoforms are similar to N2B, but they also include further 125 and 192 amino acids encoded by the Novex-1 and Novex-2 exons in the I-band. Finally, the much smaller Novex-3 isoform (NM_133379; NP_596870:616 kDa and 5604 aa) only contains the N-terminal part of the protein. This isoform, expressed on a low level in all striated muscles, results from inclusion of the Novex-3 exon encoding an alternative C-terminus [1].
The best characterized titin function is that of a scaffold protein aiding myofibrillar assembly during myogenesis [17]. However, it is also the backbone for the positioning of myosin filaments in the center of the sarcomere, and a molecular spring responsible for the passive elasticity of the muscle [3, 4]. The passive force of the muscle cells is, in fact, largely due to the elastic properties of I-band titin, allowing shortening of the sarcomere in contraction and extension when stretched. A crucial role in the myofibrillar signal transduction pathways has also been demonstrated [18]: titin seems to integrate or coordinate multiple signaling pathways related to gene activation and/or to protein folding, quality control and degradation [6, 19].
INTERACTIONS OF TTN WITH OTHER PROTEINS
The versatile roles played by titin in cardiac and skeletal muscles are enabled and facilitated by a high (or presumably very high) number of different protein ligands.
The search for TTN interactors in large public databases (PSICQUIC (20), IntAct [21], BioGRID [22]) results in a list of more than 170 putative ligands, as a product of large-scale studies of protein-protein interactions.
Even if a detailed discussion of titin interactions is not the main aim of this review, a summary of the best characterized ones is provided below and in Fig. 1.
Several reports have confirmed that telethonin (also named Titin-cap or T-cap) and
Similarly, the actin binding proteins, filamin C and nebulin, have been shown to interact with titin in the Z-region [15, 29].
The central I-band region of titin has been widely studied, and several interactors identified, including tropomyosin [30],
The A-band region of titin, tightly associated with thick filaments, binds myosin heavy chain and MyBP-C [41].
The M-band region of titin has several interactors. The domains at the A-band/M-band boundary bind the ubiquitin ligases MURF1 and MURF2 (muscle RING finger 1 and 2) [42, 43]. The titin kinase (TK) domain, located at the M-band periphery, interacts with calmodulin [44], as well as with the signalosome composed of nbr1 p62, and MURF2 [33]. FHL2, expressed predominantly in the heart, binds to the is2 region [45]. The alternatively spliced is7 region binds calpain 3 (CAPN3) [35], the calcium-dependent protease involved in the pathogenesis of LGMD2A. The M10 domain interacts with the giant structural and signaling protein obscurin and its smaller homologue obscurin-like 1 (OBSL1) [46], and the A-kinase anchoring intermediate filament protein alpha-synemin [47]. Finally, several of the C-terminal titin domains can bind the multifunctional docking protein myospryn (CMYA5) [48].
TITIN VARIANTS AND DISEASES
Mutations in the
Late-onset autosomal dominant tibial muscular dystrophy (TMD)
Tibial muscular dystrophy (TMD; MIM #600334]) is a mild autosomal dominant distal myopathy involving the anterior compartment muscles of the lower legs but sparing of the short toe extensor digitorum brevis muscles [49].
It is characterized by a late onset (>35 years), a slow progression, normal or slightly increased values of serum creatine kinase (CK) and a myopathic EMG pattern [49, 50]. Biopsy findings in the target muscles include fiber size variability, central nuclei, necrosis, presence of fibroadipose tissue and rimmed vacuoles. Electron microscopy showed autophagic vacuoles without membrane and very rare inclusions of 15–18-nm filaments [51]. Muscle imaging (CT or MRI) is very informative with selective fatty replacement in the muscles of the anterior compartments of the lower legs starting in the anterior tibial muscle and representing a useful clinical tool to address the diagnosis.
In 2002, the first
One year later, a dominant missense variant in the exon 364 (c.107840T>A p.Ile35947Asn) was also identified in a Belgian family with a similar phenotype [51]. In 2008, three novel truncating variants (two deletions – c.107647delT p.Ser35883Glnfs*10 and c.107889delA p.Lys35963Asnfs*9 – and a nonsense mutation – c.107890C>T p.Gln35964*) were identified in two French families and a Spanish kindred [50] and a further missense mutation in exon 364 was then described in a large Italian family [53].
Young or early adult onset recessive distal titinopathy
More recently, Evilä et al. [54] described four patients with a more severe distal phenotype, resulting in a young or early adult-onset recessive distal titinopathy: all these patients were compound heterozygotes for described TMD mutations and novel frameshift variants (Table 1). Two French patients, previously reported with a more severe distal phenotype compared to TMD, had a second causative mutation that explains the peculiar phenotype. Similarly, a 36-year-old Spanish female with a similar distal phenotype and an early onset had the previously described Iberian
In the reported patients, biopsy findings are highly variable, from mild myopathy to severe dystrophic changes usually with rimmed vacuoles, depending on the site of biopsy and the disease duration.
Muscle imaging shows an early (already at age 20) fatty degeneration of the anterior compartment, frequently combined with a similar degree of involvement of soleus, which is unusual in TMD.
Limb-girdle muscular dystrophy type 2J (LGMD2J)
Limb-girdle muscular dystrophies (LGMD) are Mendelian disorders affecting the voluntary muscles in proximal limbs of the hip and shoulder areas [55]. LGMDs includes more than 30 different diseases with different but often overlapping clinical pictures [56]. LGMD2J represents a recessive disease with an early age of onset (<12 y.o.) [50, 57]. The first Finnish patients described were homozygous for the FINmaj mutation, presenting with a very different and much more severe phenotype than TMD [50, 57]. In addition, in a French family with a dominant TMD phenotype due to a nonsense mutation in the last exon (c.107890C>T p.Gln35964*), one deceased patient with a more severe generalized muscle weakness proved in retrospect to be homozygous for the mutation [58].
More recently, Evilä et al. described three further LGMD2J patients [54]. Three Finnish patients, heterozygous for the FINmaj variant and presenting with an early onset LGMD or generalized muscle weakness phenotype, were clinically and molecularly re-evaluated. In two out of three patients, a second frameshift variant was detected in the other
On the protein level LGMD2J shows a secondary CAPN3 defect [60] and loss of titin C-terminus on Western blots and immunofluorescence microscopy with antibodies against several C-terminal domains [50, 61]. Most biopsied muscles of patients homozygous for the FINmaj variant show dystrophic findings with end stage pathology without rimmed vacuoles [48]. However, rimmed vacuolar pathology was reported in a recently described case compound heterozygous for the FINmaj mutation and a truncating mutation [52].
Muscle imaging shows a progressive fatty degeneration of skeletal muscles. Muscles are relatively well preserved in young patients. A small degree of fatty degeneration can be observed 10 years after onset of muscle weakness, and fatty replacement is usually total after 40 years [48, 52].
Congenital centronuclear myopathy
Centronuclear myopathies (CNMs) are congenital myopathies characterized by the presence of centralized nuclei in the muscle fibers [62]. The term has also been used for myopathies with less specific increase of internalized nuclei. Mutations in four different genes have been reported to cause CNM:
Fattori at al. described a further CNM patient with two
Early-onset myopathy with fatal cardiomyopathy (EOMFC) and multi-minicore disease with heart disease (MmDHD)
In 2007, Carmignac et al. reported a novel recessive titinopathy involving both heart and skeletal muscle, in two consanguineous families of Moroccan and Sudanese origin [69]. The disease was characterized by early onset, slowly progressive, muscle weakness (1 y.o.); conversely, a severe dilated cardiomyopathy with rhythm disturbances developed later and resulted in a premature sudden death before adulthood. Skeletal muscle biopsies showed minicore-like lesions, centralized nuclei and type 1 fiber predominance.
In 2014, Chauveau and collagues described four other families with congenital core myopathy and primary heart disease associated with
Chauveau described a wide range of phenotypes, spanning from an Emery-Dreifuss-like form to an unusual, severe condition with distal arthrogryposis multiplex congenita (AMC), congenital muscle weakness, kyphosis, and neonatal cardiac failure. All of them are congenital or infantile muscle conditions, characterized by weakness with rigid spine, distal or elbow joint contractures, impaired respiratory function and mild hyperCKemia (<5x).
Childhood-juvenile onset Emery-Dreifuss-like phenotype without cardiomyopathy
Emery-Dreifuss muscular dystrophy has been associated with several genes:
Congenital centronuclear myopathy [67, 68], early-onset myopathy with fatal cardiomyopathy (EOMFC) [69], multi-minicore disease with heart disease (MmDHD) [70], and childhood-juvenile onset Emery-Dreifuss-like phenotype withoutcardiomyopathy [76] (Table 2) represent a group of TTN-related recessive disorders characterized by an early onset. Most of the patients described are homozygous or compound heterozygous for truncating variants. The causative mutations in CNM patients are mainly localized in I- or A-bands but frameshift variants in M-band titin have been found in 3/6 patients [67, 68]. On the contrary, almost all the patients with an EDMD-like phenotype or an EOMFC/MmDHD described so far have truncating mutations in the M-band, but these variants do not involve the last exon, previously associated to TMD/LGMD phenotypes [69, 76]. Interestingly, despite the location of all these M-band truncating variants in proximity to each other, the clinical pictures and the histological findings are heterogeneous and, above all, a cardiac phenotype is only reported in a subset of patients.
Hereditary myopathy with early respiratory failure (HMERF)
Hereditary myopathy with early respiratory failure (HMERF) is an adult-onset autosomal dominant myopathy with respiratory muscle involvement that may lead to a fatal respiratory crisis if not treated [77].
In 2005, Lange et al. identified a
With the exception of the TK-R279W mutation, all other mutations reported as being causative of HMERF are localized in the
In the affected members of the original Swedish family described by Lange et al. [33], Hedberg et al. reported the presence of a second variant, the “recessive” p.Pro31732Leu change [86]. A suggestive hypothesis is that the co-inheritance of both p.Pro31732Leu and p.Arg34091Trp may cause the disease by a fully penetrant bi-mutational dominant allele [87].
Other titinopathies
In two families, one Finnish and one Italian, with well-known dominant TMD disease one individual in each family developed a different phenotype: adult onset proximal lower limb weakness without the normal ankle dorsiflexion weakness [54, 88]. Muscle MRI consistently showed significant dystrophic changes in the thigh muscles, and in the Finnish patient marked soleus muscle involvement. Both patients proved to have a second recessive mutation inherited from the healthy non-TMD parent. In particular, the Finnish patient showed a missense mutation (c.92167C>T p.Pro30723Ser) in the exon 340 [54] and in the Italian patient a missense mutation (c.60494A>G p.His20165Arg) was identified in the exon 305 [88].
Exome sequencing detected two variants in compound heterozygosity (c.45599C>G p.Ala15200Gly and c.106154
ANIMAL MODELS OF TITINOPATHIES
Several spontaneous and induced animal models with titinopathy have been described so far.
“Runzel” (“ruz”) is a dystrophic zebrafish mutant with a reduced expression of certain
A spontaneous mouse model with a complex rearrangement causing the loss of 83 amino acids from the N2A region exhibits a recessive muscular dystrophy with myositis (mdm) [91]. The homozygous mice show a progressive muscle degeneration involving prominently distal skeletal muscles such as the tibialis and a reduced expression (50–60%) of CAPN3.
Mice carrying the FINmaj mutation [92] in homozygosity develop a progressive muscular dystrophy as well as a dilated cardiomyopathy, whereas heterozygotes only show a mild, later onset restricted phenotype. Interestingly, crossing the FINmaj model with CAPN3-deficient mice attenuates the muscular disorder in double heterozygotes, although not in the FINmaj homozygotes, suggesting a role for CAPN3 in the pathogenesis.
Similarly, other models were characterized and stu-died to focus on the
“Pickwick” (“pik”) is a zebrafish mutant, carrying a
A mouse lacking the cardiac N2B element was generated to study the role of this element in systole and diastole [94]. The shorter protein is correctly integrated into the sarcomere, but causes a restrictive diastolic dysfunction.
A murine conditional knock-out for the first two M-line exons (exons 359-360 or Mex1 and Mex2) has been produced to study the role of these domains during heart development [95, 96].
Finally,
GENOTYPE-PHENOTYPE CORRELATION
As described above, variants in the
Homozygous or compound recessive truncating mutations in the first four M-line exons (exons 359–362 or Mex1–4) cause a range of severe congenital or very early onset muscle diseases with or without cardiomyopathy [67–70, 76]. The reason why some truncations in the same exon cause cardiomyopathy and others not is unexplained. All truncated transcripts do not undergo nonsense-mediated decay and some read-through occurs, which may lead to very variable amounts of titin protein available. Cardiomyopathy could be associated with lower amounts of protein, but this has not been conclusively confirmed. Compound heterozygosity including a missense mutation with a truncating change leads to functional homozygosity of the missense transcript due to nonsense-mediated decay of the truncated transcript. Moreover, a compound heterozygous patient (patient IV in ref [54]), carrying the FINmaj variant and a missense change in the A-band (p.Pro30723Ser), shows an atypical, adult onset, proximal lower limb titinopathy that spares the anterior tibial muscle [54]. An identical phenotype occurs with compound heterozygosity of the Italian TMD mutation (p.His35946Pro) and another missense A-band mutation (p.His20165Arg) [88].
The different molecular mechanisms underlying LGMD2J, the recessive distal titinopathy, or specific atypical phenotypes have not been clarified so far.
Finally, HMERF represents a unique, well recognized, phenotype (Table 3). Even if mutations in the exon 344 are the only changes confirmed to cause fully dominant HMERF, the recent finding of the more recessive change (p.Pro31732Leu) [82] highlights the possibility that a a second variant
In the pre-NGS era, the low number of described patients, as well as the positional bias caused by the extensive scanning of M-band exons as compared to other
CHALLENGES ARISING FROM NGS PROJECTS
In the last few years, the next generation sequencing approaches have demonstrated to be extremely useful in research and diagnostic testing for various hereditary conditions, including neuromuscular disorders [99]. Whole exome (WES) [100], whole genome (WGS) [9], and targeted sequencing approaches [101] have been utilized to identify causative mutations in already known or novel disease genes. All these strategies are revealing a high number of novel and rare variants in the
In 2012, Herman et al. developed an affinity capture for the sequencing of the titin exons [102]. In this way, they identified 72 loss-of-function variants, indicating the important role of titin in the development of dominant dilated cardiomyopathy. Later studies have shown many truncating variants to be too common for fully penetrant dominant effects [103].
Several custom enrichment assays, including MyoCap [88], MotorPlex [104, 105], and others [106–108], have been developed to sequence specific genes of interest related to neuromuscular disorders.
The use of comprehensive NGS tools allows the analysis of almost all the coding regions of
On the other hand, novel challenges arise from the NGS data.
Most of the patients analyzed by NGS strategies show previously undescribed rare missense variants. The clinical interpretation of missense variants in the
According to a recent large-scale study, most disease-associated variants perturb protein– protein interactions without causing misfolding [112]. Moreover, each variant may affect only specific interactions while leaving most other interactions unperturbed [112], explaining how different variants in the same gene can cause different phenotypes. These notions likely hold true also for
Meanwhile, the research community has responded to the challenges arising from the use of NGS through the formation of clinical and research consortia [113, 114]. These collaborations take advantage of the power of shared resources and expertise, and particularly the benefit of combining cohorts of patients into larger groups. This greatly increases the likelihood of success of NGS projects and enhances the impact of these projects in terms of the clinically relevant data that is associated with them. To reach this goal, there is an urgent need to collect all reported, novel detected and rare
Footnotes
ACKNOWLEDGMENTS
This study was supported by the Folkhälsan Research Foundation, the Jane and Aatos Erkko Foundation, Association Française contre les Myopathies (SB/CP2013-0106, B.U. and PF2016-19522, M.S.), the Academy of Finland (no. 138491, B.U.), the Sigrid Jusélius Foundation, the Liv och Hälsa Foundation, The Finnish Cultural Foundation, the Austrian Science Fund (FWF, P27634FW), and Tampere University Hospital Research Funds (B.U.).
