Abstract
Objectives
The objective of this study was to perform a proof-of-concept experiment that validates a precision medicine approach to identify variants associated with hypertrophic cardiomyopathy (HCM). We hypothesized that whole-genome sequencing would identify variant(s) associated with HCM in two affected Maine Coon/Maine Coon cross cats when compared with 79 controls of various breeds.
Methods
Two affected and two control Maine Coon/Maine Coon cross cats had whole-genome sequencing performed at approximately × 30 coverage. Variants were called in these four cats and 77 cats of various breeds as part of the 99 Lives Cat Genome Sequencing Initiative (http://felinegenetics.missouri.edu/99lives) using Platypus v0.7.9.1, annotated with dbSNP ID, and variants’ effect predicted by SnpEff. Strict filtering criteria (alternate allele frequency >49%) were applied to identify homozygous-alternate or heterozygous variants in the two HCM-affected samples when compared with 79 controls of various breeds.
Results
A total of four variants were identified in the two Maine Coon/Maine Coon cross cats with HCM when compared with 79 controls after strict filtering. Three of the variants identified in genes MFSD12, BTN1A1 and SLITRK5 did not segregate with disease in a separate cohort of seven HCM-affected and five control Maine Coon/Maine Coon cross cats. The remaining variant MYBPC3 segregated with disease status. Furthermore, this gene was previously associated with heart disease and encodes for a protein with sarcomeric function.
Conclusions and relevance
This proof-of-concept experiment identified the previously reported MYBPC3 A31P Maine Coon variant in two HCM-affected cases. This result validates and highlights the power of whole-genome sequencing for feline precision medicine.
Introduction
Hypertrophic cardiomyopathy (HCM) is the most commonly diagnosed cardiomyopathy in cats. 1 A previous study reported that a colony of Maine Coon (MC) cats with HCM closely resembles the inheritance pattern of human familial HCM. 2 This colony ultimately led to the identification of the MYBPC3 A31P variant in MC cats via a candidate gene approach. 3 Furthermore, the R820W MYBPC3 HCM variant in cats from the Ragdoll breed was also identified through a candidate gene approach. 4 Even though candidate gene studies have previously identified HCM variants in known genes encoding for proteins with a sarcomeric function, investigators are limited to previously described gene-encoded protein functions and known gene-to-disease relationships when utilizing this method.
As a result of this limitation, whole-genome sequencing (WGS) has emerged as a powerful tool to identify variants associated with inherited diseases. In recent years, new next-generation sequencing (NGS) technology has resulted in a reduction in WGS cost and led to higher integration of its use for precision medicine.5,6 Furthermore, the reduced cost of NGS has allowed genomes from multiple species, including cats, to be assembled and annotated. 7 Therefore, conducting an NGS study in cats is beneficial for the detection of novel disease-causing genetic variants, such as single nucleotide polymorphism (SNP) and/or insertion/deletions (indels). 8 The 99 Lives Cat Genome Sequencing Initiative (http://felinegenetics.missouri.edu/99lives) is a consortium that provides a large control cohort of cats that have had their whole genome sequenced, and has already led to huge advances in the healthcare of cats.9,10 A recent publication by Aberdein et al further illustrates how the 99 Lives Cat Genome Sequencing control database can be used to identify a variant associated with disease in two affected and 82 control cats. 11
In humans, HCM is a familial genetic heart disease affecting 1 in 500 individuals. 12 HCM is also highly prevalent in cats, with recent estimates showing a rate as high as one in seven. 13 Despite the high prevalence of this condition, genetic discoveries are limited to just two previously reported breed-specific variants in MYBPC3.3,4 In particular, the correlation of the MYBPC3 A31P variant and HCM development in MC cats has been in the center of discussion after Wess et al reported that testing positive or negative for this variant was not an indicator for HCM development in a cohort of MC cats phenotyped by a single echocardiogram. 14 Nevertheless, multiple clinical investigations have continued to demonstrate the correlation between the MYBPC3 A31P variant and development of HCM in MC cats.15,16
For this proof-of-concept study, we aimed to demonstrate the efficacy of WGS by identifying the MYBPC3 A31P variant in two cats affected by HCM, one MC and one MC/domestic shorthair cross (MCX), that were distantly related. This proof of concept is an important step toward validating this methodology for future genetic studies that aim to identify novel HCM variants in the cat population. Previous publications have identified that the MC/MCX cat MYBPC3 A31P variant may not be the sole HCM-associated variant in this breed. Therefore, further genetic studies are warranted to define the presence or absence of other biologically relevant variants associated with HCM in MC/MCX cats.17–19
Materials and methods
Sample phenotyping
Extensive cardiac phenotyping was provided to the four MC/MCX cat samples submitted for WGS and for 12 cats used in variant segregation analysis. These cats received annual echocardiograms by a board-certified veterinary cardiologist and necropsy at the time of death. HCM status was defined as a diastolic left ventricular (LV) segment exceeding 6 mm on two-dimensional (2D) or M-Mode imaging, as well as necropsy findings, which confirmed histopathological evidence of HCM and gross appearance of significant LV hypertrophy. 13
Blood collection and DNA extraction
Whole blood and/or tissue samples were collected for all MC/MCX cats following protocols approved by the University of California Davis Institutional Animal Care and Use Committee. DNA was extracted from whole blood and/or tissue using a commercially available kit (ArchivePure; 5Prime), following the manufacturer’s protocol.
Sanger sequencing
PCR was used to verify the MYBPC3 A31P genotype of the four MC/MCX cats submitted for WGS. We used previously designed primers for the MYBPC3 A31P variant PCR reaction following the manufacturer’s protocol for LA Taq with GC Buffer (Takara). 3 After, ExoSAP-IT (Affymetrix) was used to purify PCR amplicons following the manufacturer’s protocol and PCR products were submitted to the UC Davis DNA Sequencing Facility. For cycle sequencing, ABI BigDye Terminator v3.1 kit (ThermoFisher) was used prior to sequencing on an ABI 3730 Capillary Electrophoresis Genetic Analyzer (ThermoFisher). The sequences were aligned to the Felis catus 6.2 reference genome (https://www.ncbi.nlm.nih.gov/assembly/GCF_000181335.1/) and analyzed using Lasergene Evolution Suite (DNASTAR).
WGS analysis
High-quality DNA (2 µg with a 260/280 ratio of 1.80–1.90) was submitted to BGI@UC Davis (Sacramento, CA) for WGS. Quality control of DNA samples was also completed by BGI@UC Davis via a Quibit Fluorometer (ThermoFisher) and 2% agarose gel electrophoresis. If DNA samples passed quality control and were not degraded, 350 bp and 550 bp insert PCR-free libraries were prepared using an Illumina TruSeq DNA PCR-free sample preparation kit. The libraries were pooled and sequenced on 12 lanes – six lanes per library insert – using an Illumina HiSeq 2500 sequencing platform. This generated paired-end 100 bp reads to achieve a desired coverage of × 30.
As part of the 99 Lives Cat Genome Sequencing Initiative, generated fastq.gz sequence files were submitted to MaverixBiomics to be processed and analyzed as described previously. 20 In short, sequence reads were trimmed with Trimmomatic v0.32, and the quality of reads was evaluated with FastQC v0.10.1 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). 21 Reads were aligned to Felis catus 6.2 genome (https://www.ncbi.nlm.nih.gov/assembly/GCF_000181335.1/) with bwa-mem v0.7.6a. 22 Duplicate reads were detected and removed with SAMBLASTER v0.1.9. 23 Depth of coverage was analyzed with GATK Depth Of Coverage tool v2.3. 24 Variants were called simultaneously on all samples with Platypus v0.7.9.1 and annotated with dbSNP ID.25,26 SnpEff was used to predict variant effects based on the Ensembl gene model.27,28 Variant calls were uploaded to MaverixBiomics Variant Explorer (www.maverix.com).
Detection of candidate variants
Results from SnpEff classified variants as severe (ie, frameshift, splice site, stop gain, etc), moderate (ie, indels, non-synonymous variants, etc) and low (ie, intronic and/or intergenic). 24 These results were incorporated into the Maverix Bioinformatics Variant Explorer interface and only variants classified as severe and/or moderate were used in our filtering analysis. Therefore, to identify variants correlated with an autosomal dominant disease in either a gene encoding for a protein with sarcomeric function and/or a gene previously associated with heart disease the following filtering criteria was used: alternate allele frequency >49% in the two HCM-affected MC/MCX cats and wild type in 79 control cat samples. This stringent filtering protocol would thus identify all heterozygous or homozygous variants that were present in the two cases, whereas all of the control cats were wild type.
Variant segregation analysis
We performed variant segregation analysis using seven MC/MCX cats with HCM and five unaffected MC/MCX cats. We used previously designed primers for the MYBPC gene variant identified. 3 Primers were designed for the additional three variants in genes MFSD12, BTN1A1 and SLTRK5 (see Table 1 in the supplementary material). The PCR reaction was optimized using LA Taq with GC Buffer (Takara) and PCR amplicons were purified with ExoSAP-IT (Affymetrix), following the manufacturer’s protocol. All PCR products were submitted to the UC Davis DNA Sequencing Facility for Sanger sequencing, as described above. Results were analyzed with Lasergene Evolution Suite (DNASTAR).
Results
Sample phenotypes
The MC/MCX control cats were free of LV hypertrophy by echocardiography within 6 months of their deaths and had a normal cardiac necropsy with no evidence of LV hypertrophy or histopathological features of HCM. The control cats were >10 years old at time of necropsy. The MC/MCX HCM-affected cats had a diastolic LV segment exceeding 6 mm via 2D or M-Mode imaging during their annual echocardiogram examinations. Cardiac necropsy examination of the affected cats illustrated the gross appearance of LV hypertrophy. The phenotype of the parents of both HCM-affected MC/MCX submitted for WGS is unknown. The 77 control cats from the consortium were not phenotyped cardiovascularly for this study and represent samples from collaborating institutions. However, none of the additional control cats used for comparison in this study were from the MC breed.
Sanger sequencing to verify genotype
Sanger sequencing verified the MYBPC3 A31P genotype of the four experimental MC/MCX cat samples submitted for WGS (Table 1). We did not have DNA for the 77 additional control cats from the 99 Lives Consortium; thus, we were unable to verify their genotype via Sanger sequencing.
Sanger sequencing verification results for the MYBPC 3A31P variant for two cats with hypertrophic cardiomyopathy and two control Maine Coon (MC)/MC domestic shorthair cross (MCX) cats
n/a = not available
WGS
Maverix Variant Explorer interface was used to identify possible HCM-associated variants in two MC/MCX cases when compared with 79 controls from the 99 Lives Cat Genome Sequencing Initiative (Figure 1). The median percentage of reads mapped to the genome was 98.4% (interquartile range 98.1–98.8%) with a mean ± SD of 56.1 ± 15.0 Gbp for all 81 cats in this study. Additionally, mean ± SD coverage for all 81 cats was 25.28 ± 6.56. In particular, the coverage for the two HCM-affected samples was 27.39 and 24.54, and coverage for the two control samples was 23.89 and 24.07. The reads for all 83 cats were aligned to Felis catus 6.2 genome (https://www.ncbi.nlm.nih.gov/assembly/GCF_000181335.1/) assembly.

Schematic representation for the identification of the A31P variant in MYBPC3 gene.
Variants classified as severe and/or moderate were used in this analysis. A total of 228,598 variants were detected for all 81 WGS cat samples prior to filtering. The two affected MC/MCX cat samples shared 15,205 variants, regardless of the 79 control cats’ variant genotype (Table 2). Of those 15,205 variants, all 81 cats shared 2710 variants that were not further evaluated as these variants would be considered common in the general cat population.
Maverix Biomics Interface variant results for two affected Maine Coon (MC)/MC domestic shorthair cross (MCX) samples
Variants’ effect severity was predicted by SnpEff. 24 High effect severity refers to frameshift, stop gained and splice site donor/acceptor variants. Moderate refers to single nucleotide polymorphisms, indels and non-synonymous variants. Lowest refers to synonymous variants and lowest effect severity are intron and intergenic variants
Four variants were identified after stringent filtering criteria, described in the ‘Materials and methods’, in the two affected MC/MCX cats, and the four variants have a predicted moderate effect severity (Table 3). Variant segregation analysis was performed in a cohort of 12 MC/MCX cats (seven affected and five controls) to determine if the variants segregated with disease status (Table 4). The M3VYP3:c.112C>G(p.A38P) variant MYBPC3 was the only variant that segregated with HCM affection status. Furthermore, MYBPC3 is a sarcomeric protein-encoding gene that was previously associated with HCM.3,4 The identified variants M3VVL0:c.109C>T(p.V37M) and M3WR01:c.73C>G(p.D25H) in MFSD12 and BTNA1A, respectively, did not segregate with disease and these genes have not been previously associated with heart disease. The other variant identified was M3X7P5:c.1199C>A(p.T400K) in SLITRK5. This gene is involved in neurite modulation and its protein is expressed abundantly in neural tissue. 29 While no other variants in SLITRK5 are directly linked to HCM, this gene was found to be linked to coronary heart disease in humans, as well as an active myocardial signaling pathway known as Slit-Robo.30,31 Alterations to this signaling pathway are associated with congenital heart defects. 31 Regardless, segregation analysis illustrated that the M3X7P5:c.1199C>A(p.T400K) variant in SLITRK5 did not segregate with disease.
Maverix Biomics Variant results for an autosomal dominant trait
Alternate allele frequency >49% in two cats with hypertrophic cardiomyopathy (Maine Coon [MC]/MC domestic shorthair cross [MCX] samples); homozygous to reference (wild type) for 79 controls
Alt = alternate allele; AA = amino acid
Variant segregation analysis genotype results for seven Main Coon (MC)/MC domestic shorthair cross (MCX) cats with hypertrophic cardiomyopathy (HCM) and five control MC/MCX cats
HO = homozygous; HET = heterozygous; WT = wild type
The previously identified MYBPC3 A31P variant by Meurs et al utilized human reference sequence to design their oligonucleotides. 3 The variant M3VYP3:c.112C>G (p.A38P) identified in this study utilized the cat reference genome Felis catus 6.2 (https://www.ncbi.nlm.nih.gov/assembly/GCF_000181335.1/), which became available after the publication of Meurs et al. 3 Therefore, to determine if these two variants are the same, we analyzed nucleotide and amino acid sequences surrounding both variants, using Ensembl (http://uswest.ensembl.org/index.html). Nucleotide sequence and amino acid alignment for both of these variants verified that M3VYP3:c.112C>G (p.A38P) variant was the previously reported MYBPC3 A31P MC cat variant. Difference in annotation is due to an additional seven amino acid in the cat MYBPC3 gene transcript ENSFCAT00000002530.3 that is not found on the human MYBPC3 gene transcript ENST00000256993.8. The additional seven amino acids are most likely caused by a transcript annotation error as the most recent MYBPC3 gene cat transcript ENSFCAT00000002530.4 also places this variant in position 31.
Discussion
This proof-of-concept study illustrates the advantages of using a WGS approach for precision medicine in cats. The identification of M3VYP3:c.112C>G(p.A38P) variant, which is the same variant previously identified by Meurs et al, further supports and validates this technique. 3 The study design relies on the assumption that the disease of interest is breed specific, as is previously reported in cats with HCM.3,4 Additionally, this study further illustrates that utilizing a small, strictly phenotype-affected sample cohort and a large control population of unknown phenotype allows for the identification of a variant associated with disease. 11
This study’s WGS methodology also revealed three additional variants that met our filtering criteria. The variants M3VVL0:c.109C>T(p.V37M) and M3WR01:c.73C>G (p.D25H) in the MFSD12 and BTN1A1 genes, respectively, were omitted as these are not genes that encode for a protein with sarcomeric function and/or previously associated with heart disease. Published alterations in MFSD12 were linked to modification in melanocytes in both humans and mice. 32 Alterations in BTN1A1 were associated with immune regulation and/or milk fat globule secretion and not cardiac disease.33,34 Furthermore, variant segregation analysis in a cohort of seven affected and five control MC/MCX cats illustrated that these variants did not segregate with HCM status.
The only additional candidate variant was M3X7P5: c.1199C>A(p.T400K) in SLITRK5. This gene was previously associated with heart disease, although not a classical HCM-associated gene.30,31,35 Nevertheless, segregation analysis of the SLITRK5 variant demonstrated that it did not segregate with HCM status. Even though the three additional variants identified did not segregate with disease, the role of compound variants and modifier genes warrants consideration in feline HCM. 36 This can be accomplished by sequencing additional HCM-affected MC/MCX cats to identify novel HCM variants that segregate with disease.
Two different HCM variants in MYBPC3 have been identified in two feline breeds, MC and Ragdoll.3,4 MYBPC3 encodes cardiac myosin binding protein-C (cMyBP-C), a critical regulator of cardiac contraction that is necessary for normal heart function, as well as for increased contractility in response to inotropic stimuli. 37 In people, variants in MYBPC3 are one of the top two most common causes of HCM.38,39
In cats, variants in MYBPC3 were initially reported to cause disease in an autosomal dominant fashion, although evidence has since been presented that the MYBPC3 A31P variant more commonly behaves as autosomal recessive.2,3,16,17,40 It was recently shown that cats heterozygous for the A31P variant express very little of the A31P mutant protein and instead express mainly wild-type protein. 41 One possibility is that structural instability of the mutant protein leads to faster turnover of the A31P cMyBP-C relative to the wild-type protein. However, in homozygous cats that express only the A31P mutant protein, the mutant protein could directly cause disease through a poison polypeptide mechanism or, indirectly, by affecting protein degradation pathways. Regardless, disease penetrance in homozygous A31P cats is higher than in cats heterozygous for the variant.16,17,42
In this proof-of-concept study, we rapidly identified the previously reported MYBPC3 A31P HCM MC cat variant. 3 The difference in amino acid numbering is due to different reference genomes and transcripts being utilized, and most likely due to an transcription error in the cat Ensembl transcript (ENSFCAT00000002530.3). To our knowledge, this is the first WGS study that utilized two distantly related HCM cases for the identification of a disease-causing variant. A limitation to this study was that apart from the two control MC/MCX cats, the remaining 77 control cats were not rigorously phenotyped via echocardiogram. Therefore, we do not know the age of these 77 cats or if they were diagnosed with HCM. Variants in MYBPC3 have low penetrance and are likely breed specific. Utilizing the additional 77 control unphenotyped samples could allow for the identification of novel variants in future studies when only a small HCM-affected sample cohort is available. This is owing to the control samples from the 99 Lives Consortium consisting of various cat breeds, including nine wild felids. Therefore, using this control population allows for rapid identification of common variants in the general cat population that would not be associated with disease.
Coverage for the known HCM genes that encode for proteins with sarcomeric function was not specifically reported in this study. Given that the mean ± SD coverage obtained for all samples in this study was 25.28 ± 6.56, we are confident with the reliability of variants identified in this group of cats. While targeted resequencing study designs specifically targeting sarcomeric genes may be able to provide a greater coverage depth, our study design has a distinct advantage of allowing the identification of non-sarcomeric protein coding variants associated with HCM across the entire genome. For this reason, WGS is commonly utilized for variant discovery in human HCM. 42
A limitation of this study is that a pedigree analysis to definitively determine pattern of inheritance was not performed and, as such, a definitive call on variant causation cannot be conclusively determined. However, the MYBPC3 A31P variant identified in the two cases of this study represented one heterozygous cat and one homozygous mutant cat. Additionally, the segregation analysis revealed two additional cats, which had HCM and an MYBPC3 A31P heterozygous genotype. These findings support the previously reported autosomal-dominant pattern of inheritance for the MYBPC3 A31P variant and warrant future evaluation with large pedigree analysis studies. 2
In the absence of a well-annotated genome, WGS data can be difficult to interpret. Specifically, disease association with variants of unknown significance (VUS) is sometimes observed. This issue has been reported in human HCM sequencing studies and underscores the possibility that many cases of HCM may not be attributed to a single variant.43–45 This problem also applies to cat sequencing studies where the annotation of the current feline reference genome is incomplete. In this study, the authors evaluated all VUS to determine if they segregate with disease status and for biological relevance based on what is currently known for gene function and disease association across species. Additionally, the authors intend to revisit sequencing data in the future as genome annotations improve.
The efficient variant filtering and successful identification of the known MYBPC3 A31P variant, as well as a second candidate variant, highlights the power of this methodology and value of the 99 Lives Cat Genome Sequencing Initiative. Our results clearly support continued precision medicine efforts to study the genetic basis of HCM in cats.
Conclusions
In this proof-of-concept study we confirmed that our methodology would rapidly identify the MYBPC3 A31P variant in two HCM-affected MC/MCX cats when compared with 79 controls of various breeds. Furthermore, variant segregation analysis demonstrated that the additional variants identified in this study did not segregate with disease in a cohort of seven HCM-affected and five control MC/MCX cats. HCM is one of the most common diagnoses in cats, and thus far only two variants have been reported to be associated with this disease in cats. We demonstrate how this methodology can be utilized to discover variants associated with HCM in a small sample cohort when compared with a large control population. Future studies can utilize this approach to identify novel variants associated with HCM, which can lead to a genetic test to reduce the prevalence of this disease in the cat population.
Supplemental Material
Supplementary Table 1
Primers used in segregation analysis for the four identified variants in two affected and two control MC/MCX cats
Footnotes
Acknowledgements
Besides two of the authors (JAS and YU), we would like to acknowledge the 99 Lives Sequencing Consortium for providing a large control sample cohort for this manuscript. Each member of the consortium has reviewed and approved this manuscript for publication. The 99 Lives Consortium consists of the following members: Leslie A Lyons and Barbara Gandolfi (Department of Veterinary Medicine and Surgery, College of Veterinary Medicine, University of Missouri, USA); Danielle Aberdein, Dorian J Garrick and John S Munday (Institute of Veterinary, Animal and Biomedical Sciences, Massey University, New Zealand); Paulo C Alves (CIBIO/InBIO, University of Portugal, Portugal; Wildlife Biology Program, University of Montana, USA); Gregory S Barsh and Christopher B Kaelin (HudsonAlpha Institute for Biotechnology, USA; Department of Genetics, Stanford University, USA); Holly C Beale and Patricia P Chan (Maverix Biomics, USA); Adam R Boyko (Department of Biomedical Sciences, College of Veterinary Medicine, Cornell University, USA); Jeffrey A Brockman (Hill’s Pet Nutrition, USA); Marta G Castelhano, Rory J Todhunter and Elizabeth A Wilcox (Department of Clinical Sciences, College of Veterinary Medicine, Cornell University, USA); N Matthew Ellinwood, Max F Rothschild and Dorian J Garrick (Department of Animal Science, College of Agriculture and Life Sciences, Iowa State University, USA); Jonathan E Fogle (College of Veterinary Medicine, North Carolina State University, USA); Christopher R Helps (Langford Veterinary Services, University of Bristol, UK); Marjo K Hytönen and Maria Kaukonen (Department of Veterinary Biosciences and Research Programs Unit, Molecular Neurology, University of Helsinki and Folkhälsan Research Center, Finland); Emilie Leclerc (Diana Pet Food, France); Maria Longeri (Department of Veterinary Medicine, University of Milan, Italy); Richard Malik (Centre for Veterinary Education, University of Sydney, Australia); Michael J Montague and Wesley C Warren (The McDonnell Genome Institute, Washington University School of Medicine, USA); William J Murphy (Department of Veterinary Integrative Biosciences, College of Veterinary Medicine, Texas A&M University, USA); Niels C Pedersen (Department of Medicine and Epidemiology, School of Veterinary Medicine, University of California at Davis, USA); William F Swanson (Center for Conservation and Research of Endangered Wildlife [CREW], Cincinnati Zoo & Botanical Garden, USA); Karen A Terio (Zoological Pathology Program, University of Illinois, USA); and Julia H Wildschutte (Bowling Green State University, Department of Biological Sciences, USA).
Supplementary material
The following file is available online: Table 1: Primers used in segregation analysis for the four identified variants in two affected and two control MC/MCX cats.
Conflict of interest
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
Funding support for this study was provided by BGI@UCDavis.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
