Abstract
It has been estimated that 60–70% of neural tube defects (NTDs) have a genetic component, but few causative genes have been identified. The lack of information on genes associated with non-syndromic NTDs in humans is especially notable as the “genomic revolution” has led to new tools (e.g., genome-wide genotyping arrays, next-generation sequencing) that are helping to elucidate the full spectrum of genetic variation (from common to rare) contributing to complex traits, including structural birth defects. However, the application of modern genomic approaches to the study of NTDs has lagged behind that of some other common structural birth defects. This may be due to the difficulty of assembling large study cohorts for anencephaly or spina bifida. The purpose of this review is to outline the evolution of genetic studies of NTDs, from studies of familial aggregation to candidate gene and genome-wide association studies, through whole-exome and whole-genome sequencing. Strategies for addressing gaps in NTD genetic research are also explored.
Introduction
Neural tube defects (NTDs) are among the most frequent, most costly, and most deadly of all congenital anomalies [1, 2]. The most common NTDs are anencephaly, which results from failure of fusion of the cranial neural tube, and myelomeningocele (commonly called spina bifida), which results from failure of fusion in the spinal region of the neural tube [2]. While anencephaly is incompatible with life, individuals with other forms of NTDs typically live long lives due to medical advances over the past half century but exhibit a spectrum of neurological deficits that affect multiple organ systems as well as motor and/or sensory functions.
The number of infants born with NTDs varies temporally, by race/ethnicity, and by geographic region. For example, in recent years, approximately 1 in 1,000 infants is born with a NTD in the United States and many European countries is estimated to be 0.5–0.8 cases per 1,000, whereas in northern China, it has been reported to be as high as 10 per 1,000 [1, 3]. In spite of their clinical and public health significance, the majority of the risk for NTDs cannot be accounted for by known risk factors [4].
One of the most important epidemiologic findings related to the etiology of NTDs is the protective effect of maternal periconceptional folic acid supplementation [2, 5]. This finding has been translated into public health policies, including food fortification programs in many countries [6]. While folic acid fortification has led to a 28% reduction in the birth of infants with anencephaly and spina bifida in the United States since mandatory fortification began in 1998 [7], NTDs continue to occur despite fortification.
There are other known causes of NTDs, including genetic syndromes [2, 5]. For instance, NTDs can occur in children with certain chromosomal anomalies (e.g., trisomy 13, 18, and 21) [2, 5] and chromosomal deletion syndromes (e.g., 22q11 deletion syndrome), and there is evidence that NTDs are associated with Waardenburg syndrome [8]. In addition, a few families have been observed with X-linked inheritance of anencephaly [9, 10], and anencephaly has recently been associated with mutations in TRIM36 [11]. While these syndromes and single gene defects account for only a small proportion of individuals with NTDs, the remainder without an identified syndrome are also thought to involve genetic factors because the risk of NTDs is increased in the relatives of individuals with the condition (see “Family History, Aggregation, and Linkage Studies”). In fact, it has been estimated that 60-70% of NTDs have a genetic component [1], but few causative genes have been identified.
The lack of information on genes associated with non-syndromic NTDs in humans is especially notable as the “genomic revolution” has led to new tools (e.g., genome-wide genotyping arrays, next-generation sequencing) that are helping to elucidate the full spectrum of genetic variation (from common to rare) contributing to complex traits, including structural birth defects. The application of modern genomic approaches to the study of NTDs has, however, lagged behind that of some other common structural birth defects. For example, there has yet to be a published genome-wide association or whole-exome sequencing study of spina bifida [12]. This lag may due to several factors including: (1) prenatal detection of NTDs leading to pregnancy terminations, (2) high mortality, and (3) reduced reproduction among individuals with these conditions that make assembling a large study cohort difficult. The purpose of this review is to outline the evolution of genetic studies of NTDs, from studies of familial aggregation to candidate gene and genome-wide association studies, through whole-exome and whole-genome sequencing. Strategies for addressing gaps in NTD genetic research are also explored.
Family history, aggregation, and linkage studies
Having a family history of spina bifida or anencephaly is one of the strongest
risk factors for these outcomes [5]. Familial
aggregation studies, largely conducted in the 1970s and 1980s, demonstrated that the risk
for spina bifida or anencephaly, or both, was increased in the relatives of individuals with
the condition: 3–8% in siblings (first-degree relatives); 1–2% in second-degree relatives;
and 0.5% in third-degree relatives [13, 14, 15,
16, 17] – compared to 0.1% (or 1 per 1,000) in the United States. These studies clearly
demonstrated: (1) increased risk of NTDs if other family members have NTDs, (2) risk
increased with the number of relatives with the condition, which is not consistent with a
single genetic locus underlying risk; (3) a non-linear decline in risk with decreasing
degree of relationship; and (4) in most families (
Genetic association studies
Study design considerations
The majority of genetic association studies of NTDs have used a case-control study design. In this approach, the frequency of variants in putative disease genes is compared between cases (i.e., individuals with the condition) and controls (i.e., individuals without the condition). This study design is particularly useful when studying rare outcomes, such as NTDs, and is often used in genetic association studies of other conditions. For instance, the case-control study design was used to identify genes associated with neuroblastoma (a relatively rare pediatric malignancy), which has led to improved therapeutic options for these children [20]. However, genetic association studies using the case-control design are vulnerable to a type of confounding referred to as “population stratification bias,” where a false association between a genotype and disease, or the masking of a genotypic effect, is induced by the existence of subgroups within a population (e.g., different racial or ethnic groups) that have different genotype frequencies and frequencies of disease [21].
Another study design emerged in the early 1990s that has proven to be very useful in genetic association studies of structural birth defects: the case-parent trio design (or child-parent trio). The trio is composed of the child with the condition and his or her biological parents. This design is particularly useful for studies of birth defects and conditions with early disease onset, since parents of children with these conditions are generally available. Several methods for analyzing the data generated in a case-parent trio study have been developed, including the transmission disequilibrium test [22, 23] and approaches using log-linear models [24]. The child-parent trio design has the advantage (as compared to case-control studies) of being immune to population stratification bias when assessing the effects of the inherited genotype. Furthermore, this design can be used to assess maternal genetic effects (i.e., the effect of the maternal genotype on the phenotype of offspring) without incurring additional genotyping expenses (i.e., in the case-control design, evaluation of the maternal and case genotype would require genotyping cases and controls as well as the mothers of these individuals).
Candidate gene approaches
Regardless of the study design, early genetic association studies relied on a candidate gene approach [25]. With this approach, candidate genes are selected based on current understanding of the disease. For example, given the known association of NTDs and folic acid, many candidate gene studies of NTDs focused on genes that are involved in folate metabolism [1, 2, 5, 26]. Variants within the candidate gene (e.g., single-nucleotide polymorphisms or SNPs) are then genotyped and evaluated for their association with the condition of interest. While there have been a number of published studies implicating genes involved in folate metabolism, with the exception of the MTHFR C677T variant, these reports tend to characterized by weak and imprecise estimates of association, as well as a lack of consistent replication across studies. Notably, even for the MTHFR C677T variant, the observed association does not change the recommendation that women of childbearing age should take the standard dose of folic acid supplementation as per the general population guidelines [27].
Other candidate genes and candidate gene pathways have been explored in relation to NTD risk, including: genes associated with diabetes and obesity (maternal risk factors for NTDs) [28]; genes identified in animal models of NTDs (e.g., Pax3) [29]; genes that are involved in the metabolism of known teratogenic agents; and genes that have been implicated in the metabolism of compounds needed for embryonic development [2, 5]. However, no candidate genes in these (or other) categories have been definitely linked to the risk of NTDs.
A sample of statistically significant findings from GWAS of structural birth
defects
A sample of statistically significant findings from GWAS of structural birth defects
Abbreviations: het
Genetic study designs and approaches that have been applied to neural tube defects versus other structural birth defects among published studies
Abbreviations: GWAS
Advances in technology that permit affordable and reliable genotyping of
millions of SNPs have provided the opportunity to expand beyond candidate gene association
studies to an “agnostic” genome-wide approach, commonly referred to as genome-wide
association studies or GWAS. In GWAS, hundreds of thousands or millions of SNPs are tested
for association. Because of the enormous number of tests, the commonly accepted threshold
for statistical significance in a GWAS is
In comparison to other diseases, there have been relatively few GWAS of
structural birth defects [30, 31, 32, 33, 34].
This is largely due to the difficulty of assembling the large study populations needed for
GWAS (i.e.,
Other genome-wide approaches that may yield insights into the etiologies of NTDs include the analysis of copy-number variation. Rather than focusing on single base pairs (e.g., SNPs), when evaluating copy-number variants (CNVs), investigators interrogate structural variation involving the number of copies of specific regions of DNA, which can either be deleted or duplicated. These chromosomal deletions and duplications involve fairly large stretches of DNA. Previous assessments of pediatric neurodevelopmental conditions have demonstrated the importance of CNVs in human disease [35]. However, as with GWAS, there are currently no published genome-wide assessments of CNVs among individuals with NTDs.
Whole-exome and whole-genome sequencing
Whole-exome and whole-genome sequencing (WES and WGS, respectively) studies have moved to the forefront of genomic research in recent years. As with the shift from candidate gene studies to GWAS, this is largely due to advances in technology that have led to reductions in the cost and time required to sequence DNA. The current focus of WES/WGS is on identification of rare mutations that have a large impact on disease risk. With that said, child-parent trios have reemerged as an important study design, partly because of the potential importance of de novo (i.e., new) mutations. New mutations have long been known to cause genetic disease, but their true contribution to disease burden in the absence of significant family history can only now begin to be determined using child-parent trio-based WES or WGS approaches. Notably, recent studies demonstrate that de novo mutations contribute to a range of rare and common pediatric phenotypes, including seemingly sporadic malformation syndromes, congenital heart defects, intellectual disability, autism, and schizophrenia [36, 37, 38, 39, 40, 41, 42, 43]. Furthermore, de novo mutations provide a mechanism by which early-onset reproductively-lethal diseases remain prevalent in the population. However, de novo mutations do not explain the strong association between a family history of NTDs and the risk of these outcomes, so future assessments must still consider the role of inherited genetic variation. Additionally, when present, WES/WGS of several of multiplex families (in which many related individuals have NTDs) will also be informative in disentangling the genetics of these conditions. As with GWAS, there have been no large-scale efforts to conduct WES or WGS among those with spina bifida or anencephaly to identify the recurrent de novo mutations or novel inherited variants that play a role in the etiology of NTDs.
Conclusions and future directions
While there is strong evidence for a significant genetic component to the
etiology of NTDs, little headway has been made to identify genetic variations related to
these conditions. In fact, as highlighted in this review, understanding genetic
susceptibility to NTDs continues to lag behind that of other common structural birth
defects. This is in part due to the lack of large well-characterized study populations with
biological samples collected with appropriate informed consent for more modern analytical
techniques, such as GWAS, WES, and WGS. Future studies should not only take advantage of
modern genomic approaches, but should also leverage existing study cohorts, including the
National Spina Bifida Patient Registry [44], which
has enrolled thousands of individuals with spina bifida with the goals of improving the
health outcomes of people living with spina bifida and building a foundation for ongoing
research. Another important avenue to explore in human populations is evaluating the
Footnotes
Acknowledgments
The authors would like to acknowledge Robert Roger Lebel, MD, FACMG for his review and comments.
Conflict of interest
The authors have no conflicts of interest to report.
