Abstract
The neuron-restrictive silencer factor (NRSF) is a zinc finger transcription factor that represses neuronal gene transcription in non-neuronal cells by binding to the consensus repressor element-1 (RE1) located in regulatory regions of target genes. NRSF silences the expression of a wide range of target genes involved in neuron-specific functions. Previous studies showed that aberrant regulation of NRSF plays a key role in the pathological process of human neurodegenerative diseases. However, a comprehensive set of NRSF target genes relevant to human neuronal functions has not yet been characterized. We performed genome-wide data mining from chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) datasets of NRSF binding sites in human embryonic stem cells (ESC) and the corresponding ESC-derived neurons, retrieved from the database of the ENCODE/HAIB project. Using bioinformatics tools such as Avadis NGS and MACS, we identified 2,172 NRSF target genes in ESC and 308 genes in ESC-derived neurons based on stringent criteria. Only 40 NRSF target genes overlapped between both data sets. According to motif analysis, binding regions showed an enrichment of the consensus RE1 sites in ESC, whereas they were mainly located in poorly defined non-RE1 sites in ESC-derived neurons. Molecular pathways of NRSF target genes were linked with various neuronal functions in ESC, such as neuroactive ligand-receptor interaction, CREB signaling, and axonal guidance signaling, while they were not directed to neuron-specific functions in ESC-derived neurons. Remarkable differences in ChIP-Seq-based NRSF target genes and pathways between ESC and ESC-derived neurons suggested that NRSF-mediated silencing of target genes is highly effective in human ESC but not in ESC-derived neurons.
Introduction
The neuron-restrictive silencer factor (NRSF), also known as repressor element-1 (RE1) silencing transcription factor (REST), is a zinc finger transcription factor that represses neuronal gene transcription in non-neuronal cells by binding to the consensus RE1 site, also known as neuron-restrictive silencer element (NRSE), located in regulatory regions of target genes. 1 NRSF silences the expression of a wide range of target genes involved in neuron-specific functions encoding ion channels, synaptic vesicle proteins, neurotransmitter receptors, neurotransmitter-synthesizing enzymes, neuropeptides, cell adhesion molecules, and cytoskeletal components. 2 When NRSF represses the transcription of target genes, it recruits the repressor complex, composed of the REST corepressor COREST, transcription regulator SIN3A, methyl CpG binding protein MECP2, histone deacetylases, histone modifying enzymes, and the components of the SWI-SNF chromatin remodeling complexes. 3
NRSF acts as a master negative regulator of neurogenesis in the central nervous system (CNS) during development. It is expressed at variable levels in most nonneuronal cell types, while the expression levels are reduced in neuronal progenitor cells and more greatly suppressed in differentiated neurons in adult rodent brains. 4
NRSF plays a pivotal role in the control of proper timing of neuronal differentiation in neural progenitor cells. 5 Sustained expression of NRSF inhibits radial migration of neural progenitor cells by targeting doublecortin in the developing rat brain. 6 Overexpression of the NRSF competitor REST-VP16 induces neuronal differentiation by direct activation of NRSF target genes in mouse neural stem cells. 7
Because most previous studies on NRSF employed rodent models,3–7 the functional roles of NRSF remain largely unknown in the human CNS during development and maturation, except for a previous study showing that blocking NRSF function induces apoptosis in human medulloblastoma cells, although the target genes of NRSF were not characterized. 8
Increasing evidence indicates that aberrant regulation of NRSF is involved in the pathological process of human neurodegenerative diseases, such as Huntington's disease (HD) and Alzheimer's disease (AD). Wild-type huntingtin (HTT), the causative gene of HD, binds to and sequesters NRSF protein in the cytoplasm, thereby inhibiting the transcriptional repression of BDNF, a cardinal target of NRSF. 9 In contrast, mutant HTT protein with a polyglutamine repeat expansion causes nuclear accumulation of NRSF and persistently represses the expression of neuronal NRSF target genes in HD brains. 9 Furthermore, the binding ability of NRSF to NRSE is enhanced in lymphocytes of HD patients. 10 NRSF protein levels are elevated in AD brains correlated with reduced expression of the choline acetyltransferase gene. 11 These observations suggest that to clarify the role of NRSF in the pathogenesis of human neurodegenerative diseases, it is important to characterize the comprehensive list of NRSF target genes relevant to human neuronal functions and dysfunctions.
Recently, the next-generation sequencing (NGS) technology had a great impact on the field of genome research. Chromatin immunoprecipitation followed by deep sequencing (ChIP-Seq) serves as a highly efficient NGS method for genome-wide profiling of DNA-binding proteins, histone modifications, and nucleosomes, with high resolution and great coverage of the genome. 12 Increasing amounts of NGS data have been deposited in public databases, such as the Sequence Read Archive (SRA) (http://www.ncbi.nlm.nih.gov/sra) and the European Nucleotide Archive (ENA) (http://www.ebi.ac.uk/ena/home). Since these databases contain extremely high-throughput experimental data, it is often difficult to extract meaningful biological implications. Recent advances in systems biology enable illustration of a cell-wide map of complex molecular interactions by using the literature-based knowledgebase of molecular pathways. 13 The logically arranged molecular networks construct the entire system characterized by robustness, which maintains the proper function of the system in the face of genetic and environmental perturbations. Therefore, the integration of extremely high dimensional NGS data with underlying molecular pathways and networks by using bioinformatics tools of knowledgebase offers a rational approach for characterizing the pathway- and network-based molecular mechanisms of gene regulation on the whole genome scale. 13
To identify genome-wide NRSF target genes relevant to human neuronal functions, we compared ChIP-Seq-based NRSF binding sites between human embryonic stem cells (ESC) and the corresponding ESC-derived neurons and studied the difference in molecular pathways and networks of NRSF target genes. We found remarkable differences in NRSF target genes and pathways between ESC and ESC-derived neurons, suggesting that NRSF-mediated silencing of target genes is highly effective in human ESC but not in ESC-derived neurons.
Methods
ChIP-Seq dataset of NRSF target genes.
To identify a comprehensive set of NRSF target genes in human ESC and ESC-derived neurons, we investigated FASTQ-formatted files of ChIP-Seq datasets retrieved from the DDBJ Sequence Read Archive (DRA) (http://trace.ddbj.nig.ac.jp/DRASearch) under the accession number of SRX100410, SRX190260, SRX100409, and SRX190329. The datasets were originally prepared for the Encyclopedia of DNA Elements (ENCODE) project by Dr Richard Myers, HudsonAlpha Institute for Biotechnology (HAIB). In their experiments, human H1 ESC cells established from inner cell mass, presenting with a normal karyotype, were processed to induce neuronal differentiation. Both ESC and ESC-derived neurons were processed for ChIP with mouse monoclonal anti-NRSF antibody (Protein Expression Center, Caltech). Next, NGS libraries constructed from size-selected ChIP DNA fragments were processed for deep sequencing at a read length composed of 30–36 bp for ESC or 40–50 bp for neurons on Genome Analyzer (Illumina, San Diego, CA, USA). In both cell types, the corresponding libraries termed RevXlinkChromatin made from sonicated chromatin, whose formaldehyde cross-links are reversed without immunoenrichment, were utilized for input control.
First, we evaluated the quality of short reads by searching them on the FastQC program. Next, we removed the reads of insufficient quality by filtering them out by the FASTX-toolkit. After cleaning the data, we mapped them on the human genome reference sequence version hg19 using a mapping tool of Avadis NGS (Agilent Technologies, Santa Clara, CA, USA) or using the Bowtie program. We then identified significant peaks by using the Model-based Analysis of ChIP-Seq (MACS) program in the stringent condition that satisfies the false discovery rate (FDR) ≤ 0.01 and fold enrichment (FE) ≥ 20 in order to reduce the detection of false-positive binding sites if possible as described previously. 14 We characterized the genomic location of binding peaks using a peak-finding tool of Avadis NGS that classifies the locations into the upstream region, 5′ untranslated region (5′UTR), exon, intron, and 3′UTR. To identify the exact location of binding sites, we imported the Bowtie and MACS-processed data into the GenomeJack genome viewer (Mitsubishi Space Software) as described previously. 14
The genes corresponding to MACS peaks were determined by a neighboring gene analysis tool of Avadis NGS within a distance of 5000 bp from peaks to genes. The consensus sequence motifs were identified by importing a 400 bp-length sequence surrounding the summit of MACS peaks derived from top 100 genes based on FE into the MEME-ChIP program. The utility of QC tools, Bowtie, MACS, and MEME-ChIP is described elsewhere.15–18 The information on websites and analysis conditions of bioinformatics tools is summarized in Supplementary Table 1.
Top 20 Most Enriched ChIP-Seq-based NRSF target genes in human ESC.
From ChIP-Seq datasets, we identified 2,172 NRSF target genes in human ESC based on stringent criteria that satisfy both false discovery rate (FDR) ≤ 0.01 and fold enrichment (FE) ≥ 20. Top 20 genes based on FE are listed with the chromosome, the start base position, the end base position, FE, FDR, Entrez Gene ID, Gene Symbol, and Gene Name.
Molecular pathway analysis.
To identify biologically relevant molecular pathways and networks, we imported Entrez Gene IDs of NRSF target genes into the Functional Annotation tool of Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7. 19 DAVID extracts gene ontology (GO) terms enriched in the set of imported genes and identifies relevant pathways constructed by Kyoto Encyclopedia of Genes and Genomes (KEGG) or the Protein Analysis Through Evolutionary Relationships (PANTHER) classification system. The results are followed by statistical evaluation with the modified Fisher's exact test corrected by multiple comparison tests. In DAVID analysis data, we considered P-value < 0.05 by Bonferroni correction as significant. KEGG is a publicly accessible knowledgebase that contains manually curated 272,733 pathways that cover a wide range of metabolic, genetic, environmental, and cellular processes, as well as human diseases. PANTHER is a publicly accessible knowledgebase that includes the information on 176 signaling and metabolic pathways manually curated by expert biologists.
We also imported Entrez Gene IDs of NRSF target genes into the Core Analysis tool of Ingenuity Pathways Analysis (IPA) (Ingenuity Systems). IPA is a commercial knowledgebase that contains approximately 3,000,000 biological and chemical interactions with definite scientific evidence. By uploading the list of Gene IDs, the network-generation algorithm identifies focused genes integrated in global molecular pathways and networks. IPA calculates the score P-value that reflects the statistical significance of association between the genes and the pathways and networks by Fisher's exact test. For IPA network and pathway analysis data, we considered P-value < 0.05 by Fisher's exact test as significant.
Results
Identification of ChIP-Seq-based NRSF target genes in human ESC and ESC-derived neurons.
After mapping short reads on hg19 and detecting peaks by MACS on Avadis NGS, we initially identified 36,096 ChIP-Seq peaks of NRSF target genes in ESC and 45,108 peaks in ESC-derived neurons. After performing the neighboring gene search within a distance of 5,000 bp from peaks to genes, these were reduced to 12,757 genes in ESC and 15,802 genes in ESC-derived neurons, respectively. These results indicate that approximately 65% of ChIP-Seq peaks were located in intergenic regions far from the nearest genes. Upon application of stringent criteria that satisfied both FDR ≤ 0.01 and fold enrichment (FE) ≥ 20, the results were further reduced to 2,697 genes in ESC and 388 genes in ESC-derived neurons, respectively. After omitting non-protein coding genes, we extracted 2,172 protein-coding genes in ESC (FE = 59.4 ± 38.8) and 308 genes in ESC-derived neurons (FE = 26.0 ± 6.8). The top 20 genes based on FE are listed in Table 1 for ESC and Table 2 for ESC-derived neurons. The complete lists are shown in Supplementary Table 2 for ESC and Supplementary Table 3 for ESC-derived neurons.
Top 20 Most Enriched ChIP-Seq-based NRSF target genes in human ESC-derived neurons.
From ChIP-Seq datasets, we identified 308 NRSF target genes in human ESC-derived neurons based on stringent criteria that satisfy both false discovery rate (FDR) ≤ 0.01 and fold enrichment (FE) ≥ 20. Top 20 genesbased on FE are listed with the chromosome, the start base position, the end base position, FE, FDR, Entrez Gene ID, Gene Symbol, and Gene Name.
Top 5 IPA canonical pathways of NRSF target genes in human ESC and ESC-derived neurons.
By importing Entrez Gene IDs of ChIP-Seq-based NRSF target genes into the Core Analysis tool of IPA, we identified canonical pathways with significant relevance to the imported genes. Top 5 pathways based on P-value are listed with the category, focused molecules, ratio, and P-value of Fisher's exact test. The NRSF target genes shared between ESC and ESC-derived neurons are underlined.
Importantly, when compared with 985 ChIP-Seq-based NRSF target genes in Jurkat human T cells identified in a previous study, 20 405 genes (41.1%) of Jurkat overlapped with NRSF target genes of human ESC, supporting the validity of our analysis (Supplementary Table 4). In contrast, only 40 genes overlapped between human ESC and ESC-derived neurons. These included ST8SIA3, NPAS4, EP300, NKX1–1, ARID2, SLCA7 A14, MEX3B, GPR26, ZSWIM5, SEZ6 L2, ASPHD1, DDX39B, CELSR3, GAD2, SPARTA16, MPHOSPH10, MCEE, RAB28, FAM117B, KCNMA1, MRP63, SKA3, TMEM187, LATS1, PGK1, RPS17, LRP6, H3F3B, EIF4 A2, MGEA5, RPL10, KIF21 A, FUS, AZIN1, WHSC2, BTG1, CCDC88 A, WWC1, TMED2, and GLCE (underlined in Supplementary Table 2 and Supplementary Table 3). The network analysis of 40 genes by IPA indicated that the set of 14 genes (35%), such as ARID2, MEX3B, SEZ6 L2, GAD2, RAB28, SKA3, RPS17, RPL10, KIF21A, FUS, AZIN1, WHSC2 (NEFLA), CCDC88A, and TMED2, belong to the molecules interacting with UBC (ubiquitin C) (Supplementary Fig. 1).

The genomic location of ChIP-Seq-based NRSF target gene EP300 in ESC. An example of NRSF target gene in ESC named histone acetyltransferase p300 (EP300) is shown on GenomeJack, where a MACS peak numbered 26542 is located in the exon of EP300. The panels (
Among 2,172 genes detected in ESC, the peaks were located in the upstream region (8.6%), 5′UTR (12.8%), exon (21.9%), intron (53.4%), and 3′UTR (3.3%). Among 308 genes detected in ESC-derived neurons, the peaks were located in the upstream region (1.9%), 5′UTR (30.9%), exon (27.4%), intron (36.2%), and 3′UTR (3.7%). Thus, the intron serves as a major NRSF-binding site in both ESC and ESC-derived neurons. The more precise location was determined by importing processed data into the GenomeJack genome viewer (Fig. 1 for ESC and Fig. 2 for ESC-derived neurons).

The genomic location of ChIP-Seq-based NRSF target gene SLC7 A14 in ESC-derived neurons. An example of NRSF target gene in ESC-derived neurons named solute carrier family 7, member 14 (SLC7A14) is shown on GenomeJack, where a MACS peak numbered 31638 is located in the first intron of SLC7A14. The panels (
According to motif analysis with MEME-ChIP of top 100 genes based on FE, the most significant consensus sequences in ESC were characterized by a well-defined REST motif (RE1) composed of 5′AG[CG]ACCA[TC] GGACAG3′ (80 sites; E-value = 5.1 × 10–254) and a left half-site of RE1 composed of 5'[TG]TTCAGCAC[CT]3′ (21 sites; E-Value = 2.9 × 10–11) (Fig. 3A, B). In contrast, the most significant consensus sequences in ESC-derived neurons represented a poorly defined non-RE1 motif composed of 5′[GA][AGC][GA][AG][ATG][GA]G[CA]GG[C/G][AG]G[CGT][GA][GA][CAG][AG][GC][TA][GA][GA][GA][AG][CAG][GA][GC]A3′ (63 sites; E-Value = 6.7 × 10–17) and an RE1-like motif composed of 5′[CG][CT][TG]TC[AT][GT]CACCG[CT][GA]GACAGC[GA]C[CT][ACT][TG]3′ (11 sites; E-Value= 1.7 × 10–4) (Fig. 3C, D). These results suggest that NRSF binds chiefly to the RE1 consensus sites in ESC by direct binding, whereas NRSF often binds to the promiscuous non-RE1 sites in ESC-derived neurons possibly by indirect binding.

Identification of NRSF binding consensus motif sequence. The consensus motif sequence was identified by importing a 400 bp-length sequence surrounding the summit of MACS peaks of top 100 binding sites based on FE into MEME-ChIP. The panels (
Molecular pathways of ChIP-Seq-based NRSF target genes in human ESC and ESC-derived neurons.
Next, we studied molecular pathways and networks of ChIP-Seq-based NRSF target genes by using bioinformatics tools of the comprehensive knowledgebase. First, by importing the sets of Entrez Gene IDs of NRSF target genes, DAVID identified statistically significant functionally associated GO terms and related KEGG and PANTHER pathways. The top 3 most significant GO terms, regardless of their categories in ESC, included “neuron projection” (GO: 0043005; P = 1.21 × 10–8), “calcium ion binding” (GO: 0005509; P = 1.21 × 10–7), and “hemophilic cell adhesion” (GO: 0007156; P = 3.72 × 10–6). In ESC, only one KEGG pathway, termed “Neuroactive ligand-receptor interaction” (hsa04080; P = 0.0343; Fig. 4), and only one PANTHER pathway, termed “Wnt signaling pathway” (P00057; P = 0.0399), were considered to be significantly relevant to NRSF target genes. In contrast, the top 3 most significant GO terms, regardless of their categories in ESC-derived neurons, included “nuclear lumen” (GO: 0031981; P = 8.72 × 10–8), “intracellular non-membrane-bounded organelle” (GO: 0043232; P = 5.41 × 10–7), and “non-membrane-bounded organelle” (GO: 0043228; P = 5.41 × 10–7). In ESC-derived neurons, only one KEGG pathway termed “Ribosome” was significant (hsa03010; P = 3.53 × 10–5) and none of the PANTHER pathways reached the statistical significance after Bonferroni correction.

KEGG pathway of ChIP-Seq-based NSRF target genes in ESC. Entrez Gene IDs of 2,172 ChIP-Seq-based NRSF target genes in ESC were imported into the Functional Annotation tool of DAVID. It extracted statistically significant KEGG pathways relevant to the set of imported genes. KEGG pathway termed “Neuroactive ligand-receptor interaction” (hsa04080; P = 0.0343) is shown. NRSF target genes are highlighted by orange.
Finally, we imported the list of Entrez Gene IDs of NRSF target genes into IPA. It extracted the network defined as “Cell-To-Cell Signaling and Interaction, Cell Signaling, Nervous System Development and Function” (P = 1.00 × 10–39) and “Endocrine System Disorders, Gastrointestinal Disease, Hereditary Disorder” (P = 1.00 × 10–39) in ESC, and “Cell-mediated Immune Response, Cellular Movement, Developmental Disorder” (P = 1.00 × 10–71) in ESC-derived neurons as the most significant functionally associated molecular network. IPA also identified canonical pathways relevant to the set of imported genes (Table 3). The top 5 most significant pathways in ESC included “Corticotropin releasing hormone signaling” (P = 7.05E-08), “Role of NFAT in cardiac hypertrophy” (P = 1.48 × 10–7), “CREB signaling in neurons” (P = 3.64 × 10–7; Fig. 5), “Axonal guidance signaling” (P = 4.24 × 10–7), and “Neuropathic pain signaling in dorsal horn neurons” (P = 2.02 × 10–6). In contrast, the top 5 most significant pathways in ESC-derived neurons were composed of “EIF2 signaling” (P = 1.98 × 10–8), “Regulation of eIF4 and p70S6K signaling” (P = 1.10 × 10–4), “mTOR signaling” (P = 2.07 × 10–4), “ERK5 signaling” (P = 4.74 × 10–4), and “ERK/MAPK signaling” (P = 7.46 × 10–4). When the top 5 IPA pathways were compared between both cell types, a very small number of NRSF target genes, such as EP300, EIF4 A2, RPS17, RPL10, and H3FB, overlapped between ESC and ESC-derived neurons (underlined in Table 3). These results suggest that the signaling pathways composed of NRSF target genes differ between cell types, and the pathways pivotal for maintenance of neuronal function are enriched in ESC, whereas the pathways in ESC-derived neurons involve more general functions, such as protein translation control and cell survival.

IPA pathways of ChIP-Seq-based NSRF target genes in ESC. Entrez Gene IDs of 2,172 ChIP-Seq-based NRSF target genes in ESC were imported into the Core Analysis tool of IPA. It extracted statistically significant canonical pathways relevant to the set of imported genes. The third rank pathway termed “CREB signaling in neurons” (P = 3.64 × 10–7) in Table 3 is shown. NRSF target genes are highlighted in red.
Discussion
NRSF acts as a transcriptional repressor that restricts neuronal gene expression to differentiated neurons by silencing expression of target genes in nonneuronal cells.1,2 Most previous studies have focused on a role of NRSF in rodent cells.3–7 We characterized a comprehensive set of NRSF target genes relevant to human neuronal functions by analyzing ChIP-Seq datasets of NRSF binding sites in human ESC and the corresponding ESC-derived neurons, retrieved from the public database. We identified totally 2,172 NRSF target genes in ESC and 308 genes in ESC-derived neurons based on stringent criteria, with a small overlapping set of 40 genes, in which ubiquitin C-interacting proteins are accumulated. Importantly, the protein level of NRSF is regulated by ubiquitin-mediated proteolysis in a β-TRCP-dependent manner during neuronal differentiation. 21 These observations suggest that NRSF, along with a subset of NRSF targets, is processed for degradation by the ubiquitin-proteasome system (UPS), leading to acceleration of neuronal differentiation. Furthermore, the list of NRSF target genes in human ESC included many known targets identified in various cell types, such as NRXN3, L1CAM, LHX3, CY11B2, SYP, NEUROD1, FGF14, SCN3B, ST8SIA3, RASL10B, WDR25, SYN1, HCN1, DCX, CRH, KIRREL3, PTPRT, HES1, ONE-CUT1, CACNG2, NTRK3, CALB1, BDNF, NGDN, GRIN2 A, LHFPL5, RXFP3, HNF4 A, STMN2, GPR123, ACTA1, GLRA2, NPAS4, and VIP.2,20,22
A previous ChIP-Seq study identified 985 NRSF target genes in Jurkat human T cells. 20 Surprisingly, they are highly enriched in the GO terms related to synaptic transmission and nervous system development despite lymphoid cell types. 20 We analyzed the molecular pathway of 405 overlapping genes between human ESC and Jurkat T cells by using IPA. The program extracted “Corticotropin Releasing Hormone Signaling” (P = 2.79 × 10–6), “Glutamate Receptor Signaling” (P = 2.18 × 10–5), “Calcium Signaling” (P = 2.84 × 10–4), “Synaptic Long Term Depression” (P = 5.25 × 10–4), and “Circadian Rhythm Signaling” (P = 5.41 × 10–4) as the top 5 significant pathways. These observations suggested that the pathways involved in neuronal functions are enriched in NRSF target genes, and they are shared substantially between ESC and T lymphocytes.
We found that NRSF binding sites are located in various genomic regions, among which the intron region serves as the most major site in both ESC and ESC-derived neurons except for intergenic regions, which was consistent with previous observations. 23 Importantly, NRSF effectively inhibits transcription of target genes, regardless of the location or the orientation of its binding sites in 5′UTRs, introns, exons, upstream of the transcription start, and downstream of the coding region distant from the enhancer and the promoter. 24 Furthermore, only a subset of NRSE sequences are constitutively occupied by NRSF in non-neuronal cells, and the occupancy depends largely on endogenous levels of NRSF. 22
The canonical NRSE site reflects a 21-bp consensus sequence composed of 5′TTCAGCACCACGGACAGCGCC3′, which is divided into the left half-site and the right half-site separated by a spacer of two nucleotides. 23 However, NRSF binding sites with highly degenerate sequences are often found throughout the genome. 25 Through motif analysis, we found that most NRSF binding sites in ESC are present in the most significant RE1 sites (E-value = 5.1 × 10–254), while they are located mostly in non-RE1 sites and the minority is located in less significant RE1 sites (E-value = 1.7 × 10–4) in ESC-derived neurons. The possibility exists that the lower expression levels of NRSF in ESC-derived neurons may make it difficult to detect specific ChIP-Seq peaks. However, a highly sensitive monoclonal anti-NRSF antibody was utilized to detect a trace amount of NRSF protein for both ESC and ESC-derived neurons. Therefore, our observations suggest that NRSF binds chiefly to RE1 sites in ESC by direct binding, whereas it often binds to promiscuous non-RE1 sites in ESC-derived neurons by indirect binding via an interaction with other DNA binding proteins, as described peviously. 23 Notably, noncanonical binding sites are often highly polymorphic, where the left and right half-site motifs are separated by variable-length spacer sequences.25,26 Furthermore, numerous ChIP-Seq-based NRSF binding sites contain only partial motifs composed of a single half-site. 23 NRSF is composed of a DNA binding domain located within the cluster of eight zinc fingers, and two repressor domains at the N- and C-termini, which are capable of interacting with corepressor complex components. 3 Importantly, as neuronal progenitor cells differentiate into neurons, NRSF and its corepressor complex gradually dissociate from RE1 sites, triggering activation of neuronal gene expression. 27 These observations may explain why only a limited subset of NRSF target genes overlapped between human ESC and ESC-derived neurons.
By molecular pathway analysis, we identified the pathways termed “Neuroactive ligand-receptor interaction”, “Wnt signaling pathway”, “CREB signaling in neurons”, and “Axonal guidance signaling”, all of which are closely related to neuronal functions, as the pathways relevant to NRSF target genes in ESC. Among these, downregulation of CREB signaling may represent a highly important function of NRSF in non-neuronal cells. CREB is promptly activated in response to a wide range of extracellular stimuli, such as growth factors, peptide hormones, and neuronal activity, which activates protein kinase A (PKA), mitogen-activated protein kinases (MAPKs), and Ca2+/calmodulin-dependent protein kinases (CAMKs). 28 Phosphorylated CREB upregulates a panel of genes essential for neuronal differentiation and plasticity, including BDNF, TrkB, somatostatin, and c-fos. 28 NRSF inhibits the CREB signaling on activation of cocaine and amphetamine regulated transcript (CART) in SK-N-SH neuroblastoma cells, being in line with our observations. 29 Furthermore, NRSF expression is positively regulated by CREB in small-cell lung cancer (SCLC) cell lines, serving as an autoregulatory loop in the NRSF-CREB signaling pathway. 30 Importantly, NRSF is upregulated in Alzheimer's disease (AD) brains, 11 and CREB signaling is deregulated in the hippocampus of AD brains. 31 With respect to the “Axonal guidance signaling” pathway of NRSF target genes, overexpression of NRSF in differentiated neurons causes axonal guidance errors in the developing check spinal cord. 32
In contrast to the molecular pathways in ESC, we identified the pathways linked to regulation of protein synthesis termed “Ribosome”, “EIF2 signaling”, “Regulation of eIF4 and p70S6K signaling”, and “mTOR signaling” as the major pathways relevant to NRSF target genes in ESC-derived neurons. They converge on the PI3K/Akt/mTOR signaling pathway pivotal for cellular proliferation and survival in all cell types. 33 These observations suggest that NRSF target genes play a key role in maintenance of neuronal functions, which are suppressed in ESC, whereas they are not restricted to neuron-specific functions in ESC-derived neurons. Notably, NRSF negatively regulates WNT signaling pathway, providing a mechanism responsible for maintenance of pluripotency in ESC. 34
Increasing evidence indicates that aberrant regulation of NRSF plays a pivotal role in the pathogenesis of cancer, cardiac hypertrophy, ischemia, epilepsy, and neurodegeneration.9–11,35–37 NRSF acts as an essential regulator of maintenance of normal cardiac structure, since transgenic mice expressing a dominant-negative mutant NRSF in hearts exhibited dilated cardiomyopathy. 38 Through IPA analysis, we found the pathway termed “Role of NFAT in cardiac hypertrophy” as the second-ranked significant pathway for NRSF target genes in ESC (Table 3). In Huntington's disease (HD) brains, the inability of mutant huntingtin (HTT) to retain NRSF in the neuronal cytoplasm inhibits the expression of NRSF target genes essential for neuronal functions. 9 We identified both NRSF itself and HTT as potential NRSF target genes in human ESC, because they have degenerate RE1 motifs of NRSF binding sites (Supplementary Fig 2 for NRSF/REST and Supplementary Fig 3 for HTT). These observations suggested the existence of a complex autoregulatory loop in the network of the HTT-NRSF axis.
Conclusions
Using genome-wide data mining from ChIP-Seq datasets, we identified 2,172 NRSF target genes in human ESC and 308 NRSF target genes in ESC-derived neurons. The binding regions were accumulated in the RE1 sites in human ESC, whereas they were located mainly in non-RE1 sites in ESC-derived neurons. The molecular pathways of NRSF target genes were relevant to the maintenance of neuronal functions in ESC, whereas they were not limited to neuron-specific functions in ESC-derived neurons. Remarkable differences in ChIP-Seq-based NRSF target genes and pathways between ESC and ESC-derived neurons suggested that NRSF-mediated silencing of target genes is highly effective in human ESC but not in ESC-derived neurons.
Author Contributions
JS designed the methods, analyzed the data, and drafted the manuscript. NK and YY helped with the data analysis. All authors have read and approved the final manuscript.
DISCLOSURES AND ETHICS
As a requirement of publication the authors have provided signed confirmation of their compliance with ethical and legal obligations including but not limited to compliance with ICMJE authorship and competing interests guidelines, that the article is neither under consideration for publication nor published elsewhere, of their compliance with legal and ethical guidelines concerning human and animal research participants (if applicable), and that permission has been obtained for reproduction of any copyrighted material. This article was subject to blind, independent, expert peer review. The reviewers reported no competing interests.
Footnotes
Acknowledgements
The authors thank Ms. Aki Takaoka for her invaluable help.
Supplementary Data
