Applications of Functional Genomics for Drug Discovery

Abstract

Japanese

Korean

Chinese

Many diseases, such as diabetes, autoimmune diseases, cancer, and neurological disorders, are caused by a dysregulation of a complex interplay of genes. Genome-wide association studies have identified thousands of disease-linked polymorphisms in the human population. However, detailing the causative gene expression or functional changes underlying those associations has been elusive in many cases. Functional genomics is an emerging field of research that aims to deconvolute the link between genotype and phenotype by making use of large -omic data sets and next-generation gene and epigenome editing tools to perturb genes of interest. Here we review how functional genomic tools can be used to better understand the biological interplay between genes, improve disease modeling, and identify novel drug targets. Incorporation of functional genomic capabilities into conventional drug development pipelines is predicted to expedite the development of first-in-class therapeutics.

Keywords

gene editing RNAi shRNA CRISPR epigenetics genomics cell-based assays

Introduction

A major challenge facing pharmaceutical research and drug development is the high attrition rate of therapies in clinical development. This has led to the enormously high costs for bringing new drugs to market, with a recent estimate of about $2.5 billion per new drug approval.¹ When assessing the driving factors behind high attrition rates, it was reported that the majority of drug failures are due to toxicity and lack of efficacy.² Further exemplifying the reduction in R&D productivity, many new drugs that do gain regulatory approval have limited commercial success due to a failure to significantly differentiate from the standard of care.^1,2 There has also been a reduction in both the proportion of late-stage targets in the pipeline that are classified as first-in-class and the percentage of approvals considered first-in-class.¹ This decline in innovation is highlighted by annual peak pharmaceutical sales decreasing by almost 50% in recent years.^1,3 Although 2018 is considered a blockbuster year for the number of drug approvals by the Food and Drug Administration (FDA), a large portion of these approvals are for orphan or rare oncology indications, where despite the clinical impact of these drugs, the commercial potential is projected to be minimal.⁴ A recent analysis examining the drug discovery process over the last 60 years highlights that despite tremendous improvements in technology, pharmaceutical research has been plagued with deficiencies in reproducibility and efficiency.⁵ Pulling together the high attrition rates, reduced differentiation, and issues with reproducibility, a clear question emerges for drug discovery: How can biotech and pharmaceutical companies identify and de-risk new first-in-class drug targets that will successfully translate into the clinic?

Companies are approaching this problem by expanding the toolsets used to identify new targets, with greater emphasis on human genetics and functional genomic technologies as a means to improve understanding of the molecular mechanisms driving disease.^2,6 Functional genomics is a broad term that covers the investigation of biochemical, cellular, or physiological properties of gene products to understanding the relationship between genotype and phenotype.⁷ Functional genomics is used to better understand various processes related to genomic sequence, gene expression, and encoded protein function, including the study of coding and noncoding transcription, protein translation, and interactions between proteins, DNA, and RNA species. Here we review the recent history and cutting-edge tools for functional genomics and outline how these approaches are being used to improve the drug development process.

Tools for Functional Genomics

The first robust tool for making site-specific perturbations to the transcriptome was RNA interference (RNAi).⁸ RNAi is a multistep process that results in targeted degradation and subsequent repression of specific mRNA transcripts ( Fig. 1 ). Upon delivery of double-stranded RNA (dsRNA) moieties to cells, the dsRNA is recognized by the enzyme Dicer and processed into 21–23 bp small interfering RNAs (siRNAs).⁹ These siRNAs are incorporated into the RNA-induced silencing complex (RISC).¹⁰ Through homologous base pairing of the siRNA, the RISC nuclease identifies and degrades target RNA sequences.¹⁰ siRNAs can be generated by chemical synthesis¹¹/in vitro transcription,¹² or expressed from a plasmid in the form of short hairpin RNA (shRNA).¹³ While RNAi has been instrumental in facilitating the study of individual genes,¹⁴ one of its limitations is off-target effects.^15–17 While siRNA targets are identified through homology, the rules for selecting a highly active but specific siRNA have been difficult to develop. Generally, multiple siRNAs need to be constructed and manually tested to identify active sequences that have minimal effects on other off-target transcripts.

Figure 1.

Mechanism of RNAi. Short dsRNA can either be directly delivered to cells or be expressed from a plasmid as shRNAs. The dicer enzyme processes the dsRNA into ssRNA. ssRNA associates with the RISC complex. Through sequence homology, the RISC complex binds and subsequently degrades the targeted mRNA sequence.

Recent advances in genome- and epigenome editing tools now allow researchers to readily make site-specific perturbations to the transcriptome, genome, and epigenome. While a reduction in mammalian gene expression has been facilitated through RNAi-based technologies for the last 18 years,⁸ gene editing platforms provide expanded functionality, including, but not limited to, gene repression. Gene editing enzymes induce a site-specific double-stranded break at the loci of interest ( Fig. 2 ). By manipulating the endogenous DNA repair mechanisms in the cell, site-specific changes to the DNA sequence, including deletions, insertions, and replacements, can be introduced at the cut site ( Fig. 2A ). Through nonhomologous end joining (NHEJ), cells repair the cut site though a mechanism that may result in insertions or deletions (indels). Indels can be used to induce frameshifts to knock out gene expression, mutate regulatory elements responsible for transcription factor binding, or manipulate splicing signals. The identity of the indels should be carefully monitored as in-frame or silent mutations may occur that do not result in the desired outcome. Alternatively, following a double-stranded break, cellular repair can be skewed toward homology-directed repair (HDR) by delivering a donor repair template with homology to the 5′ and 3′ ends of the double-stranded break.¹⁸ By introducing sequence modifications between the donor homology arms, targeted changes can be made to the genome upon repair, such as producing natural variant alleles to model disease or introducing a fusion tag to track the protein product of a gene. Generally speaking, HDR is less efficient than inducing an indel and the efficiency of HDR-mediated repair correlates inversely with the size of the donor (i.e., the larger the donor, the less efficient the repair). HDR occurs more readily in actively dividing cells, and therefore certain terminally differentiated cell types have limited HDR capacity. Improving HDR efficiencies through techniques such as engineering aspects of the donor template¹⁹ and small-molecule treatments²⁰ is an active area of research.

Figure 2.

Induction and manipulation of DNA double-stranded break repair for genome editing applications. (A) Two main mechanisms of DNA double-stranded break repair utilized for gene editing applications. (B) The three most common platforms using to induce double-stranded breaks are ZFNs, TALENs, and CRISPR/Cas9 nucleases.

While the principle of utilizing double-stranded DNA break repair for gene editing applications has been around since the 1990s,^21–23 the first gene editing tools were quite complex and not accessible to most research laboratories. Before the discovery of clustered regularly interspaced short palindromic repeats and associated endonucleases (CRISPR/Cas9), the most widely used platforms for gene editing were zinc finger nucleases (ZFNs)²⁴ and transcription activator-like effector nucleases (TALENs)^25–28 ( Fig. 2B ). These nucleases are constructed by fusing the ZF or TALE DNA binding domains to a modular nuclease domain, such as Fok1.^24,29 Fok1 requires dimerization to induce a double-stranded break and, therefore, both platforms necessitate construction of two unique proteins. ZFs²⁴ and TALEs^25–28 confer DNA binding specificity based on a protein–DNA interaction. Each ZF recognizes a specific triplicate of DNA, while each TALE domain recognizes a single DNA base. By stringing together multiple ZF or TALE monomers, researchers create a sequence-specific DNA binding domain. While the rules governing the design and use of ZFNs and TALENs have become more straightforward over time, both platforms necessitate the construction of a new pair of proteins for each new genomic target site of interest, limiting high-throughput screening applications. Furthermore, labs utilizing these platforms must be proficient in molecular cloning or have the resourcing to have constructs externally synthesized, thus limiting widespread accessibility.

The field of gene editing became far more accessible to the general scientific community with the discovery of CRISPR/Cas9 for gene editing in mammalian cells.^30,31 In contrast to earlier platforms, the specificity of the Cas9 nuclease is conferred by a RNA–DNA interaction ( Fig. 2B ). The Cas9 protein complexes with a short guide RNA (gRNA) sequence and then, via homologous base pairing, the gRNA binds to the target site of interest and Cas9 induces a double-stranded break. Instead of designing a new gene editing protein for each locus of interest, researchers can now use the same Cas9 protein and control cutting specificity by exchanging the short ~100-base gRNA sequence (17–20 bp crRNA plus the tracrRNA). With the ever-decreasing prices of DNA and RNA synthesis, it has become affordable for both academic and industrial researchers to construct or order panels of short gRNAs. The vectors required for such work are available through Addgene³² for academic researchers, and many vendors will even provide validated off-the-shelf reagents ready for use.

In contrast to gene editing, where permanent changes are made to the DNA sequence, epigenome editing involves modifying the chromatin structure and proteins recruited to a specific locus to influence gene expression. Recent advances have led to the development of engineered epigenome editing tools capable of modulating both gene expression and chromatin state ( Fig. 3 ). These synthetic transcription factors are constructed through fusion of a modular DNA binding domain from the same proteins used for gene editing (most commonly TALEs, ZFs, or deactivated Cas9 [dCas9])^33,34 to an activation domain (such as VP16, VP64, or p65),^35–38 repression domain (such as KRAB or SID),^39,40 or chromatin modifying domain (such as p300, Tet1, or LSD1)^41–43 ( Fig. 3 ). By creating two amino acid substitutions in Cas9 (D10A and H840A), the two nuclease domains are mutated, creating a dCas9 capable of acting as a modular DNA binding domain analogous to ZFs and TALE proteins.³⁰

Figure 3.

Tools for epigenome editing. (A) Epigenome editing tools are constructed by fusing a modular DNA binding domain to an effector domain of choice. By localizing specific effector domains to specific genomic loci, researchers can induce targeted modifications to chromatin structure and gene expression. (B) Commonly used DNA binding domains include ZF proteins, TALEs, and CRISPR/dCas9. (C) There are multiple commonly used effector domains that induce gene activation, gene repression, and chemical modifications to the chromatin state.

The use of dCas9 fused to epigenetic regulators has vastly increased the flexibility and applicability of CRISPR/Cas9, particularly from a drug discovery perspective. Without altering the underlying DNA sequence, this CRISPR/Cas9 platform enables researchers to disable (CRISPR-knockout [CRISPR-KO]), turn on (CRISPR activation [CRISPRa]), or decrease (CRISPR inhibition [CRISPRi]) gene expression from single or multiple genomic loci. The platform versatility provides new avenues for studying and modeling both monogenic and complex diseases. Furthermore, the CRISPR/Cas9 platform allows for cost-effective high-throughput screening on endogenous gene regulation.

Improved Disease Understanding: Genetic Variants and Human Health

Functional genomic tools are critical for mechanistically linking genetic variation to health. For decades, geneticists have used candidate gene approaches to elucidate the function of individual genes associated with rare hereditary disorders. While the study of human monogenic disorders has provided many drug targets, these diseases are typically rare.^44,45 Following the conclusion of the Human Genome Project in 2001–2003^46,47 ( Fig. 4 ), genome-wide association studies (GWAS) have genotyped patient samples and collected matched health information. These large data sets have been used to uncover thousands of genetic variants linked to disease.⁴⁸ Well-defined GWAS disease-linked variants represent a valuable resource of evolutionarily tested perturbations that can be incorporated into drug discovery efforts. Coding variants linked to a disease phenotype often implicate novel gene targets and pathways that could yield both differentiated and first-in-class therapeutics. Alternatively, disease-protective alleles identified in the population provide strong evidence to de-risk a drug target. The presence of protective variants in apparently healthy individuals can help frame expectations of the efficacy and toxicity of drugs targeting the variant gene or its regulatory machinery in humans.⁴⁹ For example, loss of proprotein convertase subtilisin/kexin type 9 (PCSK9) function in some healthy individuals is associated with reduced low-density lipoprotein (LDL) cholesterol and heart disease risk, while gain-of-function mutations in PCSK9 are associated with hypercholesterolemia.^50,51 Together, these findings prompted rapid drug discovery efforts focused on inhibition of PCSK9, resulting in well-tolerated and efficacious monoclonal antibody therapies for treating high cholesterol.^51–53

Figure 4.

Timeline of select functional genomic advances in the post-human genome assembly era.

GWAS efforts have found that common human traits or diseases are usually highly polygenic, with individual genetic variants explaining little of the overall trait variance or the risk of developing disease.⁵⁴ Interestingly, these same techniques have uncovered cases of rare variants with strong effect sizes, either substantially increasing risk or conferring protection from disease, but in small patient populations or individual families.⁵⁵ As functional genomic techniques become more accessible to researchers, we will gain further understanding of the molecular mechanisms behind these observations. The relatively small number of well-annotated GWAS loci indicate that rare variants with strong effects may represent the extreme, where a disease-linked gene exhibits complete loss or gain of function. For example, there are more than 100 loci associated with obesity and type 2 diabetes, with many alleles contributing modest risk, and rare loss-of-function variants in the adenylate cyclase 3 (ADCY3) gene confer a severe obesity phenotype.^56,57 This finding led to the pursuit of ADCY3 enhancers as antiobesity medicines.^58,59 In another example, variants that reduce function of the immune receptor encoded by the triggering receptor expressed on myeloid cells 2 (TREM2) gene are associated with an increased risk of Alzheimer’s disease,⁶⁰ while complete loss of TREM2 signaling has been shown to cause rare Nasu–Hakola disease that includes progressive early-onset dementia.⁶¹ Together, these findings indicate that enhancing TREM2 signaling is a potential therapeutic strategy in neurodegenerative disease and therefore is an active area of research.

As compared with exonic variants that clearly modify the function of a particular gene, the majority of GWAS discoveries fall in the noncoding region of the human genome in putative gene regulatory elements. These putative regulatory elements are often defined with active enhancer histone marks that may be cell type specific.^62–64 A major challenge of modern functional genomics is how to mechanistically link specific noncoding variants with gene regulation and the associated disease processes. To date, functional genomic consortia, such as ENCODE and Roadmap Epigenomics, have struggled to do this at scale due to the myriad possible gene regulatory mechanisms involved.⁶⁵ One scalable approach is the identification of expression quantitative trait loci (eQTLs), the systematic association of genetic variants with variation in gene expression levels. As large microarray and RNA-seq data sets that sample across many individuals have become available, eQTL mapping efforts have identified thousands of cis- and trans-acting loci across diverse tissue types for most human genes.^66,67 Similar quantitative association techniques have been applied to smaller data sets to study protein abundance, DNA methylation status, and chromatin states.^68–70 eQTL approaches help establish a link between disease-associated variants and particular genes or genomic features, but alone often fall short of detailing molecular mechanisms indicating proteins or pathways suitable for pharmacological intervention.

The availability of high-throughput sequencing has facilitated widespread use of genome-wide biochemical assays to characterize the genomic landscape surrounding disease variants. Long-range chromatin interactions can be studied genome-wide using techniques such as “Hi-C,” which captures proximal genomic regions in a sequencing library by dilute ligation reactions. These tools have helped confirm that the human genome is organized into hundreds of kilobase to megabase size topologically associating domains (TADs), often bounded by insulating CCCTC binding factor (CTCF) protein binding sites.^71,72 TADs are thought to provide a microenvironment for gene and gene regulatory element sequences to move around and establish long-range contacts. Enhancer looping within TADs reinforces basal promoter activity and explains why sets of genes located within the same TAD are often co-regulated or developmentally linked.^73,74 In addition, functional genomic consortia have discovered millions of putative gene regulatory elements that are cell or tissue dependent.^64,75 These regulatory elements are defined by transcription factor binding, active histone modifications, and increased local chromatin accessibility. The mapping of the human genomic regulatory landscape has set the stage for interrogation of molecular mechanisms underlying disease-associated loci.

Individual loci with multiple disease-associated single-nucleotide polymorphisms (SNPs) in linkage disequilibrium may indicate altered transcription factor binding sites, perturbation of noncoding RNAs, splicing changes, disruption of local chromatin structure, or altered enhancer looping.^76–79 There has been an increased focus on using functional genomic tools to deconvolute complex GWAS loci. Miller et al. provide an early example of integrating modern functional genomic techniques and analyses to connect regulatory variants to gene function in the context of coronary artery disease.⁸⁰ By integrating genomic, epigenomic, and transcriptomic profiling of cells and tissues, the authors describe how particular regulatory variants influence disease gene expression profiles. However, even after noncoding variants are connected to the regulation of a particular gene, it still may be unclear how the encoded protein or RNA from that gene influences key disease biology. Fortunately, the toolbox to fill this gap is also expanding, enabled by improved human genomic annotation, high-throughput sequencing, proteomics, and bioinformatic insights. Claussnitzer et al. provide an impressive example of leveraging many of these advances to understand the connection between a particular noncoding SNP and obesity risk.⁷⁷ The authors demonstrate that SNPs disrupting the binding site of ARID5B, a transcriptional repressor, results in increased expression of IRX3 and IRX5 genes and a shift from energy-dissipating beige adipocytes to energy-storing white adipocytes associated with obesity.

In addition to inherited disease risk, de novo or somatic mutation has emerged as a secondary source of genetic variation underlying disease. Cancers are increasingly classified by a molecular taxonomy of their mutation burden and driver genes.⁸¹ This work has improved patient outcomes by facilitating the development of specific mutation targeted therapies.⁸² While certain acquired somatic mutations have been linked to oncogenic pathways for a number of years, there is a recent accumulation of evidence that somatic mutation is involved in other disease types as well, such as neurological and autoimmune conditions.^83,84 For instance, some focal epilepsies appear to be driven by somatic mutations impacting a localized lineage of neurons and glia in the brain.⁸⁵ As another example, a subset of autoimmune diseases may be linked to somatic mutation generating autoantigens that are recognized by the adaptive immune system as foreign.^86,87 Somatic mutations may explain the apparent tissue specificity, late onset, or unusual presentation of particular conditions.

Functional genomic tools are increasingly being used to investigate both somatic and heritable mutation-driven disease in various cell and animal models. In particular, CRISPR/Cas9-based genome engineering has emerged as the tool of choice to introduce mutations into endogenous genomic loci, including at particular developmental time points.^88–90 Functional genomic tools such as these are increasingly being used to better study these complex mutation-associated phenotypes and rapidly improving the way we model and study disease.

Applications of Functional Genomics in Disease Modeling

Historically, researchers have relied on model organisms to study human disease. While these studies have made important contributions toward understanding key pathways, many animal models are unable to fully recapitulate complex human disease biology. For example, a single rodent model of Alzheimer’s disease is unable to exhibit the full spectrum of human disease pathologies, including the accumulation of amyloid beta, tau tangles, and extensive neuronal loss.⁹⁰ This is likely due to differences in genetic drivers as well as neural network development, connectivity, and complexity between model organisms and humans.⁹¹ Furthermore, it can be difficult or cost-prohibitive to produce the multiplicity of animal models that would demonstrate the diversity of a given human disease phenotype. For example, there are multiple mutations associated with Alzheimer’s disease with different severity, time of onset, and pathologies. It is currently unrealistic and impractical for most researchers to construct and/or study multiple animal models simultaneously to get a holistic evaluation of disease biology.

For next-generation therapies including antibody, RNA, or gene editing approaches, it is important that the therapies are evaluated in a human background because these reagents may not exhibit the same specificity toward the gene, transcript, or protein in another species. While the genomes of nonhuman primates and humans are 92% conserved,^92,93 small changes in the genome, transcriptome, and proteome can greatly affect efficacy, off-target effects, and subsequent toxicity. With the advent of induced pluripotent stem cell (iPSC) technology, in principle, scientists can engineer virtually any cell type/tissue of interest from an unlimited cell source. However, in practice, these engineered tissues are often lacking in some of the transcriptomic, epigenomic, and phenotypic hallmarks of mature tissue. Some protocols achieve a mature cell phenotype or tissue formation, but the cultures are often impure or the long timescales required are not amenable for routine use. These limitations increase the cost and lengthen the timelines of conventional drug discovery where candidate therapeutics are screened in iPSC models.

By pairing in vivo animal model studies with robust human-derived in vitro cell models, scientists will gain a more complete understanding of human disease biology, therefore enabling the development of effective therapeutics. Functional genomic tools have had a large impact on the quality and efficiency of generating novel animal models, though these tools are only starting to address some of the current limitations of human-derived in vitro disease modeling. In this section, we discuss the current status of disease models and how functional genomic tools are being used to improve animal model generation, model the genetic variants of human disease, improve the quality of iPSC-derived disease models, and recapitulate mature tissue transcriptomic and epigenetic profiles.

Generating Novel Animal Models

Functional genomic tools have greatly expedited the process of generating novel animal models. As of 2005, most transgenic mice were generated through the injection of genetically modified mouse embryonic stem (mES) cells into wild-type mouse blastocysts.⁹⁴ Through homologous recombination, a stable mES cell line is generated containing the desired genetic mutation. Upon injection into the blastocyst, the mES cells contribute to the germline of the animal. The blastocytes are then implanted into a host mother. The resulting chimeric animals are bred to generate a homozygous model with the desired genetic modification.⁹⁴ In best-case scenarios, these methods take about 1–1.5 years to generate a new transgenic strain.⁹⁵

Historically, ES cells were required for generating transgenic animals because gene targeting technologies were not efficient enough to directly induce genetic modifications in mouse embryos. Conventional gene targeting technologies relied on the delivery of donor DNA constructs where the desired mutation is straddled between two DNA sequences that have homology to the target genomic site. With initial studies only achieving a targeting efficiency of ~1/1000 cells, a large number of ES clones needed to be screened for the desired mutation before being injected into the blastocysts for transgenic animal generation.⁹⁴ As the field of gene editing progressed, it was found that the rate of HDR can be greatly increased by inducing a targeted double-stranded break at the desired integration site. Depending on the desired edit, gene editing tools including ZFNs, TALENs, and CRISPR/Cas9 can be used to edit ES cells at efficiencies of more than 80%.⁹⁶ Currently, these methods are used for generating transgenic animals requiring complex genetic manipulations, for example, when knocking in large DNA segments. In recent years, alternative methods have been developed that further accelerate the process of genome modification by directly injecting DNA or mRNA of site-specific nucleases into single-cell embryos to induce a targeted double-stranded break.^96–100 These protocols make use of pronuclear injections or electroporation of gene editing components directly into the embryos. Similar to ES-derived transgenic methods, direct injections still generally give rise to chimeric animals that are then bred to generate a stable mouse strain. Direct injection of editing tools into embryos skips the laborious process of generating stable ES lines and therefore greatly reduces the timeline for generating a transgenic strain to an average of 6–12 months.^100,101

For many genetic diseases, there are multiple mutations associated with the disease phenotype. Highly efficient gene editing protocols now allow for multiple genetic mutations to be generated simultaneously. It has been reported that up to five mutations can be simultaneously introduced into mouse ES cells or two mutations directly in mouse embryos.⁹⁹ Therefore, rather than sequentially generating compound mutation models or cross-breeding multiple single-mutation strains, these models can be generated in a single project. More robust animal model generation has led to the ability to test new compounds in multiple genetic backgrounds, which will help to determine which mutations are responders to a given treatment, aiding in patient population selection for the resulting therapeutic. In addition, the use of CRISPR/Cas9 technology has allowed the study of disease processes in animal models that were previously out of reach. For example, focal cortical dysplasia is caused by somatic cells that acquire mutations in the brain, leading to dysregulated signaling and epilepsy. This has been difficult to model previously as the mutations only affect a portion of cells in the brain, but with CRISPR/Cas9, researchers can create animals with brain mosaics, thereby mimicking the disease.^102,103

Controlling for Genetic Variability

Genetic variation between individuals is one limitation of modeling human disease using primary cells isolated from patients. On average, the human genome varies by about 20 million bases between unrelated individuals (or 0.6% of the 3.2 billion bases).¹⁰⁴ For complex human diseases, this makes it difficult to experimentally deconvolute the causative sequences or transcriptomic profiles linked to disease phenotypes from passive variation. By studying a panel of mutations or disease states in the same genetic background, it is much easier to link the causative mutation to disease outcome. CRISPR/Cas9 has made it possible to create multiple disease models in the same isogenic background. However, while gene editing is a robust method in most immortalized cells, it can be quite difficult to induce high rates of gene editing in primary cell models, therefore necessitating clonal isolation to obtain a pure population of cells containing the desired edit. This is where the fields of iPSC cultures and gene editing come together for the generation of isogenic disease models. For most diseases, the disease phenotype presents in terminally differentiated cells with limited proliferative capacity, making clonal isolation impossible. Therefore, isogenic disease models must be created in the iPSC stem-cell-like state, before being differentiated into the desired cell type. By creating panels of iPSC isogenic disease models, research labs can now study multiple genetic disease backgrounds in parallel and more easily determine causative relationships between genotype and phenotype^105–107 ( Fig. 5 ).

Figure 5.

Functional genomic tools contribute to robust disease modeling for drug discovery.

One application of isogenic disease models is experimentally validating potential causal disease SNPs identified through GWAS. Loci identified through GWAS tend to have multiple SNPs within a short distance of one another, all in linkage disequilibrium. For example, there are at least 14 SNPs in the GRM3 locus reported to be associated with schizophrenia. Following meta-analysis of the available data, a significant association with schizophrenia was found for three of these SNPs,¹⁰⁸ yet it is unclear which of these three are causative versus merely tightly associated with the disease allele. However, using CRISPR/Cas9 three iPSC-derived neuronal lines can be generated to model the three GRM3 SNPs. The effect of these mutations on the transcriptomic profile of these cells compared with the healthy isogenic control can then be experimentally determined. Since the resulting differences in GRM3 expression are thought to be subtle,¹⁰⁹ isogenic controls would be necessary to reduce baseline variability and gain statistical power.

Another application of isogenic disease modeling is to identify the genes and pathways that are associated with disease-causing mutations to identify new drug targets. By comparing the transcriptomic profile of neurons derived from Parkinson’s disease patients and corrected isogenic controls, downregulation of the transcription factor MEF2 was identified as a mechanistic driver of mitochondrial damage implicated in Parkinson’s disease.¹¹⁰ Furthermore, by screening for compounds that increase MEF2 transcription, the compound isoxazole was identified and shown to have protective effects against mitochondria-induced damage.¹¹⁰ This example demonstrates that isogenic controls help increase the probability of identifying genes implicated in complex disease and how that information can be used to identify new candidate small-molecule therapeutics.

Improving the Quality of iPSC-Derived Disease Models

Functional genomic tools currently allow for relatively simple generation of multiple iPSC-derived disease models. However, the utility of iPSC-based disease models for drug discovery is currently limited by efficiency and the long time frames of current reprogramming methods. The final cultures usually contain a mix of cell types in addition to the target cell type, making downstream data deconvolution difficult. Furthermore, even the best protocols generally produce cultures more closely resembling the fetal or neonatal cellular state rather than the desired mature adult state. For example, neurons derived from human iPSCs fire action potentials as early as 3 weeks postdifferentiation; however, the properties of these early action potentials are relatively immature. Allowing maturation out to day 55, there is a significant improvement characterized by increased sodium and potassium current amplitudes, action potential amplitude, and action potential threshold.¹¹¹ However, the increased time and costs associated with long differentiations are prohibitive for use in drug screening and are thereby generally used solely as a tool for validation.

Defined iPSC differentiation protocols aim to mimic the stages of natural development, where stem cells gradually move from a pluripotent state to a multipotent state and finally into a unipotent terminally differentiated cell type. For example, during neuronal differentiation iPSCs first transition into neural progenitors, and then can be further differentiated into excitatory cortical neurons, inhibitory cortical neurons, midbrain dopaminergic neurons, or motor neurons, depending on the stimuli provided. Inefficiencies at each stage of differentiation drastically decrease the purity and maturity of the final cell product. For many lineages, key markers of the different stages of development have been well characterized through lineage-tracing studies in mice.^112,113 Using this information, researchers can purify iPSC cultures at each stage of development based on known markers. Using functional genomic tools, this approach has been used to successfully generate iPSC-derived chondrocytes¹¹⁴ and skeletal muscle progenitor cells.¹¹⁵ In these studies, CRISPR/Cas9 was used to knock in endogenous reporters for COL2A¹¹⁴ and Myf5/Pax7,¹¹⁵ respectively, to purify the desired cell types from a mixed cell population. The resulting cell cultures are more uniform compared with cultures that do not undergo a purification step. Robust and reproducible differentiation protocols are required in order to successfully use iPSC-derived cell cultures for drug development.

Epigenome editing tools have been successfully used to reprogram cells into a variety of cell types, including iPSCs,^116–118 myocytes,^119,120 and neurons,^119,121,122 demonstrating that these synthetic factors are potent enough to drive changes in cell phenotypes. Transcription factor-driven reprogramming and defined reprogramming protocols share the same general limitations; cultures are impure and long times are required to achieve functional maturity. However, the adaptation of CRISPR/Cas9-based transcription factors for high-throughput screening enables systematic identification of the optimal factors required to improve current reprogramming protocols. With the central hypothesis that current reprogramming protocols are failing to induce the expression of necessary genes to drive sufficient reprogramming, high-throughput genetic screens can be used to identify these missing factors. This approach has been successfully used to identify a combination of ZF-based transcription factors that are able to replace the master transcription factor Oct4 for inducing reprogramming into iPSCs.¹²³ CRISPR/Cas9-based synthetic screens are just starting to be used to identify the necessary genes responsible for controlling differentiation or reprogramming. In the first published example, a genome-wide knockout screen was used to uncover a set of kinases that inhibit the transition of iPSCs into definitive endoderm. Pharmacological inhibition of these kinases leads to an improved generation of definitive endoderm and subsequent differentiation into pancreatic and lung progenitor cells.¹²⁴ By screening on differentiation markers, libraries of gRNAs targeting promoters of genes could be used to identify proteins that enhance existing reprogramming protocols.¹²⁵ Discovery of novel reprogramming factors would therefore help improve culture quality and maturity, enabling drug discovery in more mature and disease-relevant cell types. Once robust and pure single-lineage cultures can be made, this will increase the ease and availability of multitissue organoids that allow for examination of more complex disease biology for drug development.^126–128

Generating Human Disease Models That Recapitulate Mature Transcriptomic and Epigenomic Profiles

DNase-seq/ATAC-seq and RNA-seq allow researchers to comprehensively assess the chromatin and transcriptomic profiles, respectively, of cells and tissues. Application of these assays to iPSC-derived disease models has shown that while iPSC differentiation protocols produce cells that exhibit some of the phenotypic qualities of the desired tissue, there are differences in the transcriptomic¹²⁹ and epigenetic (unpublished data) profiles of these cells compared with mature adult tissues. Reprogrammed cells tend to exhibit “epigenetic” memory, meaning that iPSCs derived from one lineage tend to retain epigenetic marks from the parent cell type.¹³⁰ This epigenetic memory in iPSCs inherited from the parental cell type influences the differentiation capacity and likely the epigenetic profile of the final cell product. For example, iPSCs derived from nonhematopoietic cells (such as fibroblasts) have a reduced capacity to differentiate into blood-forming cells.¹³¹

To efficiently identify new disease targets and drugs, it is important to develop human therapeutics in the context of disease models that accurately reflect the epigenetic and transcriptomic profiles of the relevant tissues. Most iPSC-derived neuronal protocols produce cells that are fetal in nature, meaning that they may not accurately model advanced neurological disorders associated with aging.¹³² One way to partially overcome the immature epigenetic nature of iPSC-derived models is the direct reprogramming of an adult cell type into the desired cell type. By bypassing the pluripotent state, direct reprogramming allows for retention of the epigenetic marks that have accumulated in the parental somatic cell. It has been shown that the epigenetic methylation signatures associated with aging are well conserved when adult fibroblasts are directly differentiated into neurons (correlation of 0.91).¹³³ However, even these cells likely will not exhibit the full transcriptomic and epigenetic profile of adult tissue.¹³⁴ One limitation of this approach is that cells must be directly reprogrammed for each experiment, as somatic cells used for direct differentiation generally have limited proliferation capacity.

Functional genomic tools can induce site-specific genetic and/or epigenetic changes that alter chromatin conformation, transcriptomic profiles, and protein expression. Using a variety of the modular DNA binding platforms discussed previously, different effector domains can be localized to a specific genomic locus to induce changes in DNA sequence or chromatin structure, and thereby influence gene expression profiles.^41,135 Should reprogrammed cell models lack the desired epigenetic signature, these tools can be used to induce the correct epigenetic mark. For example, fragile X syndrome is characterized by a CGG expansion in the 5′ UTR of the gene that promotes methylation and gene silencing of the fragile mental retardation protein (FMRP). An increased number of CGG repeats is generally associated with increased methylation and a more deleterious phenotype. Targeting a CRISPR/Cas9-based demethylase to the locus induces normal levels of FMRP expression and alleviates the phenotype.¹³⁶ Conversely, this also suggests that by using epigenome editing tools, researchers can model the disease by inducing methylation of the promoter rather than needing to generate multiple model cell lines, each with a different number of CGG repeats to model the spectrum of disease. Tools such as these will be instrumental in providing the understanding of disease biology needed to drive the next generation of therapies.

Functional Genomic Screening for Drug Discovery

High-throughput screening is a critical part of drug discovery. Pharmaceutical R&D has traditionally relied on one of two different pharmacological screening approaches: target-based screens and phenotypic screens. Target-based screens require screening of large chemical libraries for activity toward a known disease-associated target. These screens can be done utilizing high-throughput array-based methods, often screening thousands or even millions of compounds for a known target. Phenotypic screening allows for unbiased evaluation of chemical matter looking for an effect on the phenotype(s) of interest. In recent years, there has been increased focus on phenotypic screens as there has been evidence that such targets lead to more successful outcomes in the clinic.⁴ However, phenotypic screening is still fraught with difficulty, predominantly in the target identification stage, which can be both lengthy and costly, as well as potentially unsuccessful. Regardless of target versus phenotype based, pharmacological screens are currently unable to probe the entire set of potential cellular drug targets. There are estimated to be 500–700 unique protein targets currently included in the FDA-approved drug list, though there are approximately 20,000 genes in the human genome, highlighting a lack of robust chemical matter available for targeting the majority of human genes.^137,138 Genetic screens offer the potential to perturb every gene and ask if that perturbation influences the target or phenotype of interest. Additionally, genetic screens provide a new way to capitalize on phenotypic screening while avoiding the drawback of target deconvolution ( Fig. 6 ).

Figure 6.

Comparison of screening paradigms. (A) Potential screening workflow, starting with target identification and subsequent therapeutic compound identification. (B) Screens that can be run in drug discovery with expected outcomes, highlighting aspects that can aid in deciding which approach is appropriate for different needs.

The increased throughput of functional genetic screens in recent years has allowed for unbiased, genome-scale screens to answer fundamental biological questions. This technology initially focused on gene essentiality with clear applications in oncology, but has since expanded through interrogation of increasingly complex phenotypes. RNAi was first used to manipulate mammalian gene expression in 2001.⁸ This technology enabled modulating the transcriptome with simple antisense oligonucleotides to understand the biological effects of genes. It was not long before the use of RNAi was commonplace and large-scale arrayed and pooled screening became possible in mammalian cells with siRNA and shRNA libraries.

The advent of CRISPR/Cas9-based tools for high-throughput functional genomic screens has transformed genetic screening methods. From essentiality screens focused on genes that contribute to cellular viability to more intricate screens identifying drug response or complex phenotypes, CRISPR/Cas9 tools have opened new avenues in drug discovery. Prior to the use of CRISPR/Cas9 in mammalian cells, high-throughput genetic screens were limited by the lack of specificity and effectiveness of shRNA and siRNA mechanisms.¹³⁹ While RNAi was a great advance, the issues of incomplete knockdown and off-target effects limited its broader utility for high-throughput screening.^15–17 CRISPR/Cas9 tools have allowed increased specificity in genomic and epigenomic editing in mammalian cells by acting at the level of DNA rather than RNA. The requirement of complementarity of both the gRNA and target DNA, combined with the need for a protospacer adjacent motif (PAM) sequence, has allowed for better specificity of gene targeting.^140,141 Direct cutting of target DNA with the Cas9 enzyme has allowed for site-specific induction of indels and subsequent gene knockout, avoiding potential issues with incomplete knockdown that can be seen with RNAi. The combined specificity and complete gene knockout with CRISPR/Cas9 has led to fewer false positives and more reproducible hit identification compared with RNAi methods.^142,143 Additionally, the ease of use over other DNA editing technologies such as ZFNs and TALENs has led to the rapid adoption of CRISPR/Cas9 tools in drug discovery.

Implementing CRISPR/Cas9 Functional Genomic Screens

CRISPR/Cas9 screening can be performed in either an arrayed format or, more commonly, in a pooled format. In a pooled screen, a large number of cells are transduced with a pooled library of gRNAs packaged in a lentiviral delivery system that can be combined with a variety of Cas9 effectors to achieve knockout, activation, or inhibition ( Fig. 7 ).¹⁴⁴ Cas9 effectors for knockout, activation, or inhibition can be can be delivered to cells via a variety of methods in diverse cell types, including primary cells.^140,145 Early CRISPR/Cas9 screens used an all-in-one vector to co-express the gRNA and Cas9 from the same plasmid packaged in a lentivirus.¹⁴⁶ Alternatively, Cas9 and the gRNA can be delivered separately, for example, via a stably expressing Cas9 cell line or by transfecting/electroporating in Cas9 mRNA, DNA, or protein.¹⁴⁷ Transduction of a gRNA library containing virus at a low multiplicity of infection (MOI), typically around 0.2 MOI, increases the probability that each cell will only contain one gRNA targeting a specific gene ( Fig. 7 ). In this way, thousands of genes can be manipulated at once in the same population of cells. Coverage of the gRNA library must be maintained throughout the experiment so that there are typically 500–1000 times as many cells as gRNAs in the library. The most recent versions of published human genome-wide gRNA libraries use 4–5 guides per gene for a total of around 80,000–100,000 guides per library.¹⁴⁸ To maintain the 500× coverage of this library, a minimum of 40–50 million cells are cultured per replicate. For a proliferation screen, after transduction of Cas9-expressing cells with the gRNA library, the genetically perturbed mixture of cells is allowed to proliferate over a defined period of time or population doublings. Genes that regulate proliferation can be identified by isolating genomic DNA from the pool of cells at defined time points and identifying changes in the abundance of gRNAs using high-throughput sequencing. gRNAs that decrease in abundance over time in the screen are said to have dropped out and indicate genes that positively regulate, or are required for cell proliferation. Conversely, gRNAs that enrich in the population over time indicate that knockout of those genes leads to a growth advantage. The first genome-wide screens using CRISPR/Cas9 are presented in pioneering papers by Shalem et al. and Wang et al. from Feng Zhang’s and Eric Lander’s laboratories, respectively.^149,150 These early studies highlighted the power of the CRISPR/Cas system and the possibility of conducting forward genetic screens in human cell populations.

Figure 7.

Representative pooled CRISPR screening workflow. Pooled CRISPR screening is typically performed by transducing a large pool of cells with gRNA-containing lentivirus. It is important to maintain the desired gRNA coverage throughout the screen (typically at least 500×), which can mean maintaining a minimum of 40–50 million cells per replicate in genome-wide screens. Virus is given to cells at a low MOI (typically around 0.2) to ensure there is only one gRNA per cell. For an MOI of 0.2, five times the number of cells must be transduced with the viral gRNA library to ensure coverage is maintained after antibiotic selection. In a proliferation screen, cells are collected at time points along the way and finally at the endpoint to monitor changes in gRNA abundance over time. In a phenotypic assay, cells may be stained with an antibody for a particular marker and sorted using FACS for abundance of the marker at the endpoint of the screen. The final phase of the screen involves isolating genomic DNA from all collected samples and PCR amplifying the guide-containing region with barcoded primers. PCR products are then sequenced by next-generation sequencing and the abundance of gRNAs can be compared across conditions or time points.

On/Off-Target Effects of CRISPR/Cas9

As CRISPR/Cas9 technology has developed, a continuing point of discussion has been related to understanding and improving the on- and off-target effects (i.e., efficiency and specificity) of both the gRNAs and Cas9 itself. Initial genome-wide CRISPR/Cas9 screening papers used the “Genome Scale CRISPR Knock-Out” (GeCKO) library of gRNAs, which were selected to minimize off-target effects using a metric that includes the number of predicted off-targets in the genome and the type of mutations (distance from protospacer-adjacent motif and clustering of mismatches).¹⁴⁹ Early studies established that mismatches closer to the PAM are more important for proper DNA binding compared with distal mutations.³⁰ The first high-throughput screens and other focused studies have since uncovered other important features of gRNAs that are critical for specificity. For instance, several studies have explored the sequence bias of gRNAs in genome-wide libraries by measuring the frequency of bases at each position in high- and low-performing gRNAs.^150,151 Other studies have examined the effects of consecutive mismatches.¹⁵² Next-generation libraries make use of more complex gRNA design algorithms and training data for improved specificity and on-target activity.¹⁴⁸ To test the off-target effects of the CRISPR/Cas9 system over the course of several weeks in a pooled screen, Wang et al. conducted an experiment using a two-vector system where Cas9 and a gRNA toward AAVS1 were constitutively expressed for 2 weeks. In this experiment, there was 97% cutting efficiency at the predicted AAVS1 site after 2 weeks compared with <2.5% cleavage at 13 predicted off-target sites.¹⁵⁰ These data gave promise to the specificity and low rate of off-target effects in pooled CRISPR/Cas9 screens over the length of an experiment. gRNA design has continued to be optimized with updated algorithms powered by gRNA cutting efficiency and specificity data, leading to cleaner, more reproducible screens for target discovery.^148,153

Pooled CRISPR-Based Screening for Drug Discovery

Whole-genome CRISPR/Cas9 screening libraries can now be purchased or made for a relatively low cost, and a pooled screen can be performed by one person in only a matter of weeks. Besides the ease of use, the other major advantage of CRISPR/Cas9 pooled screening lies in target identification, avoiding the need for target deconvolution that is often faced by small-molecule phenotypic screens. An unbiased CRISPR pooled screen can target every gene in the genome, allowing the possibility of discovering novel targets and disease biology.

One of the prominent applications of CRISPR/Cas9-based pooled screening to date has been uncovering essential genes and genes that regulate cellular proliferation. Comparison of essential genes across tissue types and individual mutations has revealed the ability to define context-specific dependencies on certain genes.¹⁵⁴ For example, the loss of the tumor suppressor retinoblastoma protein (Rb) is a common occurrence in many cancers; however, identifying cellular vulnerabilities in Rb mutant patients has eluded researchers. CRISPR/Cas9 knockout screening of a pooled gRNA library in Rb mutant small-cell lung cancer (SCLC) cells showed that loss of Rb made these cells uniquely reliant on Aurora B kinase (AURKB) compared with wild-type cells.¹⁵⁵ Furthermore, the dependence of Rb null cells on AURKB was confirmed in xenograft models with AURKB inhibitors. These results indicate a potential therapeutic avenue for SCLC patients harboring an Rb mutation and highlight the use of pooled genetic screening for drug discovery.

Understanding the mechanism of action is critical to the successful development of a drug candidate. A clear example of the application of functional genomic tools in this area is the use of high-throughput genetic screens performed in combination with drug treatment. Screens such as these can be used to understand the mechanism of action of a compound with unknown biology or to uncover genes that confer intrinsic or acquired resistance to a particular drug. Drug resistance is a major obstacle in the clinic, particularly in cancer therapy, that can arise through a wide variety of mechanisms. The use of CRISPR/Cas9 screening has uncovered mechanisms of drug resistance pointing to key genes and pathways that dictate the response to individual compounds.¹⁵⁶ Early evidence for the power of pooled CRISPR/Cas9 screens in drug resistance was shown in a proof-of-principle study using a near-genome-wide gRNA library to identify resistance to 6-thioguanine (6-TG), a nucleotide analog that damages DNA.¹⁵⁰ In this screen, cells were transduced with the gRNA library followed by treatment with a lethal dose of 6-TG. Cells that survived the treatment were then sequenced to identify gRNAs that were enriched in this population. As expected, genes known to be involved in DNA mismatch repair were identified as top hits, validating this approach to identify drug resistance and mechanism of action. Later work by Anderson et al. used pooled CRISPR/Cas9 screening with a targeted gRNA library across multiple KRAS mutant cell lines to identify drug sensitizers.¹⁵⁷ By using low-dose small-molecule inhibitors (~IC₂₅), these screens could identify drug combinations that could promote primary drug action and delay drug resistance, in this case to MEK/ERK inhibitors, in KRAS mutant cancers.¹⁵⁷

Pooled CRISPR/Cas9 screening has also been used in vivo.¹⁵⁸ In the first pooled screen performed in vivo, a genome-wide lentiviral library of gRNAs were given to cells in vitro and then transplanted into mice in a xenograft model, and lung metastases were sequenced to determine mediators of metastatic disease.¹⁵⁹ In another example, a small library of gRNAs were packaged into AAV and delivered directly to the brains of immunocompetent mice to uncover the role of genes that are frequently found mutated in glioblastoma multiforme (GBM) patients.¹⁶⁰ This screen identified several driver mutations and co-occurring mutations in GBM in vivo models that correlated with genomic data seen in patients. Future studies will continue to show the utility of pooled CRISPR/Cas9 screens in vivo for target discovery, particularly for specific phenotypes that cannot be reliably reproduced in vitro.

Gain-of-Function Screens

While much focus has been on the manipulation of genes by decreasing their expression, screening using the overexpression of genes can be important in certain contexts. The overexpression of genes for gain-of-function screens has been possible through cDNA expression vectors¹⁶¹ and later CRISPR/Cas9 activation (CRISPRa) screens.¹⁴⁰ Overexpression allows for a positive manipulation of genes to understand biological activity that occurs when the gene is present, in contrast to loss-of-function studies. One benefit of overexpression is avoiding potential variables of cellular compensation and redundancy that occur with gene knockdown or knockout. The caveats to cDNA overexpression are expressing the gene off of an exogenous plasmid, out of the cellular context, and thereby achieving potentially supraphysiological protein expression, which may alter function and localization. Alternatively, CRISPRa allows for targeted overexpression from endogenous loci to activate gene expression from endogenous promoters, or enhancers, of a gene and in this way can regulate a gene in a manner, and to a level, that may be more physiologically relevant. Furthermore, activation of endogenous promoters can lead to expression of multiple gene splice variants,¹⁶² something that is currently not possible with a single cDNA construct.

Phenotypic Genetic Screening

As functional genomic screening has evolved, more complex screens have significantly expanded the range of biology that can be interrogated. The use of fluorescence-activated cell sorting (FACS) has allowed for studies to be performed using pooled gRNA or shRNA libraries at a genome-wide scale, followed by sorting cells based on the abundance of a protein of interest.^163–168 FACS-based pooled genomic screens can be applied to a wide variety of disease states by screening on changes in the abundance of a particular protein of interest. For example, to screen on regulators of autophagy, a lentiviral genome-wide gRNA library was delivered to H4 neuroglioma cells stably expressing green fluorescent protein (GFP)-tagged p62, a well-known substrate, and marker of autophagic activity.¹⁶⁴ After 7 days, cells were sorted on the upper and lower quartile of GFP protein levels, followed by high-throughput sequencing, to determine changes in gRNA abundance between p62 high and low populations. Regulators were identified as gRNAs that changed abundance in the p62 low or high population, signifying active degradation or accumulation of p62, and thus altered autophagy. mTOR is a known regulator of autophagy and, accordingly, the majority of negative regulators identified in the screen were positive regulators of the mTOR pathway, such as Rheb and Raptor, as well as mTOR itself. In addition, several potentially novel regulators of autophagy were identified, showing the utility of such screens in identifying candidate drug targets.¹⁶⁴

In addition to pooled FACS-based screens, genomic perturbations can be assayed using arrayed methods. Arrayed-based screens are done in plate format and thus are more labor-intensive and may require automation depending on the size/type of screen. However, arrayed screens can be used to study specific cellular phenotypes that would not otherwise be possible in a pooled format, such as screening on an image or kinetic-based phenotype.^169,170 The ability to complex multiple endpoints into the same screen also allows much more information to be gathered about how the probed gene influences the cell phenotype. Therefore, array-based screens are an attractive option for specific phenotypic outputs, especially for targeted gRNA or RNAi libraries that could potentially be combined with chemical screens.

Moving forward, the ability to analyze and compare the scale of data being generated by genome-wide screens has become increasingly important. Project Achilles was initiated by the Broad Institute to aid in the effort to compare these screens by compiling close to 1000 cell lines screened with RNAi and CRISPR/Cas9 knockout libraries to enable analysis across screens and identification of cellular dependencies across cell lines.^171–175 In this data portal, screen data sets can be analyzed in combination with gene copy number and expression data in a publicly available data set to examine unique and context-specific genetic vulnerabilities.^172–175 As genome-wide functional screens become increasingly popular, it will be critical to comprehensively analyze these data sets to gain a deep biological understanding to uncover new drug targets and therapeutic avenues.

Future of Functional Genomics in Drug Discovery

In the short history of the functional genomic field, progress has been rapid ( Fig. 4 ). The understanding of gene regulation in biologic systems has greatly improved, leading to the identification of novel biological targets that offer therapeutic options for multiple diseases. However, a large portion of these targets are not considered classically druggable from a small-molecule or antibody perspective, leading to a need for other modalities to address these targets. An increasing amount of research is being put into the development of new modalities to address difficult, or “un-druggable,” targets, such as protein–protein/nucleic acid interactions and transcription factors. New modalities include oligonucleotide therapies (e.g., antisense and modified RNA), protein degradation approaches, and in vivo and ex vivo gene editing using CRISPR/Cas9 technologies.¹³⁸

To address the issues with translatability in drug discovery, many pharmaceutical companies are moving toward examining patient samples to better understand the molecular mechanisms driving disease and identify genetic biomarkers of therapeutic response.¹⁷⁶ This precision medicine approach has been used successfully in the clinic, particularly in oncology. One such example is the clinical benefit seen with the use of a poly (ADP-ribose) polymerase (PARP) inhibitor, olaparib, as a monotherapy in metastatic breast and advanced ovarian cancer patients with BRCA mutations that have received prior chemotherapy.^177,178 The use of targeted therapies, such as olaparib, demonstrates the benefits of identifying mechanistically distinct patient populations that dictate the clinical response to a given therapy. Similar approaches are being explored in other diseases, such as epilepsy, where genetic evaluation has demonstrated that refractory epilepsies can be caused by different underlying mechanisms that can lead to variation in clinical response.¹⁷⁹ These examples demonstrate the value of having increased information about the molecular mechanism of disease and response to therapy.

An issue faced in many diseases is the difficulty in accessing disease tissue and obtaining enough genetic material in relevant patient samples for testing; this is particularly true for neurological disorders. New low-cell-input profiling techniques, such as single-cell RNA-seq,¹⁸⁰ ATAC-seq,¹⁸¹ and CUT&RUN, are opening up new opportunities in this area. By being able to extract more information from small amounts of sample, scientists can more broadly apply these functional genomic techniques. Additionally, as technical hurdles limiting genome editing efficiencies in primary tissue are overcome, it may become possible to use CRISPR/Cas9 tools to conduct drug discovery campaigns directly in patient samples and to examine relevant phenotypes or endpoints in single-cell format. Improved barcoding technologies that use proteins as barcodes may allow for more direct links of cellular perturbation to phenotypes compared with conventional DNA barcoding.¹⁸²

Generally, genetic screens are used to modulate a single target per cell. However, modulating a single gene at a time may not be suitable for the screening of complex polygenic diseases. Future studies will likely examine combinatorial effects of genetic modifications. This could potentially be addressed by combinatorial CRISPR/Cas screens, either by increasing the number of gRNAs introduced per cell in pooled or arrayed screens, or by screens performed in various isogenic disease cell lines to identify phenotype modifying genes. As methods become more refined and robust, complex combinatorial screens will likely lead to the discovery of novel biological pathways and interactions, subsequently expanding the number of future drug targets.

Conclusion

Discovering and developing new medicines is a difficult and high-risk process. Functional genomic tools provide an avenue to gain a comprehensive understanding of human disease biology and enable drug development. With genomic and epigenomic tools, endogenous regulatory networks can be directly probed and clearly linked to phenotypic disease outcomes. As functional genomic technologies continue to develop, they will increasingly be implemented into conventional drug discovery pipelines, aiding the efforts to develop novel therapeutics.

Footnotes

Declaration of Conflicting Interests

The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: All authors are employed by UCB Pharma or Element Genomics, a wholly owned subsidiary of UCB Pharma.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

References

Deloitte Centre for Health Solutions. A New Future for R&D? Measuring the Return from Pharmaceutical Innovation 2017. www2.deloitte.com/content/dam/Deloitte/uk/Documents/life-sciences-health-care/deloitte-uk-measuring-roi-pharma.pdf (accessed March 25, 2019).

Plenge

R. M.

Disciplined Approach to Drug Discovery and Early Development. Sci. Transl. Med. 2016, 8, 349ps15.

Schulze

Ringel

Panier

; et al. Market Watch: Value of 2016 FDA Drug Approvals: Reversion to the Mean? Nat. Rev. Drug Discov. 2017, 16, 78.

Mullard

2018 FDA Drug Approvals. Nat. Rev. Drug Discov. 2019, 18, 85–89.

Scannell

J. W.

Bosley

When Quality Beats Quantity: Decision Theory, Drug Discovery, and the Reproducibility Crisis. PLoS One 2016, 11, e0147215.

Fellmann

Gowen

B. G.

Lin

P.-C.

; et al. Cornerstones of CRISPR-Cas in Drug Discovery and Therapy. Nat. Rev. Drug Discov. 2017, 16, 89–100.

Lötsch

Kringel

Use of Computational Functional Genomics in Drug Discovery and Repurposing for Analgesic Indications. Clin. Pharmacol. Ther. 2018, 103, 975–978.

Elbashir

S. M.

Harborth

Lendeckel

; et al. Duplexes of 21-Nucleotide RNAs Mediate RNA Interference in Cultured Mammalian Cells. Nature 2001, 411, 494–498.

Bernstein

Caudy

A. A.

Hammond

S. M.

; et al. Role for a Bidentate Ribonuclease in the Initiation Step of RNA Interference. Nature 2001, 409, 363–366.

10.

Hammond

S. M.

Bernstein

Beach

; et al. An RNA-Directed Nuclease Mediates Post-Transcriptional Gene Silencing in Drosophila Cells. Nature 2000, 404, 293–296.

11.

Micura

Small Interfering RNAs and Their Chemical Synthesis. Angew. Chem. Int. Ed. Engl. 2002, 41, 2265–2269.

12.

J.-Y.

DeRuiter

S. L.

Turner

D. L.

RNA Interference by Expression of Short-Interfering RNAs and Hairpin RNAs in Mammalian Cells. Proc. Natl. Acad. Sci. U.S.A. 2002, 99, 6047–6052.

13.

Brummelkamp

T. R.

Bernards

Agami

A System for Stable Expression of Short Interfering RNAs in Mammalian Cells. Science 2002, 296, 550–553.

14.

Harborth

Elbashir

S. M.

Bechert

; et al. Identification of Essential Genes in Cultured Mammalian Cells Using Small Interfering RNAs. J. Cell Sci. 2001, 114, 4557–4565.

15.

Jackson

A. L.

Burchard

Schelter

; et al. Widespread SiRNA “Off-Target” Transcript Silencing Mediated by Seed Region Sequence Complementarity. RNA 2006, 12, 1179–1187.

16.

Birmingham

Anderson

E. M.

Reynolds

; et al. 3′ UTR Seed Matches, but Not Overall Identity, Are Associated with RNAi Off-Targets. Nat. Methods 2006, 3, 199–204.

17.

Jackson

A. L.

Bartz

S. R.

Schelter

; et al. Expression Profiling Reveals Off-Target Gene Regulation by RNAi. Nat. Biotechnol. 2003, 21, 635–637.

18.

Maeder

M. L.

Gersbach

C. A.

Genome-Editing Technologies for Gene and Cell Therapy. Mol. Ther. 2016, 24, 430–446.

19.

Richardson

C. D.

Ray

G. J.

DeWitt

M. A.

; et al. Enhancing Homology-Directed Genome Editing by Catalytically Active and Inactive CRISPR-Cas9 Using Asymmetric Donor DNA. Nat. Biotechnol. 2016, 34, 339–344.

20.

Riesenberg

Maricic

Targeting Repair Pathways with Small Molecules Increases Precise Genome Editing in Pluripotent Stem Cells. Nat. Commun. 2018, 9, 2164.

21.

Choulika

Perrin

Dujon

; et al. Induction of Homologous Recombination in Mammalian Chromosomes by Using the I-SceI System of Saccharomyces cerevisiae. Mol. Cell. Biol. 1995, 15, 1968–1973.

22.

Rouet

Smih

Jasin

Expression of a Site-Specific Endonuclease Stimulates Homologous Recombination in Mammalian Cells. Proc. Natl. Acad. Sci. U.S.A. 1994, 91, 6064–6068.

23.

Smih

Rouet

Romanienko

P. J.

; et al. Double-Strand Breaks at the Target Locus Stimulate Gene Targeting in Embryonic Stem Cells. Nucleic Acids Res. 1995, 23, 5012–5019.

24.

Kim

Y. G.

Cha

Chandrasegaran

Hybrid Restriction Enzymes: Zinc Finger Fusions to Fok I Cleavage Domain. Proc. Natl. Acad. Sci. U.S.A. 1996, 93, 1156–1160.

25.

Miller

J. C.

Tan

Qiao

; et al. A TALE Nuclease Architecture for Efficient Genome Editing. Nat. Biotechnol. 2011, 29, 143–148.

26.

Huang

Jiang

W. Z.

; et al. TAL Nucleases (TALNs): Hybrid Proteins Composed of TAL Effectors and FokI DNA-Cleavage Domain. Nucleic Acids Res. 2011, 39, 359–372.

27.

Mussolino

Morbitzer

Lutge

; et al. A Novel TALE Nuclease Scaffold Enables High Genome Editing Activity in Combination with Low Toxicity. Nucleic Acids Res. 2011, 39, 9283–9293.

28.

Huang

Zhao

; et al. Modularly Assembled Designer TAL Effector Nucleases for Targeted Gene Knockout and Gene Replacement in Eukaryotes. Nucleic Acids Res. 2011, 39, 6315–6325.

29.

L. P.

Chandrasegaran

Functional Domains in Fok I Restriction Endonuclease. Proc. Natl. Acad. Sci. U.S.A. 1992, 89, 4275–4279.

30.

Jinek

Chylinski

Fonfara

; et al. A Programmable Dual-RNA-Guided DNA Endonuclease in Adaptive Bacterial Immunity. Science 2012, 337, 816–821.

31.

Cong

Ran

F. A.

Cox

; et al. Multiplex Genome Engineering Using CRISPR/Cas Systems. Science 2013, 339, 819–823.

32.

Addgene: CRISPR Plasmids and Resources. https://www.addgene.org/crispr/ (accessed April 1, 2019).

33.

Gaj

Gersbach

C. A.

Barbas

C. F.

3rd . ZFN, TALEN, and CRISPR/Cas-Based Methods for Genome Engineering. Trends Biotechnol. 2013, 31, 397–405.

34.

Thakore

P. I.

Black

J. B.

Hilton

I. B.

; et al. Editing the Epigenome: Technologies for Programmable Transcription and Epigenetic Modulation. Nat. Methods 2016, 13, 127–137.

35.

Seipel

Georgiev

Schaffner

A Minimal Transcription Activation Domain Consisting of a Specific Array of Aspartic Acid and Leucine Residues. Biol. Chem. Hoppe Seyler 1994, 375, 463–470.

36.

Beerli

R. R.

Dreier

Barbas

C. F.

3rd . Positive and Negative Regulation of Endogenous Genes by Designed Transcription Factors. Proc. Natl. Acad. Sci. U.S.A. 2000, 97, 1495–1500.

37.

Beerli

R. R.

Segal

D. J.

Dreier

; et al. Toward Controlling Gene Expression at Will: Specific Regulation of the ErbB-2/HER-2 Promoter by Using Polydactyl Zinc Finger Proteins Constructed from Modular Building Blocks. Proc. Natl. Acad. Sci. U.S.A. 1998, 95, 14628–14633.

38.

Ballard

D. W.

Dixon

E. P.

Peffer

N. J.

; et al. The 65-kDa Subunit of Human NF-Kappa B Functions as a Potent Transcriptional Activator and a Target for v-Rel-Mediated Repression. Proc. Natl. Acad. Sci. U.S.A. 1992, 89, 1875–1879.

39.

Margolin

J. F.

Friedman

J. R.

Meyer

W. K.

; et al. Kruppel-Associated Boxes Are Potent Transcriptional Repression Domains. Proc. Natl. Acad. Sci. U.S.A. 1994, 91, 4509–4513.

40.

Ayer

D. E.

Laherty

C. D.

Lawrence

Q. A.

; et al. Mad Proteins Contain a Dominant Transcription Repression Domain. Mol. Cell. Biol. 1996, 16, 5772–5781.

41.

Hilton

I. B.

D’Ippolito

A. M.

Vockley

C. M.

; et al. Epigenome Editing by a CRISPR-Cas9-Based Acetyltransferase Activates Genes from Promoters and Enhancers. Nat. Biotechnol. 2015, 33, 510–517.

42.

Maeder

M. L.

Angstman

J. F.

Richardson

M. E.

; et al. Targeted DNA Demethylation and Activation of Endogenous Genes Using Programmable TALE-TET1 Fusion Proteins. Nat. Biotechnol. 2013, 31, 1137–1142.

43.

Mendenhall

E. M.

Williamson

K. E.

Reyon

; et al. Locus-Specific Editing of Histone Modifications at Endogenous Enhancers. Nat. Biotechnol. 2013, 31, 1133–1136.

44.

Brinkman

R. R.

Dubé

M.-P.

Rouleau

G. A.

; et al. Human Monogenic Disorders—A Source of Novel Drug Targets. Nat. Rev. Genet. 2006, 7, 249–260.

45.

Lindpaintner

Genetics in Drug Discovery and Development: Challenge and Promise of Individualizing Treatment in Common Complex Diseases. Br. Med. Bull. 1999, 55, 471–491.

46.

International Human Genome Sequencing Consortium. Finishing the Euchromatic Sequence of the Human Genome. Nature 2004, 431, 931–945.

47.

Lander

E. S.

Linton

L. M.

Birren

; et al. Initial Sequencing and Analysis of the Human Genome. Nature 2001, 409, 860–921.

48.

Visscher

P. M.

Wray

N. R.

Zhang

; et al. 10 Years of GWAS Discovery: Biology, Function, and Translation. Am. J. Hum. Genet. 2017, 101, 5–22.

49.

Plenge

R. M.

Scolnick

E. M.

Altshuler

Validating Therapeutic Targets through Human Genetics. Nat. Rev. Drug Discov. 2013, 12, 581–594.

50.

Cohen

J. C.

Boerwinkle

Mosley

T. H.

; et al. Sequence Variations in PCSK9, Low LDL, and Protection against Coronary Heart Disease. N. Engl. J. Med. 2006, 354, 1264–1272.

51.

Farnier

PCSK9: From Discovery to Therapeutic Applications. Arch. Cardiovasc. Dis. 2014, 107, 58–66.

52.

Hopkins

P. N.

Defesche

Fouchier

S. W.

; et al. Characterization of Autosomal Dominant Hypercholesterolemia Caused by PCSK9 Gain of Function Mutations and Its Specific Treatment with Alirocumab, a PCSK9 Monoclonal Antibody. Circ. Cardiovasc. Genet. 2015, 8, 823–831.

53.

Stein

E. A.

Mellis

Yancopoulos

G. D.

; et al. Effect of a Monoclonal Antibody to PCSK9 on LDL Cholesterol. N. Engl. J. Med. 2012, 366, 1108–1118.

54.

Shi

Kichaev

Pasaniuc

Contrasting the Genetic Architecture of 30 Complex Traits from Summary Association Data. Am. J. Hum. Genet. 2016, 99, 139–153.

55.

Marouli

Graff

Medina-Gomez

; et al. Rare and Low-Frequency Coding Variants Alter Human Adult Height. Nature 2017, 542, 186–190.

56.

Grarup

Moltke

Andersen

M. K.

; et al. Loss-of-Function Variants in ADCY3 Increase Risk of Obesity and Type 2 Diabetes. Nat. Genet. 2018, 50, 172–174.

57.

Saeed

Bonnefond

Tamanini

; et al. Loss-of-Function Mutations in ADCY3 Cause Monogenic Severe Obesity. Nat. Genet. 2018, 50, 175–179.

58.

Tong

Park

α-Cedrene Protects Rodents from High-Fat Diet-Induced Adiposity via Adenylyl Cyclase 3.

Int. J. Obes. 2005 2019, 43, 202–216.

59.

Shen

Seed Ahmed

; et al. Adenylate Cyclase 3: A New Target for Anti-Obesity Drug Development. Obes. Rev. 2016, 17, 907–914.

60.

Guerreiro

Wojtas

Bras

; et al. TREM2 Variants in Alzheimer’s Disease. N. Engl. J. Med. 2013, 368, 117–127.

61.

Dardiotis

Siokas

Pantazi

; et al. A Novel Mutation in TREM2 Gene Causing Nasu-Hakola Disease and Review of the Literature. Neurobiol. Aging 2017, 53, 194.e13–194.e22.

62.

Maurano

M. T.

Humbert

Rynes

; et al. Systematic Localization of Common Disease-Associated Variation in Regulatory DNA. Science 2012, 337, 1190–1195.

63.

Hnisz

Abraham

B. J.

Lee

T. I.

; et al. Super-Enhancers in the Control of Cell Identity and Disease. Cell 2013, 155, 934–947.

64.

Roadmap Epigenomics Consortium; Kundaje

Meuleman

; et al. Integrative Analysis of 111 Reference Human Epigenomes. Nature 2015, 518, 317–330.

65.

Cannon

M. E.

Mohlke

K. L.

Deciphering the Emerging Complexities of Molecular Mechanisms at GWAS Loci. Am. J. Hum. Genet. 2018, 103, 637–653.

66.

GTEx Consortium; Laboratory, Data Analysis & Coordinating Center (LDACC)—Analysis Working Group; Statistical Methods groups—Analysis Working Group; et al. Genetic Effects on Gene Expression across Human Tissues. Nature 2017, 550, 204–213.

67.

Joehanes

Zhang

Huan

; et al. Integrated Genome-Wide Analysis of Expression Quantitative Trait Loci Aids Interpretation of Genomic Association Studies. Genome Biol. 2017, 18, 16.

68.

Yao

Chen

Song

; et al. Genome-Wide Mapping of Plasma Protein QTLs Identifies Putatively Causal Genes and Pathways for Cardiovascular Disease. Nat. Commun. 2018, 9, 3268.

69.

Hannon

Spiers

Viana

; et al. Methylation QTLs in the Developing Brain and Their Enrichment in Schizophrenia Risk Loci. Nat. Neurosci. 2016, 19, 48–54.

70.

Degner

J. F.

Pai

A. A.

Pique-Regi

; et al. DNase I Sensitivity QTLs Are a Major Determinant of Human Expression Variation. Nature 2012, 482, 390–394.

71.

Bouwman

B. A. M.

de Laat

Getting the Genome in Shape: The Formation of Loops, Domains and Compartments. Genome Biol. 2015, 16, 154.

72.

Symmons

Uslu

V. V.

Tsujimura

; et al. Functional and Topological Characteristics of Mammalian Regulatory Domains. Genome Res. 2014, 24, 390–400.

73.

Furlong

E. E. M.

Levine

Developmental Enhancers and Chromosome Topology. Science 2018, 361, 1341–1345.

74.

Le Dily

Baù

Pohl

; et al. Distinct Structural Transitions of Chromatin Topological Domains Correlate with Coordinated Hormone-Induced Gene Regulation. Genes Dev. 2014, 28, 2151–2162.

75.

ENCODE Project Consortium. An Integrated Encyclopedia of DNA Elements in the Human Genome. Nature 2012, 489, 57–74.

76.

Y. I.

van de Geijn

Raj

; et al. RNA Splicing Is a Primary Link between Genetic Variation and Disease. Science 2016, 352, 600–604.

77.

Claussnitzer

Dankel

S. N.

Kim

K.-H.

; et al. FTO Obesity Variant Circuitry and Adipocyte Browning in Humans. N. Engl. J. Med. 2015, 373, 895–907.

78.

Kessler

Wobst

Wolf

; et al. Functional Characterization of the GUCY1A3 Coronary Artery Disease Risk Locus. Circulation 2017, 136, 476–489.

79.

Prokop

J. W.

Yeo

N. C.

Ottmann

; et al. Characterization of Coding/Noncoding Variants for SHROOM3 in Patients with CKD. J. Am. Soc. Nephrol. 2018, 29, 1525–1535.

80.

Miller

C. L.

Pjanic

Wang

; et al. Integrative Functional Genomics Identifies Regulatory Mechanisms at Coronary Artery Disease Loci. Nat. Commun. 2016, 7, 12092.

81.

Bailey

M. H.

Tokheim

Porta-Pardo

; et al. Comprehensive Characterization of Cancer Driver Genes and Mutations. Cell 2018, 173, 371–385.e18.

82.

Marquart

Chen

E. Y.

Prasad

Estimation of the Percentage of US Patients with Cancer Who Benefit from Genome-Driven Oncology. JAMA Oncol. 2018, 4, 1093–1098.

83.

Poduri

Evrony

G. D.

Cai

; et al. Somatic Mutation, Genomic Variation, and Neurological Disease. Science 2013, 341, 1237758.

84.

Goodnow

C. C.

Multistep Pathogenesis of Autoimmune Disease. Cell 2007, 130, 25–35.

85.

Lim

J. S.

Kim

Kang

H.-C.

; et al. Brain Somatic Mutations in MTOR Cause Focal Cortical Dysplasia Type II Leading to Intractable Epilepsy. Nat. Med. 2015, 21, 395–400.

86.

Ross

K. A.

Coherent Somatic Mutation in Autoimmune Disease. PLoS One 2014, 9, e101093.

87.

Detanico

St. Clair

J. B.

Aviszus

; et al. Somatic Mutagenesis in Autoimmunity. Autoimmunity 2013, 46, 102–114.

88.

Platt

R. J.

Chen

Zhou

; et al. CRISPR-Cas9 Knockin Mice for Genome Editing and Cancer Modeling. Cell 2014, 159, 440–455.

89.

Weber

Öllinger

Friedrich

; et al. CRISPR/Cas9 Somatic Multiplex-Mutagenesis for High-Throughput Functional Cancer Genomics in Mice. Proc. Natl. Acad. Sci. U.S.A. 2015, 112, 13982–13987.

90.

Eaton

S. L.

Wishart

T. M.

Bridging the Gap: Large Animal Models in Neurodegenerative Research. Mamm. Genome 2017, 28, 324–337.

91.

Grow

D. A.

McCarrey

J. R.

Navara

C. S.

Advantages of Nonhuman Primates as Preclinical Models for Evaluating Stem Cell-Based Therapies for Parkinson’s Disease. Stem Cell Res. 2016, 17, 352–366.

92.

Caccone

Powell

J. R.

DNA Divergence among Hominoids. Evolution 1989, 43, 925–942.

93.

Rogers

Hixson

J. E.

Baboons as an Animal Model for Genetic Studies of Common Human Disease. Am. J. Hum. Genet. 1997, 61, 489–493.

94.

Capecchi

M. R.

Gene Targeting in Mice: Functional Analysis of the Mammalian Genome for the Twenty-First Century. Nat. Rev. Genet. 2005, 6, 507–512.

95.

Generating Mouse Models with CRISPR/Cas9. https://jackson.jax.org/rs/444-BUH-304/images/Whitepaper_CRISPR.pdf (accessed April 1, 2019).

96.

Wang

Yang

Shivalila

C. S.

; et al. One-Step Generation of Mice Carrying Mutations in Multiple Genes by CRISPR/Cas-Mediated Genome Engineering. Cell 2013, 153, 910–918.

97.

Qin

Dion

S. L.

Kutny

P. M.

; et al. Efficient CRISPR/Cas9-Mediated Genome Editing in Mice by Zygote Electroporation of Nuclease. Genetics 2015, 200, 423–430.

98.

Yang

Wang

Shivalila

C. S.

; et al. One-Step Generation of Mice Carrying Reporter and Conditional Alleles by CRISPR/Cas-Mediated Genome Engineering. Cell 2013, 154, 1370–1379.

99.

Wang

Kutny

P. M.

Byers

S. L.

; et al. Delivery of Cas9 Protein into Mouse Zygotes through a Series of Electroporation Dramatically Increases the Efficiency of Model Creation. J. Genet. Genomics 2016, 43, 319–327.

100.

Kim

Ryu

S. M.

Kim

S. T.

; et al. Highly Efficient RNA-Guided Base Editing in Mouse Embryos. Nat. Biotechnol. 2017, 35, 435–437.

101.

Chen

Lee

A. Y.

; et al. Highly Efficient Mouse Genome Editing by CRISPR Ribonucleoprotein Electroporation of Zygotes. J. Biol. Chem. 2016, 291, 14457–14467.

102.

Shinmyo

Kawasaki

CRISPR/Cas9-Mediated Gene Knockout in the Mouse Brain Using In Utero Electroporation. Curr. Protoc. Neurosci. 2017, 79, 3.32.1–3.32.11.

103.

Hsieh

L. S.

Wen

J. H.

Claycomb

; et al. Convulsive Seizures from Experimental Focal Cortical Dysplasia Occur Independently of Cell Misplacement. Nat. Commun. 2016, 7, 11753.

104.

Auton

Brooks

L. D.

Durbin

R. M.

; et al. A Global Reference for Human Genetic Variation. Nature 2015, 526, 68–74.

105.

Kim

H. S.

Bernitz

J. M.

Lee

D. F.

; et al. Genomic Editing Tools to Model Human Diseases with Isogenic Pluripotent Stem Cells. Stem Cells Dev. 2014, 23, 2673–2686.

106.

Moslem

Olive

Falk

Stem Cell Models of Schizophrenia, What Have We Learned and What Is the Potential?

Schizophr. Res. 2018, 210, 3–12.

107.

Jiang

Zhang

; et al. Modeling Parkinson’s Disease Using Patient-Specific Induced Pluripotent Stem Cells. J. Park. Dis. 2018, 8, 479–493.

108.

Saini

S. M.

Mancuso

S. G.

Mostaid

M. S.

; et al. Meta-Analysis Supports GWAS-Implicated Link between GRM3 and Schizophrenia Risk. Transl. Psychiatry 2017, 7, e1196.

109.

Egan

M. F.

Straub

R. E.

Goldberg

T. E.

; et al. Variation in GRM3 Affects Cognition, Prefrontal Glutamate, and Risk for Schizophrenia. Proc. Natl. Acad. Sci. U.S.A. 2004, 101, 12604–12609.

110.

Ryan

S. D.

Dolatabadi

Chan

S. F.

; et al. Isogenic Human IPSC Parkinson’s Model Shows Nitrosative Stress-Induced Dysfunction in MEF2-PGC1alpha Transcription. Cell 2013, 155, 1351–1364.

111.

Pre

Nestor

M. W.

Sproul

A. A.

; et al. A Time Course Analysis of the Electrophysiological Properties of Neurons Differentiated from Human Induced Pluripotent Stem Cells (IPSCs). PLoS One 2014, 9, e103418.

112.

Joyner

A. L.

Zervas

Genetic Inducible Fate Mapping in Mouse: Establishing Genetic Lineages and Defining Genetic Neuroanatomy in the Nervous System. Dev. Dyn. 2006, 235, 2376–2385.

113.

Spanjaard

Junker

J. P.

Methods for Lineage Tracing on the Organism-Wide Level. Curr. Opin. Cell Biol. 2017, 49, 16–21.

114.

Adkar

S. S.

C. L.

Willard

V. P.

; et al. Step-Wise Chondrogenesis of Human Induced Pluripotent Stem Cells and Purification via a Reporter Allele Generated by CRISPR-Cas9 Genome Editing. Stem Cells 2019, 37, 65–76.

115.

Matthias

; et al. A Myogenic Double-Reporter Human Pluripotent Stem Cell Line Allows Prospective Isolation of Skeletal Muscle Progenitors. Cell Rep. 2018, 25, 1966–1981.e4.

116.

Gao

Yang

Tsang

J. C.

; et al. Reprogramming to Pluripotency Using Designer TALE Transcription Factors Targeting Enhancers. Stem Cell Rep. 2013, 1, 183–197.

117.

Balboa

Weltner

Eurola

; et al. Conditionally Stabilized DCas9 Activator for Controlling Gene Expression in Human Cell Reprogramming and Differentiation. Stem Cell Rep. 2015, 5, 448–459.

118.

Liu

Chen

Liu

; et al. CRISPR-Based Chromatin Remodeling of the Endogenous Oct4 or Sox2 Locus Enables Reprogramming to Pluripotency. Cell Stem Cell 2018, 22, 252–261.e4.

119.

Liu

X. S.

; et al. Editing DNA Methylation in the Mammalian Genome. Cell 2016, 167, 233–247.e17.

120.

Chakraborty

Kabadi

A. M.

; et al. A CRISPR/Cas9-Based System for Reprogramming Cell Lineage Specification. Stem Cell Rep. 2014, 3, 940–947.

121.

Chavez

Scheiman

Vora

; et al. Highly Efficient Cas9-Mediated Transcriptional Programming. Nat. Methods 2015, 12, 326–328.

122.

Black

J. B.

Adler

A. F.

Wang

H. G.

; et al. Targeted Epigenetic Remodeling of Endogenous Loci by CRISPR/Cas9-Based Transcriptional Activators Directly Converts Fibroblasts to Neuronal Cells. Cell Stem Cell 2016, 19, 406–414.

123.

Eguchi

Wleklinski

M. J.

Spurgat

M. C.

; et al. Reprogramming Cell Fate with a Genome-Scale Library of Artificial Transcription Factors. Proc. Natl. Acad. Sci. U.S.A. 2016, 113, E8257–E8266.

124.

Q. V.

Dixon

Verma

; et al. Genome-Scale Screens Identify JNK-JUN Signaling as a Barrier for Pluripotency Exit and Endoderm Differentiation. Nat. Genet. 2019, 51, 999–1010.

125.

Black

J. B.

Gersbach

C. A.

Synthetic Transcription Factors for Cell Fate Reprogramming. Curr. Opin. Genet. Dev. 2018, 52, 13–21.

126.

Tao

Wang

Chen

; et al. Engineering Human Islet Organoids from IPSCs Using an Organ-on-Chip Platform. Lab Chip 2019, 19, 909–1104.

127.

Lee

C. T.

Bendriem

R. M.

W. W.

; et al. 3D Brain Organoids Derived from Pluripotent Stem Cells: Promising Experimental Models for Brain Development and Neurodegenerative Disorders. J. Biomed. Sci. 2017, 24, 59.

128.

Shi

Inoue

J. C.

; et al. Induced Pluripotent Stem Cell Technology: A Decade of Progress. Nat. Rev. Drug Discov. 2017, 16, 115–130.

129.

Hicks

M. R.

Hiserodt

Paras

; et al. ERBB3 and NGFR Mark a Distinct Skeletal Muscle Progenitor Cell in Human Development and HPSCs. Nat. Cell Biol. 2018, 20, 46–57.

130.

Brix

Zhou

Luo

The Epigenetic Reprogramming Roadmap in Generation of IPSCs from Somatic Cells. J. Genet. Genomics 2015, 42, 661–670.

131.

Kim

Doi

Wen

; et al. Epigenetic Memory in Induced Pluripotent Stem Cells. Nature 2010, 467, 285–290.

132.

Studer

Vera

Cornacchia

Programming and Reprogramming Cellular Age in the Era of Induced Pluripotency. Cell Stem Cell 2015, 16, 591–600.

133.

Huh

C. J.

Zhang

Victor

M. B.

; et al. Maintenance of Age in Human Neurons Generated by MicroRNA-Based Neuronal Conversion of Fibroblasts. eLife 2016, 5, e18648.

134.

Manandhar

Song

Kabadi

; et al. Incomplete MyoD-Induced Transdifferentiation Is Associated with Chromatin Remodeling Deficiencies. Nucleic Acids Res. 2017, 45, 11684–11699.

135.

Braun

S. M. G.

Kirkland

J. G.

Chory

E. J.

; et al. Rapid and Reversible Epigenome Editing by Endogenous Chromatin Regulators. Nat. Commun. 2017, 8, 560.

136.

Liu

X. S.

Krzisch

; et al. Rescue of Fragile X Syndrome Neurons by DNA Methylation Editing of the FMR1 Gene. Cell 2018, 172, 979–992.e6.

137.

Santos

Ursu

Gaulton

; et al. A Comprehensive Map of Molecular Drug Targets. Nat. Rev. Drug Discov. 2017, 16, 19–34.

138.

Rask-Andersen

Masuram

Schiöth

H. B.

The Druggable Genome: Evaluation of Drug Targets in Clinical Trials Suggests Major Shifts in Molecular Class and Indication. Annu. Rev. Pharmacol. Toxicol. 2014, 54, 9–26.

139.

Boettcher

Hoheisel

J. D.

Pooled RNAi Screens—Technical and Biological Aspects. Curr. Genomics 2010, 11, 162–167.

140.

Jiang

Doudna

J. A.

CRISPR-Cas9 Structures and Mechanisms. Annu. Rev. Biophys. 2017, 46, 505–529.

141.

Veres

Gosis

B. S.

Ding

; et al. Low Incidence of Off-Target Mutations in Individual CRISPR-Cas9 and TALEN Targeted Human Stem Cell Clones Detected by Whole-Genome Sequencing. Cell Stem Cell 2014, 15, 27–30.

142.

Joung

Konermann

Gootenberg

J. S.

; et al. Genome-Scale CRISPR-Cas9 Knockout and Transcriptional Activation Screening. Nat. Protoc. 2017, 12, 828–863.

143.

Evers

Jastrzebski

Heijmans

J. P. M.

; et al. CRISPR Knockout Screening Outperforms shRNA and CRISPRi in Identifying Essential Genes. Nat. Biotechnol. 2016, 34, 631–633.

144.

Doench

J. G.

Am I Ready for CRISPR? A User’s Guide to Genetic Screens. Nat. Rev. Genet. 2018, 19, 67–80.

145.

Sternberg

S. H.

Doudna

J. A.

Expanding the Biologist’s Toolkit with CRISPR-Cas9. Mol. Cell 2015, 58, 568–574.

146.

Sanjana

N. E.

Shalem

Zhang

Improved Vectors and Genome-Wide Libraries for CRISPR Screening. Nat. Methods 2014, 11, 783–784.

147.

Glass

Lee

; et al. Engineering the Delivery System for CRISPR-Based Genome Editing. Trends Biotechnol. 2018, 36, 173–185.

148.

Sanson

K. R.

Hanna

R. E.

Hegde

; et al. Optimized Libraries for CRISPR-Cas9 Genetic Screens with Multiple Modalities. Nat. Commun. 2018, 9, 5416.

149.

Shalem

Sanjana

N. E.

Hartenian

; et al. Genome-Scale CRISPR-Cas9 Knockout Screening in Human Cells. Science 2014, 343, 84–87.

150.

Wang

Wei

J. J.

Sabatini

D. M.

; et al. Genetic Screens in Human Cells Using the CRISPR-Cas9 System. Science 2014, 343, 80–84.

151.

Tzelepis

Koike-Yusa

De Braekeleer

; et al. A CRISPR Dropout Screen Identifies Genetic Vulnerabilities and Therapeutic Targets in Acute Myeloid Leukemia. Cell Rep. 2016, 17, 1193–1205.

152.

Lin

Cradick

T. J.

Brown

M. T.

; et al. CRISPR/Cas9 Systems Have Off-Target Activity with Insertions or Deletions between Target DNA and Guide RNA Sequences. Nucleic Acids Res. 2014, 42, 7473–7485.

153.

Doench

J. G.

Fusi

Sullender

; et al. Optimized SgRNA Design to Maximize Activity and Minimize Off-Target Effects of CRISPR-Cas9. Nat. Biotechnol. 2016, 34, 184–191.

154.

Hart

Chandrashekhar

Aregger

; et al. High-Resolution CRISPR Screens Reveal Fitness Genes and Genotype-Specific Cancer Liabilities. Cell 2015, 163, 1515–1526.

155.

Oser

M. G.

Fonseca

Chakraborty

A. A.

; et al. Cells Lacking the RB1 Tumor Suppressor Gene Are Hyperdependent on Aurora B Kinase for Survival. Cancer Discov. 2019, 9, 230–247.

156.

Ruiz

Mayor-Ruiz

Lafarga

; et al. A Genome-Wide CRISPR Screen Identifies CDC25A as a Determinant of Sensitivity to ATR Inhibitors. Mol. Cell 2016, 62, 307–313.

157.

Anderson

G. R.

Winter

P. S.

Lin

K. H.

; et al. A Landscape of Therapeutic Cooperativity in KRAS Mutant Cancers Reveals Principles for Controlling Tumor Evolution. Cell Rep. 2017, 20, 999–1015.

158.

Chow

R. D.

Chen

Cancer CRISPR Screens In Vivo. Trends Cancer 2018, 4, 349–358.

159.

Chen

Sanjana

N. E.

Zheng

; et al. Genome-Wide CRISPR Screen in a Mouse Model of Tumor Growth and Metastasis. Cell 2015, 160, 1246–1260.

160.

Chow

R. D.

Guzman

C. D.

Wang

; et al. AAV-Mediated Direct In Vivo CRISPR Screen Identifies Functional Suppressors in Glioblastoma. Nat. Neurosci. 2017, 20, 1329–1341.

161.

Škalamera

Ranall

M. V.

Wilson

B. M.

; et al. A High-Throughput Platform for Lentiviral Overexpression Screening of the Human ORFeome. PLoS One 2011, 6, e20057.

162.

Rebar

E. J.

Huang

Hickey

; et al. Induction of Angiogenesis in a Mouse Model Using Engineered Transcription Factors. Nat. Med. 2002, 8, 1427–1432.

163.

Nishioka

Miyazaki

Soejima

Unbiased shRNA Screening, Using a Combination of FACS and High-Throughput Sequencing, Enables Identification of Novel Modifiers of Polycomb Silencing. Sci. Rep. 2018, 8, 12128.

164.

DeJesus

Moretti

McAllister

; et al. Functional CRISPR Screening Identifies the Ufmylation Pathway as a Regulator of SQSTM1/P62. eLife 2016, 5, e17290.

165.

Arias-Fuenzalida

Jarazo

Qing

; et al. FACS-Assisted CRISPR-Cas9 Genome Editing Facilitates Parkinson’s Disease Modeling. Stem Cell Rep. 2017, 9, 1423–1431.

166.

Potting

Crochemore

Moretti

; et al. Genome-Wide CRISPR Screen for PARKIN Regulators Reveals Transcriptional Repression as a Determinant of Mitophagy. Proc. Natl. Acad. Sci. U.S.A. 2018, 115, E180–E189.

167.

Pusapati

G. V.

Kong

J. H.

Patel

B. B.

; et al. CRISPR Screens Uncover Genes That Regulate Target Cell Sensitivity to the Morphogen Sonic Hedgehog. Dev. Cell 2018, 44, 113–129.e8.

168.

Park

J. S.

Helble

J. D.

Lazarus

J. E.

; et al. A FACS-Based Genome-Wide CRISPR Screen Reveals a Requirement for COPI in Chlamydia trachomatis Invasion. iScience 2019, 11, 71–84.

169.

Tan

Martin

S. E.

Validation of Synthetic CRISPR Reagents as a Tool for Arrayed Functional Genomic Screening. PLoS One 2016, 11, e0168968.

170.

de Groot

Lüthi

Lindsay

; et al. Large-Scale Image-Based Profiling of Single-Cell Phenotypes in Arrayed CRISPR-Cas9 Gene Perturbation Screens. Mol. Syst. Biol. 2018, 14, e8064.

171.

DepMap: The Cancer Dependency Map Project at Broad Institute. https://depmap.org/portal/achilles/ (accessed Oct 31, 2019).

172.

Cowley

G. S.

Weir

B. A.

Vazquez

; et al. Parallel Genome-Scale Loss of Function Screens in 216 Cancer Cell Lines for the Identification of Context-Specific Genetic Dependencies. Sci. Data 2014, 1, 140035.

173.

Tsherniak

Vazquez

Montgomery

P. G.

; et al. Defining a Cancer Dependency Map. Cell 2017, 170, 564-576.e16.

174.

Aguirre

A. J.

Meyers

R. M.

Weir

B. A.

; et al. Genomic Copy Number Dictates a Gene-Independent Cell Response to CRISPR/Cas9 Targeting. Cancer Discov. 2016, 6, 914–929.

175.

Meyers

R. M.

Bryan

J. G.

McFarland

J. M.

; et al. Computational Correction of Copy Number Effect Improves Specificity of CRISPR-Cas9 Essentiality Screens in Cancer Cells. Nat. Genet. 2017, 49, 1779–1784.

176.

Dugger

S. A.

Platt

Goldstein

D. B.

Drug Development in the Era of Precision Medicine. Nat. Rev. Drug Discov. 2018, 17, 183–196.

177.

Robson

S. A.

Senkus

; et al. Olaparib for Metastatic Breast Cancer in Patients with a Germline BRCA Mutation. N. Engl. J. Med. 2017, 377, 1700.

178.

Oza

A. M.

Cibula

Benzaquen

A. O.

; et al. Olaparib Combined with Chemotherapy for Recurrent Platinum-Sensitive Ovarian Cancer: A Randomised Phase 2 Trial. Lancet Oncol. 2015, 16, 87–97.

179.

Myers

C. T.

Mefford

H. C.

Advancing Epilepsy Genetics in the Genomic Era. Genome Med. 2015, 7, 91.

180.

Ziegenhain

Vieth

Parekh

; et al. Comparative Analysis of Single-Cell RNA Sequencing Methods. Mol. Cell 2017, 65, 631–643.e4.

181.

Buenrostro

J. D.

Chang

H. Y.

; et al. ATAC-Seq: A Method for Assaying Chromatin Accessibility Genome-Wide. Curr. Protoc. Mol. Biol. 2015, 109, 21.29.1–21.29.9.

182.

Wroblewska

Dhainaut

Ben-Zvi

; et al. Protein Barcodes Enable High-Dimensional Single-Cell CRISPR Screens. Cell 2018, 175, 1141–1155.e16.