Secretome-Based Screening in Target Discovery

Abstract

Japanese

Korean

Chinese

Secreted proteins and their cognate plasma membrane receptors regulate human physiology by transducing signals from the extracellular environment into cells resulting in different cellular phenotypes. Systematic use of secretome proteins in assays enables discovery of novel biology and signaling pathways. Several secretome-based phenotypic screening platforms have been described in the literature and shown to facilitate target identification in drug discovery. In this review, we summarize the current status of secretome-based screening. This includes annotation, production, quality control, and sample management of secretome libraries, as well as how secretome libraries have been applied to discover novel target biology using different disease-relevant cell-based assays. A workflow for secretome-based screening is shared based on the AstraZeneca experience. The secretome library offers several advantages compared with other libraries used for target discovery: (1) screening using a secretome library directly identifies the active protein and, in many cases, its cognate receptor, enabling a rapid understanding of the disease pathway and subsequent formation of target hypotheses for drug discovery; (2) the secretome library covers significant areas of biological signaling space, although the size of this library is small; (3) secretome proteins can be added directly to cells without additional manipulation. These factors make the secretome library ideal for testing in physiologically relevant cell types, and therefore it represents an attractive approach to phenotypic target discovery.

Keywords

secretome library cell-based screening target discovery phenotypic drug discovery

Introduction

The human proteome consists of protein products from approximately 20,000 protein-coding genes.¹ Based on predictions of signal peptides and transmembrane (TM) regions around two-thirds of the protein-coding genes code for proteins that primarily have an intracellular location, whereas one-third code for proteins that are destined for the secretory pathway. The latter group includes proteins destined to different membranes (e.g., endoplasmic reticulum, Golgi, lysosome, or plasma membrane), or intracellular compartments, as well as proteins secreted into the extracellular environment. Proteins can also be secreted to the extracellular environment via alternative, nonsecretory pathway mechanisms.²

Secreted proteins, and their cognate receptors, are responsible for communication between cells, tissues, and organs within the body. Regulation of these communication pathways is critical for normal human homeostasis, with dysregulation playing a major role in pathophysiology and disease. Secreted signaling proteins can be destined to the blood (endocrine) or to the local environment (paracrine or autocrine). The expressed secretome varies with cell type, differentiation state, and environmental cues. Experimentally verified and/or described secretomes include, for example, cardiac,³ adipocyte,⁴ immunocell,⁵ cellular senescence,⁶ stem cell,⁷ cancer cell,⁸ and the blood secretome.⁹ There are also descriptions of therapy-induced secretomes where inhibition of receptor tyrosine kinases leads to drug-stressed cells with large changes in the expressed secretome and increased drug resistance.¹⁰

Phenotypic drug discovery (PDD) is based on the principle of inducing or preventing a functional readout in a biologically relevant cell system using a sample library of choice.^11,12 Use of large compound libraries for PDD has been described,¹³ but it is more common to use a smaller set of well-annotated, structurally diverse compounds that are known to act on certain biological targets (i.e., have an annotated mechanism of action [MoA]).¹⁴ A number of small-molecule phenotypic screens for the identification of novel targets regulating various biological phenotypes, for example, epigenetic processes,¹⁵ stem cell proliferation,¹⁶ cardiac cell proliferation,^17,18 tumor cell suppression,¹⁹ and regulatory T-cell stability,²⁰ have been reported. However, there are technical challenges associated with the identification/verification of the molecular target of an interesting small-molecule hit, termed “target deconvolution.”^12,21 Although recent advances in chemoproteomics, machine learning, cell microarray, and other methods have been adopted to facilitate target deconvolution,^22–27 there remain few examples of novel targets identified through this approach.²⁸ Thus, there is a need to explore the benefits of additional screening modalities for target discovery. Phenotypic screening using functionalized fragment sets has been described to enable straightforward target deconvolution since fragments will be irreversibly bonded to target proteins.²⁹ Small interfering (si) RNA and lately CRISPR libraries have allowed functional genomics screens for the identification of novel drug targets.^30–33

Secretome-based screening builds on the concept that the secreted proteins are biologically active signaling molecules. When a secretome resource/library is used in combination with PDD, novel targets and signaling pathways can be identified. In this review article, we describe the concept of secretome-based screening using human secreted proteins in combination with different cellular readouts. We give a summary of the field, including industrial efforts, for example, EMDSereno,³⁴ FivePrime Therapeutics,^35,36 Genomics Institute of the Novartis Research Foundation (GNF),^37–40 Novartis,⁴¹ and academic efforts.^42,43 We describe the screening platform that has been set up at AstraZeneca, in collaboration with the Royal Institute of Technology (KTH) in Stockholm, in more detail. Learnings, insights, and challenges are shared. In addition, our view on future applications of secretome-based screening in drug discovery, as well as accessibility to a secretome library, is discussed.

Antibody-based phenotypic screening or methods to study interactions between secreted proteins and receptors will not be covered and readers are referred to other reviews.^44–46

The Concept of Secretome-Based Screening

Secretome-based screening was first described in the postgenomics era when secreted proteins could be identified by a bioinformatics approach, produced by recombinant expression in high-throughput fashion, and tested in different cell-based assays to identify an effect^34,35,37,41 ( Fig. 1A ). In the human body, secreted proteins function as agonists ( Fig. 1A , scenario 1) or antagonists ( Fig. 1A , scenario 2), inducing or inhibiting physiological effects at a cellular level, respectively. Therefore, different types of responses are ideally measured in cell-based assays such as those used in a secretome-based approach. Secreted proteins can also function as decoy factors in signaling pathways, resulting in an antagonist response. There are several examples of decoy factors in the transforming growth factor β (TGF-β) superfamily of proteins,⁴⁷ which regulate the signaling via the receptors by binding to cognate ligands and inhibiting a response.^48,49 A similar effect can be observed when extracellular domains (ECDs) of single-pass plasma TM proteins are shed from the membrane.^50,51 The output of a screen can identify proteins of therapeutic interest at several discrete steps of the workflow ( Fig. 1B ). Actives from the primary screen can be of interest. The cognate receptor can be identified in follow-up screening experiments using ECDs.³⁵ Genes in pathways that are upregulated by secretome proteins can be identified by transcriptome profiling⁵ and confirmed by siRNA and/or the generation of cell line models that lack the gene of interest.^52,53

Figure 1.

Concept of secretome-based screening—the combination of a secretome library and a cell-based assay with a disease-relevant readout results in the identification of novel targets and elucidation of signal transduction pathways. (A) Different responses can be measured in a secretome-based assay. (1) A secreted ligand (triangle) induces an agonist response that results in an increase in the signal of the phenotypic readout. (2) A secreted ligand functions as an antagonist and reduces the signal of the phenotypic readout. (3) A secreted ligand or ECD function as a decoy factor. (4) A secreted ligand is an enzyme, which produces a metabolite that affects the phenotypic readout. The arrows illustrate that both agonist (gray) and antagonist (black) readouts can be measured. (B) Novel biology and putative targets can be discovered at different stages of the secretome-based workflow.

Affinities and/or Kd values between ligand–cognate receptor pairs reported in the literature vary between low picomolar and low micromolar.^35,54–56 It is critical that the concentration of individual samples in the secreted protein library is determined before doing an unbiased secretome screen. This avoids false negatives due to low concentrations of proteins in the assay. This is why most efforts have established methods to quantify the amount of protein produced. However, a low-affinity interaction is sometimes accompanied by a high-affinity interaction, within a ligand–heterodimeric receptor complex, enabling fine-tuning and specificity of a particular signaling response.⁵⁶

A secretome library offers various benefits over other libraries used for PDD. First, the size of the library is small compared with other libraries, enabling screening with precious and disease-relevant cells. Second, cells used for the secretome-based screens do not need to be manipulated via transfection as needed for siRNA screens or to be manipulated to enable Cas9 expression as needed for CRISPR functional genomics screens. This enables screening with primary cells without manipulations. Third, the constituents of the secretome library are biologically relevant. This should make target identification potentially easier than for small molecules. Finally, the secretome library has better coverage of agonistic MoAs than the small-molecule compound set. However, the secretome library also has certain drawbacks. It primarily captures biology that is orchestrated from the plasma membrane and thus is less ideal when probing biology that is mechanistically initiated from the intracellular environment, for example, DNA-repair mechanisms and metabolic events. It has less coverage of antagonistic MoAs than the small-molecule set and CRISPR library and the inherent selectivity may lead to a low hit rate. Therefore, ideally, a secretome library should be used as a complementary approach together with small-molecule, siRNA, and CRISPR libraries. Various secretome libraries and outcomes from secretome-based screens are summarized in Tables 1 and 2 and reviewed in more detail later.

Table 1.

Summary of Different Secretome-Based Efforts That Have Been Described in the Literature.

Research Institute/Company	Library	Expression System	Scale	Amount	Advantages	Disadvantages
Liu et al.,⁴³ School of Pharmaceutical Sciences, Peking University	3899 proteins in medium	HEK secretome-enriched lentiviral library	Cells in secretome-screen transduced with library in 96-well plate	Not described	Quick production of library Potential enrichment of proteins that are secreted via unconvential mechanisms	Library also contains intracellular proteins False negatives and false positives due to using conditioned medium
van Asten et al.,⁴² Faculty of Medicine, University of Amsterdam	756 proteins in medium	HEK cDNA	96 wells; 1 mL	30 ng/mL and 1.4 μg/mL 2 model proteins	Quick production of library	False negatives and false positives due to using conditioned medium
KTH⁶⁸	923 purified proteins	CHO cDNA	60 mL	14 µM	Library can be stored Each protein at known concentration QCed	Long time to produce complete library Stability of produced proteins may be an issue
FivePrime Therapeutics³⁵	4180^a proteins in medium	HEK cDNA	96 wells	20 ng/mL		False negatives and false positives due to using conditioned medium
GNF³⁷	806–4000^b purified proteins	HEKcDNA	35 mL	>100 μg	Library can be storedRelatively quick production of purified proteinsQCed	Rigid process with PEPP
EMD Serono³⁴	2200^a purified proteins	HEKcDNA	100 mL	35 μg/100 mL	Library can be storedQCed proteins	Low amounts generated
Novartis⁴¹	2803 proteins in medium	HEKcDNA	384 wells	ND	Quick production of library	False negatives and false positives due to using conditioned medium

ND = not described.

A similar library has been used in additional described efforts.³⁶

A similar library has been used in additional described efforts.^5,38–40

Table 2.

Summary of Targets That Have Been Generated from Secretome-Based Screens and Additional Validation Assays.

Research Institute/Company	Target Identified	Biological Mechanism	Receptor Deconvolution	Receptor	Follow-Up Assays	In Vivo Models/Clinical Trials	Drug Discovery Project
AstraZeneca⁶⁸	FGF16	CPC proliferation	No, but used biosensor analysis to map receptor preferences	FGFR	Primary mouse CPCsiPSC-derived CMs	FGF16 induces cardiomyocyte replication in neonatal mice subjected to cryoinjury¹¹¹	No
Barrow et al.^{5 a}	PDGF-DD	GFP reporter cell line expressing NKp44-CD3z fusion membrane protein	Primary screen resulted in identification of PDGF-DD–NKp44 ligand–receptor pair	Interaction confirmed by Biacore	NK cell activation	Tumor-derived PDGF-DD restricts outgrowth of solid tumor	No, for a recent review see¹⁰⁶
GNF^{37 a}	PEDF	Human embryonic stem cell self-renewal	Knockdown of PEDF receptor mRNA with shRNA	PEDF receptor	Long-term growth in secondary hESC self-renewal assays	PEDF injected into SCID mice promoted teratomas	No; for a recent review on PEDF see He et al.¹¹⁴
FivePrime Therapeutics³⁵	IL-34	Specific for monocyte viability assay; tested across a panel of 25 assays	Unbiased ECD screeningAntibody blocking of effect	CSF-1R; note that CSF-1R has 2 ligands and IL-34 was identified by the secretome-based screen	Promoted formation of macrophage progenitor cells in human bone marrow cultures	In several clinical trials—pancreatic cancer, liver cancer, biliary tract cancer, pancreatic cancer, melanomaAlso in combination therapy with additional antibodies	Monoclonal antibody against CSF-1RBlocks binding of IL-34 and CSFR; this blocks the production of inflammatory mediators by macrophages and monocytes and reduces inflammation
FivePrime Therapeutics³⁶	FAM150A, FAM150B	Identification of ligands for human leukocyte tyrosine kinase receptor	FAM150A and FAM150B identified in primary screen	Leukocyte tyrosine kinase receptor	Enzyme-linked immunosorber assay (ELISA) readout: receptor phosphorylation in HEK293 cells	No—proof-of-concept screen	No
Novartis⁴¹	EGF and FGF family of proteins, HGF	Proof-of-concept assayProliferation of cancer cells in the presence of relevant inhibitor	ND	ND	Additional cancer cell line assays	Combination therapies in mouse xenograft models	Combinatorial small-molecule drug inhibitor therapies
Liu et al.,⁴³ School of Pharmaceutical Sciences, Peking University	CSF2	Proof-of-concept assay, proliferation of TF-1 cells	ND	ND	ND	ND	ND
Locci et al.^{39 a}	Activin	Primary CD4+ T differentiation into T_fh cells	No	Antibody against type II activin receptors decreased effect	Flow cytometry of primary CD4+ T cells from different donorsSmall-molecule inhibition of signaling	No, lack of translation of effect, using primary cells from mice	ND
Sampath et al.^{38 a}	Oncostatin M	Induction of muscle cell quiescence using primary mouse muscle cells and pooled screening	No	Confirmation of receptor expression by qPCR and in vivo testing in mouse model lacking obligate receptor	In vivo-based imaging screen	Engraftment in mouse model	ND
Scietti et al.^{40 a}	1. C1q2. Lox1	1. 75 proteins from the complement system screened against a staphylococcal library2. Identifying ligands in the complete secretome library for meningococcal adhesin NadA	ND	Interactions confirmed by BLI biophysical approaches2. Interaction confirmed by overexpression of Lox1 on mammalian cells	BLI-based octet analysis and flow cytometry	ND	ND
van Asten et al.,⁴² Faculty of Medicine, University of Amsterdam	FGF16	Inhibition of viral infection of HAP1 cells	No	FGFR	Confirmed in other cell lines	ND	Could lead to the development of novel antiviral medicines

BLI = bio-layer interferometry; ND = not described.

All efforts used libraries generated by GNF.³⁷

Secretome Libraries

Annotation of the Library

There are several different databases available for annotation of the library, including Uniprot (https://www.uniprot.org/),⁵⁷ Ensembl (http://www.ensembl.org/),⁵⁸ pfam (http://pfam.xfam.org/),⁵⁹ and the NCBI Reference Sequence Database (https://www.ncbi.nlm.nih.gov/refseq/).⁶⁰

When a gene list for production of a secretome library is assembled, secreted proteins in the genome need to be stratified from the remaining human genome consisting of intracellular proteins and membrane proteins. Underlying the stratification is prediction of cellular location by, for example, identification of sorting sequence⁶¹ or membrane region.^62,63 There are several databases focusing on identification of secreted proteins including the Secreted Protein Discovery Initiative,⁶⁴ the Secreted Protein Database,⁶⁵ and the more recent MetazSecKB and VerSeDa^66,67 databases. In Uniprot, secreted proteins are also annotated with a function according to keywords for molecular function and/or assigned to a biological process. These include growth factors, cytokines, hormones, regenerative factors, coagulation factors, and different classes of enzymes. The largest annotated group is enzymes, followed by proteins related to immunity, growth factors, and cytokines.

Different secretome-based screening initiatives have stratified their libraries slightly differently^34,35,37,68 but basically using the same principle of predictions of a signal peptide and TM regions, based on algorithms and public information available.

In Jennbacken et al.,⁶⁸ we describe how we stratified our library. Secreted proteins were defined as all Uniprot entries having the subcellular location “Secreted” in addition to all genes with at least one transcript predicted to be secreted according to the Human Protein Atlas (HPA). For prediction of secreted proteins in HPA, three different signal peptide prediction algorithms,^69–71 in combination with seven different TM region prediction algorithms,⁷² were used. To be categorized as secreted, a transcript must have a signal peptide predicted by at least two of three methods, and no TM region predicted by four or more methods. Selected ECDs were also included in the secretome library. One-pass TM proteins for the production of ECDs were selected from Uniprot entries with subcellular location “one-pass TM proteins,” as well as from HPA TM region predictions.⁷²

Additional stratification of a secretome library is useful to be able to prioritize in which order the library should be produced. Gonzalez et al. describes³⁷ how they use genome-wide association scans of different traits that are relevant to disease in human samples^73,74 and traits of inbred mice⁷⁵ to stratify their library. At AstraZeneca we used additional databases such as GeneOntology: “Extracellular space” annotation, Ingenuity Pathway Analysis (IPA) analysis, and an in silico survey of relevant literature, including the “Human Secretome Atlas”⁷⁶ and specific stratification for cardiac cells,^77–81 for the library described above.

A functional analysis of the library, using the Uniprot keywords ( Fig. 2A ), shows that the library contains growth factors, cytokines, and regenerative factors such as the fibroblast growth factor (FGF) family, PDGF family, interferons, growth/differentiation factor (GDF) proteins, and neurotrophic factors, with one-fifth of the proteins in the library, according to Uniprot keywords, having no annotated function. This secretome library is similar to the other described libraries^35,37 and contains well-known secreted proteins, as well as a substantial number of less characterized proteins.

Figure 2.

The secretome library—the constituents, how to produce it, information flow, and sample management. (A) Annotation of the KTH secretome library comprising more than 1500 produced secreted proteins and ECDs. Secreted proteins can be divided into different subcategories based on Uniprot keywords for molecular function and/or biological process. The circle diagram shows the division into subfamilies as indicated. (B) Overview of protein production. (1) Bioinformatics to design constructs for all human secreted proteins and selected ECDs of one-pass TM proteins. (2) Gene synthesis and custom cloning of the constructs followed by sequence verification. (3) Plasmid preparation and additional sequence verification before entering the protein production. (4) Protein expression using the episomal QMCF vector in CHO cells. (5) Protein purification using the C-terminal HPC4 tag. (6) Protein quality check. (C) Overview of the information flow and sample management process. (1) Purified proteins in 2D barcoded vials. (2) Protein batches were thawed once and dispensed into subaliquots (15–20 µL) that were snap-frozen in liquid nitrogen. (3) Aliquots were stored at −80 °C until tested in the cell-based screens. (4) Proteins were dispensed and diluted in 384-well plates before addition to cell-based assays. (5) Data information handling. The library is registered in AstraZeneca compound management databases to allow for the integration between compound handling, assay screening, and data analysis.

Recently, Uhlén et al.⁹ published an extensive additional annotation of the human secretome where the actively secreted proteins in humans were identified. Starting with a bioinformatics-based definition of the secretome, a set consisting of 2641 genes with at least one predicted secreted isoform was manually annotated and classified into three major categories: (1) the blood proteins, (2) the locally secreted proteins, and (3) the intracellular or membrane-associated proteins. Groups 1 and 2 were defined as the secreted genes and consisted of 1709 genes with at least one secreted protein isoform (available at the http://www.proteinatlas.org/blood). The remaining 932 genes in group 3 were annotated as having an intracellular or membrane-associated location.

Liu et al. do the stratification in an entirely different way.⁴³ The practical work is started with the complete set of open reading frames in the human genome (ORFeome V8.1 library; http://horfdb.dfci.harvard.edu/)⁸² in frame with a truncated form of human CD4 containing the membrane-spanning part. The resulting lentiviral library is transduced into HEK293 cells and sorted using a CD4 antibody. Genes that correspond to proteins that are secreted will be displayed on the surface and constitute the basis of the secretome library. Almost 4000 DNA sequences were found in the secretome-enriched library. Nine hundred sequences correspond to secreted proteins according to database annotation, and these account for ~80% of the produced proteins in the library based on copy number. Around 1000 genes are annotated as intracellular proteins and 800 genes have an unknown function. This method to produce the library is interesting since it could possibly be more effective in enriching for proteins that are secreted via unconventional mechanisms.² However, this will require more data to be generated using the secretome-enriched library to be conclusive.

The library generated by FivePrime Therapeutics is based on generation of cDNAs from human tissue material from diverse sources such as fetal, normal adult, cancer adult, and inflamed adult human tissue.³⁵ The library is highlighted in an investment report from 2014 (http://investor.fiveprime.com/static-files/c4ea6f83-e6ed-4334-accb-7ffb002a0122; p 81) and described to comprise a more comprehensive collection of “full-length” cDNA clones based on the use of proprietary technology to capture additional mRNAs with intact 5′ ends.

Production and Quality Control of the Library

Secretome libraries can be produced in different ways, including de novo produced recombinant proteins in conditioned medium,^35,37,41,42 purified proteins,^34,37,68 or a lentiviral secretome-enriched open reading frame library.⁴³

The quality and the concentration of the secreted protein library to be used in the functional assay is critical. Therefore, it is very important to choose a eukaryotic expression host, preferably a mammalian host, for production of the secretome library.^83,84 This should enable correct folding and posttranslational modifications of produced human secretome proteins. Posttranslational modifications include glycosylations⁸⁵ and removal of pro-peptides that are present in some secreted protein families such as the TGF-β superfamily⁴⁷ and the PDGF family.⁸⁶ Proteins that contain pro-peptides will most likely be inactive in a secretome-based screen if they are not processed properly. The most frequently described expression host for production of a secretome library is human embryonic kidney (HEK) cells, followed by Chinese hamster ovary (CHO) cells, also used for the production of most biological drugs.⁸⁷

It is useful to establish a general process for production when making a secretome library (illustrated in Fig. 2B ). Frequently, the native signal peptide is replaced by a generic signal peptide (e.g., IgK or CD33) to standardize secretion from cells.^37,68,88 Proteins that are normally secreted via an unconventional mechanism² are generally produced via the secretory pathway, by inserting a signal peptide at the N-terminus. This might possibly lead to the production of a protein that deviates at the N-terminus, compared with the endogenous protein.

Table 1 lists different human secretome libraries that have been published. The libraries can be divided into three categories: (1) conditioned medium libraries containing expressed secretome proteins, but also metabolites, growth factors, and extracellular matrix proteins secreted by the cells; (2) purified protein libraries; and (3) a secretome-enriched lentiviral library that starts with the full human ORFeome.⁴³

Conditioned Medium Libraries

Generation of a conditioned medium library is described in the seminal article by Lin et al.³⁵ After the initial stratification for production, human cDNAs were generated from different human tissues, resulting in a cDNA collection consisting of a total of 4180 constructs for protein production. All proteins were expressed in HEK293T suspension cells in 96-well plates. A generic signal peptide was used for expression of the ECDs. Proteins were expressed with and without a C-terminal V5-His affinity tag to allow for the detection and quantification of secreted protein. According to Lin et al., 90% of clones secreted detectable protein into the medium, with the median concentration of protein produced being 20 ng/mL.

Harbinski et al.⁴¹ and van Asten et al.⁴² used a similar setup except for that genes were sourced commercially; there was no affinity tag included for detection and, in the case of van Asten et al., cultures were at the larger 24-well scale.⁴²

A serum-free defined medium is preferred when proteins are used directly for screening, without purification, since it cannot be excluded that, for example, growth factors and other agents could contribute to a polypharmacological effect on the functional readout. Lin et al. used a serum-containing medium,³⁵ and all conditioned medium samples were used in screens within 12 h. This also avoids degradation or modification of produced proteins by components that are present in the medium.

It usually takes a few weeks to produce a complete secretome-conditioned medium library containing a few thousand proteins if proteins are produced in parallel in 96-well plates.

Purified Protein Libraries

The first description of a purified human secretome library was published in 2006.³⁴ In this example, all proteins were expressed in HEK293-EBNA cells at 100–500 mL culture scale. In this large project that lasted for 4 years, 2200 protein batches were purified in total. The success rate of production was quite low (30%).³⁴ This may be explained by choice of cells for expression and use of native signal peptide instead of generic.⁸⁸ Also, recent advances in mammalian cell culture^83,84 have probably contributed to higher success rates in later efforts.

Gonzalez et al.³⁷ described a fully automated Protein Expression and Purification Platform (PEPP) robot for the expression and purification of 24 proteins in parallel at <100 mL scale. Success rates were as high as 70%, with average yields reported to be 3 µg/mL.³⁷ In this example, proteins were secreted with and without an FC tag enabling immobilization of the library for different downstream applications, such as the identification of ligand–receptor pairs.⁵ Throughput for the production of a purified protein library varies between 25 proteins/week⁶⁸ and 200–400 proteins/week using the PEPP platform.³⁷

For purified proteins, a more rigorous quality package should be established.^34,37,68 It is vital to implement a sample management process that reduces the number of freeze–thaw cycles and to control how protein stability is affected by storage. As a consequence, a separate vial of protein should be used for the quality check (QC) analysis.

Generation of the KTH Library

We use the KTH library as an example to illustrate in detail how a purified secretome library is generated. A standardized pipeline has been established, including construct design, gene synthesis, protein production, protein purification, and quality control, with the aim of producing pure protein samples (illustrated in Fig. 2B ). It is important that the secretome proteins contain low endotoxin levels, since some cell types including immune cells (e.g., macrophages and dendrocytes) regulate the immunoresponse to pathogens and are exquisitely sensitive to endotoxin.⁸⁹ Thus, measures were taken to ensure that all proteins were produced in a low-endotoxin environment. All proteins were transiently produced in CHO cells using an episomal protein production system.⁹⁰ The recombinant proteins were produced with a purification tag (HPC4) at the C-terminus facilitating affinity purification using an antibody-based chromatography resin with calcium ion-dependent affinity for the HPC4 tag. This system was chosen since it enables a mild elution from the capturing resin.⁹¹ After desalting, the protein concentration was determined by A280 absorbance measurement. The purity was analyzed using sodium dodecyl sulfate–polyacrylamide gel electrophorsis (SDS-PAGE) and Western blot, and tandem mass spectrometry (MS/MS) peptide mapping was used for final identification. The average concentration of the more than 1500 proteins produced was 14 µM. Proteins that had a concentration lower than 2 µM were excluded from the final library. Since cell culture medium was removed during the purification process, polypharmacological effects due to other contaminating factors could be largely excluded.

Sample Management and Protein Storage

There are different requirements for the handling of secretome samples depending on if conditioned medium or purified proteins are used for screening. In the former case, when screening takes place immediately after production of the library, a liquid handling robotic system is needed. If a purified protein secretome library is produced within a few weeks,³⁷ long-term storage of proteins may not need to be considered. However, when library production takes several months to years, a process needs to be established.^34,68 Our process for sample management is outlined in Figure 2C and is similar to the one described by Battle et al.³⁴ After production, each protein batch was thawed once and divided into smaller aliquots before snap-freezing and long-term storage. Before each new screen, an aliquot was thawed and dispensed at the desired concentration in a deep well plate before adding to the cell-based assay. All information about the individual proteins in the library (i.e., gene name, sequence, concentration, and QC report) is maintained in a laboratory information system at KTH and exported to AstraZeneca’s Labguru application (BioData; http://www.labguru.com//), which is used to share information about preclinical bioreagents. The library of proteins is also registered in AstraZeneca’s compound management databases (internal AstraZeneca software and Mosaic; https://www.titian.co.uk), originally used for small molecules but now expanded to handle proteins. This allow for seamless integration between compound handling, assay screening, and data analysis. Cross-referencing of the databases allows full traceability of results and information.

A Secretome-Based Screening Workflow

In Figure 3A a secretome-based workflow is illustrated for a flow cytometry secretome-based screen in 384-well format using two different marker readouts. Similar cell-based screening setups have been published.^20,39 Production of the library should occur in the same location as the screening if conditioned medium is used. Ideally, each individual protein in the secretome library should be tested in duplicate at several different concentrations,⁶⁸ provided that a sufficient number of cells are available. One rationale for testing the library at multiple concentrations is that response curves can be bell-shaped due to receptor desensitization, counterregulatory mechanisms, or self-inhibition of a receptor at higher concentration.^92–94 The library should be screenable in a small number of plates (<20 plates) due to its size and the compatibility of cell-based assays with the 384-well format. Cells are usually incubated with secretome proteins for 1–5 days to capture cell proliferation, differentiation, and de novo expression of specific marker proteins. Cells are usually fixed before analysis. Ideally, positive and negative controls should be included on all plates. Z′ calculated from positive and negative controls is normally used for characterizing assay performance. Assay quality criteria similar to those of other PDD approaches apply to secretome-based screens. Primary actives are identified by applying an activity threshold cutoff.

Figure 3.

The secretome-based workflow from initial screen to confirmed active. (A) Schematic flow diagram showing the different steps in a typical secretome-based screen using purified proteins. (1) Usually the full library is tested at three concentrations in duplicate. A small volume of secretome protein (typically 1 µL) is added to each well (typically 40–50 µL). This results in a top concentration of 200 nM protein for a majority of samples tested. Occasionally, another dose of protein is added to the cells during incubation if the assay is running for a long period of time (>3 days). (2) Actives from the primary screen are confirmed in dose response in the primary assay. (3) A list of confirmed active proteins will be annotated in silico. This involves, for example, literature searches, expression data, disease relevance, and human target validation. (4) Additional protein will be produced so that the secretome library is not depleted. (5) Annotated actives will be tested in additional biologic effect assays (BEAs) before initiating any mechanistic studies (6). (B) An illustrative example of one assay where two markers are measured simultaneously.^20,39 As a result, four types of actives are identified that affect the markers differently (see main text).

The output from a primary screen, with independent measurement of more than one marker, can be quite complex; for example, four types of actives are identified if two markers are measured ( Fig. 3B ). Next, the primary actives should be reconfirmed in concentration response (CR). If human primary cells are used, cells from different donors need to be tested since variability of cells can be expected. Cell viability should be checked in parallel to rule out that the effect was caused by a contaminating agent such as endotoxins. Locci et al. tested 2688 unique proteins for the induction of two markers (C-X-C chemokine receptor type 5 and programmed cell death protein 1). Cells were treated with secretome proteins for 5 days before analysis by flow cytometry. Eleven interferons were shown to inhibit the expression of markers. Activin A was described as the most potent inducer of both markers.³⁹ The results were confirmed in cells from multiple donors using purified activin A from different vendors.

Once initial actives have been confirmed in CR, the list of confirmed active proteins should be annotated in silico, including information concerning cognate receptor and signaling. In addition, expression data, disease relevance, and human target validation should be considered. Prioritization of the annotated proteins will generate a list to guide additional protein production, to avoid depletion of the secretome library. Commercially available proteins^5,39,41 or internally purified proteins³⁵ are usually used at this stage if conditioned medium is used for primary screening. This enables the testing of a few selected candidate proteins in additional biologic effect assays, biophysical experiments, and combinatorial screening with compounds to elucidate receptor preferences.

In the 2008 article from Lin et al.,³⁵ note that some proteins with “low selectivity” emerge as actives across the 25 different assays. Proteins denoted as low-selectivity proteins include the interferon-α family, FGF2, FGF3, and some of the interleukins. In contrast, other proteins, such as interleukin 34 (IL-34), are active for a specific cell type and thus display a “high functional selectivity.” We have made similar observations when applying the KTH library to different assays (unpublished).⁶⁸ For example, FGF9 is active on different cell types, including cardiac progenitor cells (CPCs) and cardiac fibroblasts (CFs), whereas FGF16 is specific for the CPCs.⁶⁸ It should also be noted that FGF16 and other members of the FGF family were identified as a potent inhibitor of viral replication in a secretome-based screen using the near-haploid cancer cell line HAP-1.⁴²

After an active has been identified in a phenotypic screen, the next step is to determine whether the effect is mediated via a ligand–receptor interaction or via some other mechanism (different scenarios are illustrated in Figs. 1A and 4). If enzymes are identified as actives from a secretome screen, it should be established whether the effect on the functional readout is dependent on the catalytic activity of the enzyme. For example, a catalytic-dead mutant version of the protein can be produced and tested in the assay. Another approach is to use a small-molecule inhibitor to block a function.^39,95 In some cases, when there good precedence for the identity of the cognate receptor, such as FGF ligands and FGF receptors or interferon-α ligands and interferon-α 1 and interferon-α 2 receptors, there may be little need for additional receptor deconvolution. However, this is often not the case. Moreover, there is a strong desire to identify novel ligand–receptor pairs from a secretome-based screen.

Figure 4.

A summary of different steps needed to identify a receptor and signaling pathway induced by a secreted ligand. When an active has been identified from a secretome-based screen, the next step is to identify the cognate receptor and/or enzymatic activity that is needed to transduce the signal into the cells. There are several methods available to establish the identity of the receptor as described in the main text. Also, gene expression analysis can be utilized to profile the transcriptional events that are induced by the active secretome proteins. Finally, this can be confirmed by siRNA or precise genome editing (PGE). See text for more details.

Different ways of receptor deconvolution are illustrated in Figure 4 . Ligand–receptor pairs can be identified via ECD screening, as discussed previously. As a result, the functional effect will be antagonized by the ECD binding to the ligand. Ligand–receptor pairs can also be identified in the primary screen by using a target-based approach as described by Zhang et al.³⁶ and Barrow et al.⁵ Also, a small-molecule library annotated for plasma membrane receptor can be used. Arrayed CRISPR and siRNA libraries comprising the plasma membrane proteome can also be applied.⁵⁴

The receptor of the GDF15 ligand, which regulates appetite, was identified by several different groups by overexpression of a cDNA library comprising the plasma membrane proteome,^55,96–98 or by bespoke pull-down experiments.⁹⁹ More involved methods include chemoproteomic approaches where the purified ligand is labeled with a reactive group that can be crosslinked to, for example, live cells.¹⁰⁰

Pathway biology can be interrogated via transcriptomics experiments independent of whether receptor identification experiments were successful. Once putative genes have been identified, this can be followed by targeted siRNA or CRISPR knockout experiments. This should be finalized by translational experiments, for example, in vivo model experiments looking at a therapeutical benefit in a relevant disease model.

In summary, access to a high-quality secretome library as well as disease-relevant cells and assays enables the identification of secreted proteins that regulate a phenotype of interest. Target identification for secretome actives with known receptors can be easier than for small-molecule actives. However, for secretome actives with unknown receptors, the deconvolution work will require bespoke approaches that may not result in the identification of the receptor/target.

Application of Secretome-Based Screening for Target Identification

A secretome-based screening approach was first described by Merck Serono/EMD Serono.³⁴ However, no details were given on therapeutic alignment or cellular readouts applied with the generated secretome library. FivePrime Therapeutics³⁵ reported the development of a proprietary secretome-based platform that facilitated the discovery of IL-34 as a target for cancer treatment (described in more detail below). The GNF library³⁷ enabled the identification of PEDF as a regulator of stem cell renewal.³⁷ Collaborations between GNF and other groups have resulted in the identification of novel target biology (summarized in Table 2 ). For example, Locci et al.³⁹ describes the identification of activin as a regulator of differentiation of follicular helper T (T_fh) cells for the treatment of autoimmune diseases. Sampath et al. describe the identification of oncostatin M as an inducer of muscle stem cell quiescence, which could be of interest to stem cell engraftment.³⁸ Barrow et al. reported the identification of ligand–receptor pairs by screening a secretome library⁵ (described in more detail below). Finally, Scietti et al. uses the library to interrogate host–pathogen interactions relevant to infection diseases.⁴⁰ Novartis⁴¹ has also used commercial cDNA libraries to identify signaling pathways that are involved in drug-induced resistance in different cancers. Independent academic efforts include the identification of secreted factors that inhibit virus infection, in addition to well-known secreted proteins such as the interferons.⁴²

At AstraZeneca there is a growing interest in regenerative aspects of biology in several therapeutic areas, for example, in heart failure, in which there is an ambition to regenerate cells in the heart to treat heart failure. In the respiratory therapeutic area, there is an ambition to repair lung damage by regrowth of lung tissue, potentially restoring lost lung function in obstructive lung disease. The role of the human secretome, including growth factors, cytokines, and additional factors, in regulating regenerative processes is attractive for target discovery in these therapy areas. FGF16 was identified as an interesting candidate for cardiac regeneration and repair⁶⁸ (described in more detail below).

Identification of the Novel Cytokine IL-34 That Regulates Monocyte Viability

Tumor-associated macrophages (TAMs) inhibit antitumor T-cell activity in the tumor microenvironment. In pancreatic and other cancers, high levels of TAMs are associated with poor prognosis. Signaling through the CSF-1R promotes the maintenance and function of TAMs (https://www.fiveprime.com/file.cfm/16/docs/CB_2017_11_SITC_Oral_CSF-1R.pdf).^101,102 IL-34 was first discovered in the secretome-based screen performed by Lin et al.³⁵ (summarized in Table 2, Fig. 5A). A total of 4180 conditioned medium samples were tested in a CellTiter-Glo viability assay (Promega, Madison, WI) using primary monocytes isolated from donors. Curiously, the cDNA clone, derived from tissue material, differed by one amino acid compared with the available hypothetical protein sequence in the database. All follow-up work was performed with purified IL-34. Competition experiments showed that IL-34 bound to CD14+ monocytes in peripheral blood mononuclear cells and promoted the formation of macrophage progenitor cells in human bone marrow cultures. Lin et al. continued to identify the receptor for IL-34. IL-34 was preincubated with the different ECDs in the library before repeating the primary screen setup ( Fig. 5A ). This resulted in identification of CSF-1R, based on the fact that CSF-1R ECD abolished the effect of IL-34 using the primary readout. The affinity between CSF-1R and IL-34 was determined to be in the one-digit picomolar range by surface plasmon resonance experiments. The results from the initial library screen and the follow-up ECD screen were unexpected since macrophage colony-stimulating factor 1 (CSF-1) has already been identified as the cognate ligand of CSF-1R.¹⁰³ Experiments performed by Lin et al.³⁵ and by other groups¹⁰⁴ showed that IL-34 and CSF-1 have nonoverlapping binding sites and are functionally redundant, but the two mRNAs are differentially expressed during development.

Figure 5.

Examples of targets discovered by secretome-based screening.^5,35,68 (A) A secretome-based screen to identify targets that affect the viability of monocytes.³⁵ IL-34 was discovered in the primary screen using primary human monocytes. The methodology used was CellTiter-Glo. The activity of IL-34 was confirmed in human bone marrow cultures (BMCs), in which IL-34 promoted the formation of macrophage progenitor cells. The receptor of IL-34 was discovered by preincubating the protein with ECDs in the secretome library and measuring cell viability. Preincubation with macrophage CSF-1R-ECD resulted in an inhibition of the effect compared with other IL-34-ECD samples. (B) Identification of the NKp44-PDGF-DD receptor pair.⁵ A NKp44-GFP reporter cell line was used to identify the ligand of NKp44 as PDGF-D. The activity of purified PDGF-DD was confirmed using human NK cells from donors, by measuring phosphorylation of downstream substrates Akt and Erk and by measuring proinflammatory cytokine release (interferon-γ and tumor necrosis factor). (C) Identification of FGF16 as a specific inducer of human CPC proliferation.⁶⁸ The ability of secretome proteins to induce iPSC-CPC proliferation was measured by nuclear count. All actives were counterscreened in a CF proliferation assay. The interaction of FGF9 and FGF16 with CPCs and CFs was quantified using biosensor analysis.¹¹⁵ Conditioned medium libraries were used in A and B, whereas a purified protein library was used in C.

Based on this initial discovery, the cabiralizumab antibody, which inhibits the signaling via CSF-1R, was developed. It is now in several different phase 2 clinical trials, in combination therapy with a PD-1 antibody, for treatment of different cancer indications. The humanized monoclonal antibody is directed against the CSF-1R expressed on monocytes, macrophages, and osteoclasts, and it inhibits the binding of macrophage CSF-1 and IL-34 to CSF-1R (https://www.cancer.gov/publications/dictionaries/cancer-drug/def/cabiralizumab; https://www.fiveprime.com/programs/cabiralizumab/).¹⁰⁵

Identification of PDGF-DD as the Ligand of the NKp44 Receptor

Natural cytotoxicity receptors are potential targets for autoimmune diseases and, together with their ligands, can be successfully targeted for cancer immunotherapy.¹⁰⁶ Natural cytotoxicity triggering receptor 2 (NKp44) is a receptor found on natural killer (NK) cells. It was originally identified in 1999 as a novel receptor.¹⁰⁷ However, the ligands of the receptor have remained elusive.⁵ To identify ligands, a secretome-based screening approach was performed using a gene reporter assay and a library similar to the one described in Gonzalez et al. (consisting of 806 proteins) and described to contain more than 4000 mouse and human proteins⁵ ( Fig. 5B ). A chimeric receptor consisting of NKp44 extracellular and TM domains fused to an intracellular reporter domain was used in the reporter assay to monitor ligand–receptor interaction. Interaction of a ligand with the NKp44 chimeric receptor resulted in Ca²⁺ mobilization and activation of GFP expression. The secretome proteins, comprising secreted ligands and ECDs, were coated onto a 384-well plate in duplicate, with protein concentrations varying between ∼0.02 and ∼10 μM. Reporter cells were seeded into plates, the following day after washing, and the GFP reporter signal and dead cells were measured after 24 h of incubation using flow cytometry. One of the active proteins, which was reconfirmed in CR, was platelet-derived growth factor D (PDGF-D). Interestingly, this growth factor is produced by cells in a latent form with an N-terminal CUB domain that is proteolytically removed after secretion from cells.⁸⁶ Follow-up experiments with PDGF-D and receptor binding competent PDGF-DD showed that NKp44 receptor specifically recognized the PDGF-DD form and also suggested that the originally expressed PDGF-D latent form in the medium was processed by proteases during production. Follow-up work using primary human NK cells showed that PDGF-DD induced phosphorylation of expected downstream substrates and triggered proinflammatory cytokine release ( Fig. 5B ). It remains to be seen if a therapy can be developed based on the interaction between PDGF-DD and NKp44.¹⁰⁶

Identification of FGF16 That Induces Proliferation of CPCs

Heart failure after myocardial infarction is a clinical condition that causes high morbidity and mortality.¹⁰⁸ There is a large unmet medical need for treating heart failure driven by the loss of functional cardiomyocytes that occurs during myocardial infarction. It has been shown that CPCs present in the heart contribute to repair of the myocardium and are promising candidates for cardiac repair/regenerative therapies.¹⁰⁹ A subset of the KTH secretome library (923 proteins) was screened with the aim to identify proteins that stimulate proliferation ( Fig. 5C ).⁶⁸ Human induced pluripotent stem cell (iPSC) CPCs were used in the screen in a previously established human iPSC-CPC proliferation assay¹¹⁰ in 384-well format. Secretome proteins were added at three different concentrations and proliferation measured after 3 days of treatment. The primary protein actives were further tested in 10-point CR in triplicate. The CPC screen identified 12 active proteins with varying potencies, including FGF16 and FGF9. However, FGF9 was also shown to be active in a human CF proliferation counterscreen. Follow-up analysis using quartz crystal microbalance biosensor experiments suggested that FGF9 and FGF16 bound to different FGF receptors on the cardiac cells and also showed that FGF16 proliferated mouse native CPCs and iPSC-derived cardiomyocytes.⁶⁸ Cardiac-specific overexpression of FGF16 in neonatal mouse heart subjected to cryoinjury has been shown to induce cardiomyocyte replication and improve heart function in vivo.¹¹¹

Summary

Secreted proteins regulate numerous physiological functions in humans and have become attractive tools for drug discovery. Screening using secretome libraries in disease-relevant assays has been successfully applied in phenotypic target discovery. There are many examples of successful identification of novel targets regulating various biological phenotypes, for example, cell proliferation, tumor cell suppression, and viral infection.

Although secretome-based screening has been successfully applied for target discovery, there is a benefit to making further improvements, both regarding the constituents of the library and from a screening perspective. For example, including secreted proteins that are part of the “hidden human proteome” encoded, for example, by “noncoding genes”^112,113 will facilitate the study of the unknowns in the secretome. Smaller peptides are highly relevant from a disease perspective but are often difficult to generate using a recombinant approach. Instead, smaller peptides can be generated by peptide synthesis and included in the library. Additional improvements to mammalian expression systems and automation of production should speed up the process to produce a purified and quality-checked library. Co-expression with factors that are required to make the processed and activated secreted protein should also be pursued.

We also expect technology developments when it comes to screening. A combination of multiplexed readouts with miniaturization of assays should generate more value from co-cultures of different cell types. For example, one cell type could be expressing and secreting the sample library and thereby affecting another cell type that expresses the relevant receptors. In addition, “combination” screening, where two or more secreted proteins are combined and used for screening, would facilitate the identification of synergistic and antagonistic interactions.

Development of the secretome screen platform at AstraZeneca in collaboration with KTH has provided new opportunities in our drug discovery process. There is an excellent opportunity for collaborations in this area to maximize the use/value of the secretome library to explore a wide range of biologies as exemplified by the collaborations based on the availability of the library produced by the GNF group (e.g., Barrow et al.,⁵ Sampath et al.,³⁸ Locci et al.,³⁹ and Scietti et al.⁴⁰). Similarly, AstraZeneca has recently launched the concept of secretome-based screening via the Open innovation platform (https://openinnovation.astrazeneca.com/).

Finally, even though secretome-based screening is based on a protein sample library, it is agnostic of drug modality. Different approaches can follow after a secretome-based active has been validated. An antibody, or an antisense approach, can be used if an antagonistic effect is desired resulting in inhibition of signaling via the cognate receptor. A protein or RNA therapeutics can be developed to mimic a desired agonist response caused by the endogenous secreted ligand. A small-molecule inhibitor can be applied to inhibit signaling downstream of the ligand–receptor interaction. Thus, a secretome library is a valuable complement to any drug discovery toolbox.

Footnotes

Acknowledgements

The authors would like to thank anonymous reviewers for providing insightful feedback during the revision of this review. The authors would like to thank Jeremie Boucher for critical reading of the manuscript. The authors would also like to thank the “Protein Factory” at KTH for producing the secretome library, which has been funded by the Knut and Alice Wallenberg Foundation, Novo Nordisk Foundation, and AstraZeneca.

Declaration of Conflicting Interests

The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Mei Ding, Arjan Snijder, Mats Ormö, Per-Erik Strömstedt, Rick Davies, and Lovisa Holmberg Schiavone are employees of AstraZeneca

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Mei Ding

References

Uhlen

Fagerberg

Hallstrom

B. M.

; et al. Proteomics. Tissue-Based Map of the Human Proteome. Science 2015, 347, 1260419.

Rabouille

Pathways of Unconventional Protein Secretion. Trends Cell Biol. 2017, 27, 230–240.

Khanabdali

Rosdah

A. A.

Dusting

G. J.

; et al. Harnessing the Secretome of Cardiac Stem Cells as Therapy for Ischemic Heart Disease. Biochem. Pharmacol. 2016, 113, 1–11.

Wang

G. X.

Zhao

X. Y.

Lin

J. D.

The Brown Fat Secretome: Metabolic Functions Beyond Thermogenesis. Trends Endocrinol. Metab. 2015, 26, 231–237.

Barrow

A. D.

Edeling

M. A.

Trifonov

; et al. Natural Killer Cells Control Tumor Growth by Sensing a Growth Factor. Cell 2017, 172, 534–548.e19.

Childs

B. G.

Durik

Baker

D. J.

; et al. Cellular Senescence in Aging and Age-Related Disease: From Mechanisms to Therapy. Nat. Med. 2015, 21, 1424–1435.

Tran

Damaser

M. S.

Stem Cells as Drug Delivery Methods: Application of Stem Cell Secretome for Regeneration. Adv. Drug Deliv. Rev. 2015, 82–83, 1–11.

da Cunha

B. R.

Domingos

Stefanini

A. C. B.

; et al. Cellular Interactions in the Tumor Microenvironment: The Role of Secretome. J. Cancer 2019, 10, 4574–4587.

Uhlén

Karlsson

M. J.

Hober

; et al. The Human Secretome. Sci. Signal. 2019, 12, eaaz0274.

10.

Obenauf

A. C.

Zou

A. L.

; et al. Therapy-Induced Tumour Secretomes Promote Resistance and Tumour Progression. Nature 2015, 520, 368–372.

11.

Swinney

D. C.

Anthony

How Were New Medicines Discovered?

Nat. Rev. Drug Discov. 2011, 10, 507–519.

12.

Moffat

J. G.

Vincent

Lee

J. A.

; et al. Opportunities and Challenges in Phenotypic Drug Discovery: An Industry Perspective. Nat. Rev. Drug Discov. 2017, 16, 531–543.

13.

Clare

R. H.

Bardelle

Harper

; et al. Industrial Scale High-Throughput Screening Delivers Multiple Fast Acting Macrofilaricides. Nat. Commun. 2019, 10, 11.

14.

Jones

L. H.

Bunnage

M. E.

Applications of Chemogenomic Library Screening in Drug Discovery. Nat. Rev. Drug Discov. 2017, 16, 285–296.

15.

Chung

C. W.

Coste

White

J. H.

; et al. Discovery and Characterization of Small Molecule Inhibitors of the BET Family Bromodomains. J. Med. Chem. 2011, 54, 3827–3838.

16.

Yin

Fufa

Chandrasekar

; et al. Phenotypic Screen Identifies a Small Molecule Modulating ERK2 and Promoting Stem Cell Proliferation. Front. Pharmacol. 2017, 8, 726.

17.

Paunovic

A. I.

Drowley

Nordqvist

; et al. Phenotypic Screen for Cardiac Regeneration Identifies Molecules with Differential Activity in Human Epicardium-Derived Cells versus Cardiac Fibroblasts. ACS Chem. Biol. 2017, 12, 132–141.

18.

Woo

L. A.

Tkachenko

Ding

; et al. High-Content Phenotypic Assay for Proliferation of Human iPSC-Derived Cardiomyocytes Identifies L-Type Calcium Channels as Targets. J. Mol. Cell. Cardiol. 2019, 127, 204–214.

19.

de Waal

Lewis

T. A.

Rees

M. G.

; et al. Identification of Cancer-Cytotoxic Modulators of PDE3A by Predictive Chemogenomics. Nat. Chem. Biol. 2016, 12, 102–108.

20.

Ding

Brengdahl

Lindqvist

; et al. A Phenotypic Screening Approach Using Human Treg Cells Identified Regulators of Forkhead Box p3 Expression. ACS Chem. Biol. 2019, 14, 543–553.

21.

Haasen

Schopfer

Antczak

; et al. How Phenotypic Screening Influenced Drug Discovery: Lessons from Five Years of Practice. Assay Drug Dev. Technol. 2017, 15, 239–246.

22.

Counihan

J. L.

Wiggenhorn

A. L.

Anderson

K. E.

; et al. Chemoproteomics-Enabled Covalent Ligand Screening Reveals ALDH3A1 as a Lung Cancer Therapy Target. ACS Chem. Biol. 2018, 13, 1970–1977.

23.

Freeth

Soden

New Advances in Cell Microarray Technology to Expand Applications in Target Deconvolution and Off-Target Screening. SLAS Discov. 2020, 25, 223–230.

24.

Gautam

Jaiswal

Aittokallio

; et al. Phenotypic Screening Combined with Machine Learning for Efficient Identification of Breast Cancer-Selective Therapeutic Targets. Cell Chem. Biol. 2019, 26, 970–979.e4.

25.

Polyakov

V. R.

Moorcroft

N. D.

Drawid

Enrichment Analysis for Discovering Biological Associations in Phenotypic Screens. J. Chem. Inf. Model. 2014, 54, 377–386.

26.

Saxena

Higgs

R. E.

Zhen

; et al. Small-Molecule Affinity Chromatography Coupled Mass Spectrometry for Drug Target Deconvolution. Expert Opin. Drug Discov. 2009, 4, 701–714.

27.

Lee

Bogyo

Target Deconvolution Techniques in Modern Phenotypic Profiling. Curr. Opin. Chem. Biol. 2013, 17, 118–126.

28.

Morgan

Brown

D. G.

Lennard

; et al. Impact of a Five-Dimensional Framework on R&D Productivity at AstraZeneca. Nat. Rev. Drug Discov. 2018, 17, 167–181.

29.

Parker

C. G.

Galmozzi

Wang

; et al. Ligand and Target Discovery by Fragment-Based Screening in Human Cells. Cell 2017, 168, 527–541.e29.

30.

Dorsett

Tuschl

siRNAs: Applications in Functional Genomics and Potential as Therapeutics. Nat. Rev. Drug Discov. 2004, 3, 318–329.

31.

McCrae

Dzgoev

Stahlman

; et al. Lanosterol Synthase Regulates Human Rhinovirus Replication in Human Bronchial Epithelial Cells. Am. J. Respir. Cell Mol. Biol. 2018, 59, 713–722.

32.

Shalem

Sanjana

N. E.

Zhang

High-Throughput Functional Genomics Using CRISPR-Cas9. Nat. Rev. Genet. 2015, 16, 299–311.

33.

Ford

McDonald

Mali

Functional Genomics via CRISPR-Cas. J. Mol. Biol. 2019, 431, 48–65.

34.

Battle

Antonsson

Feger

; et al. A High-Throughput Mammalian Protein Expression, Purification, Aliquoting and Storage Pipeline to Assemble a Library of the Human Secretome. Comb. Chem. High Throughput Screen. 2006, 9, 639–649.

35.

Lin

Lee

Hestir

; et al. Discovery of a Cytokine and Its Receptor by Functional Screening of the Extracellular Proteome. Science 2008, 320, 807–811.

36.

Zhang

Pao

L. I.

Zhou

; et al. Deorphanization of the Human Leukocyte Tyrosine Kinase (LTK) Receptor by a Signaling Screen of the Extracellular Proteome. Proc. Natl. Acad. Sci. U.S.A. 2014, 111, 15741–15745.

37.

Gonzalez

Jennings

L. L.

Knuth

; et al. Screening the Mammalian Extracellular Proteome for Regulators of Embryonic Human Stem Cell Pluripotency. Proc. Natl. Acad. Sci. U.S.A. 2010, 107, 3552–3557.

38.

Sampath

S. C.

Sampath

S. C.

A. T. V.

; et al. Induction of Muscle Stem Cell Quiescence by the Secreted Niche Factor Oncostatin M. Nat. Commun. 2018, 9, 1531.

39.

Locci

J. E.

Arumemi

; et al. Activin A Programs the Differentiation of Human TFH Cells. Nat. Immunol. 2016, 17, 976–984.

40.

Scietti

Sampieri

Pinzuti

; et al. Exploring Host-Pathogen Interactions through Genome Wide Protein Microarray Analysis. Sci. Rep. 2016, 6, 27996.

41.

Harbinski

Craig

V. J.

Sanghavi

; et al. Rescue Screens with Secreted Proteins Reveal Compensatory Potential of Receptor Tyrosine Kinases in Driving Cancer Growth. Cancer Discov. 2012, 2, 948–959.

42.

van Asten

S. D.

Raaben

Nota

; et al. Secretome Screening Reveals Fibroblast Growth Factors as Novel Inhibitors of Viral Replication. J. Virol. 2018, 92, 1–13.

43.

Liu

Jia

; et al. Construction and Screening of a Lentiviral Secretome Library. Cell Chem. Biol. 2017, 24, 767–771.e3.

44.

Minter

R. R.

Sandercock

A. M.

Rust

S. J.

Phenotypic Screening—The Fast Track to Novel Antibody Discovery. Drug Discov. Today Technol. 2017, 23, 83–90.

45.

Blanchard

J. W.

Xie

El-Mecharrafie

; et al. Replacing Reprogramming Factors with Antibodies Selected from Combinatorial Antibody Libraries. Nat. Biotechnol. 2017, 35, 960–968.

46.

Gonzalez

L. C.

Protein Microarrays, Biosensors, and Cell-Based Methods for Secretome-Wide Extracellular Protein-Protein Interaction Mapping. Methods (San Diego, Calif.) 2012, 57, 448–458.

47.

Weiss

Attisano

The TGFbeta Superfamily Signaling Pathway. Wiley Interdiscip. Rev. Dev. Biol. 2013, 2, 47–63.

48.

Groppe

Greenwald

Wiater

; et al. Structural Basis of BMP Signalling Inhibition by the Cystine Knot Protein Noggin. Nature 2002, 420, 636–642.

49.

Harrington

A. E.

Morris-Triggs

S. A.

Ruotolo

B. T.

; et al. Structural Basis for the Inhibition of Activin Signalling by Follistatin. EMBO J. 2006, 25, 1035–1045.

50.

Dong

How

Kirkbride

K. C.

; et al. The Type III TGF-beta Receptor Suppresses Breast Cancer Progression. J. Clin. Invest. 2007, 117, 206–217.

51.

Tien

W. S.

Chen

J. H.

K. P.

SheddomeDB: The Ectodomain Shedding Database for Membrane-Bound Shed Markers. BMC Bioinformatics 2017, 18, 42.

52.

Martin

S. E.

Caplen

N. J.

Applications of RNA Interference in Mammalian Systems. Annu. Rev. Genom. Human Genet. 2007, 8, 81–108.

53.

Kim

J. S.

Genome Editing Comes of Age. Nat. Protoc. 2016, 11, 1573–1578.

54.

Shan

Chen

; et al. OLFR734 Mediates Glucose Metabolism as a Receptor of Asprosin. Cell Metab. 2019, 30, 319–328.e8.

55.

Hsu

J. Y.

Crawley

Chen

; et al. Non-Homeostatic Body Weight Regulation through a Brainstem-Restricted Receptor for GDF15. Nature 2017, 550, 255–259.

56.

Schreiber

The Molecular Basis for Differential Type I Interferon Signaling. J. Biol. Chem. 2017, 292, 7285–7294.

57.

UniProt Consortium. UniProt: A Worldwide Hub of Protein Knowledge. Nucleic Acids Res. 2019, 47, D506–D515.

58.

Zerbino

D. R.

Achuthan

Akanni

; et al. Ensembl 2018. Nucleic Acids Res. 2018, 46, D754–D761.

59.

El-Gebali

Mistry

Bateman

; et al. The Pfam Protein Families Database in 2019. Nucleic Acids Res. 2019, 47, D427–D432.

60.

O’Leary

N. A.

Wright

M. W.

Brister

J. R.

; et al. Reference Sequence (RefSeq) Database at NCBI: Current Status, Taxonomic Expansion, and Functional Annotation. Nucleic Acids Res. 2016, 44, D733–D745.

61.

Emanuelsson

Brunak

von Heijne

; et al. Locating Proteins in the Cell Using TargetP, SignalP and Related Tools. Nat. Protoc. 2007, 2, 953–971.

62.

Fukasawa

Tsuji

S. C.

; et al. MitoFates: Improved Prediction of Mitochondrial Targeting Sequences and Their Cleavage Sites. Mol. Cell Proteomics 2015, 14, 1113–1126.

63.

Pelham

H. R.

The Retention Signal for Soluble Proteins of the Endoplasmic Reticulum. Trends Biochem. Sci. 1990, 15, 483–486.

64.

Clark

H. F.

Gurney

A. L.

Abaya

; et al. The Secreted Protein Discovery Initiative (SPDI), a Large-Scale Effort to Identify Novel Human Secreted and Transmembrane Proteins: A Bioinformatics Assessment. Genome Res. 2003, 13, 2265–2270.

65.

Chen

Zhang

Yin

; et al. SPD—A Web-Based Secreted Protein Database. Nucleic Acids Res. 2005, 33, D169–D173.

66.

Meinken

Walker

Cooper

C. R.

; et al. MetazSecKB: The Human and Animal Secretome and Subcellular Proteome Knowledgebase. Database 2015, 2015, 1–14.

67.

Cortazar

A. R.

Oguiza

J. A.

Aransay

A. M.

; et al. VerSeDa: Vertebrate Secretome Database. Database 2017, 2017, 1–6.

68.

Jennbacken

Wagberg

Karlsson

; et al. Phenotypic Screen with the Human Secretome Identifies FGF16 as Inducing Proliferation of iPSC-Derived Cardiac Progenitor Cells. Int. J. Mol. Sci. 2019, 20, 1–16.

69.

Petersen

T. N.

Brunak

von Heijne

; et al. SignalP 4.0: Discriminating Signal Peptides from Transmembrane Regions. Nat. Methods 2011, 8, 785–786.

70.

Kall

Krogh

Sonnhammer

E. L.

Advantages of Combined Transmembrane Topology and Signal Peptide Prediction—The Phobius Web Server. Nucleic Acids Res. 2007, 35, W429–W432.

71.

Viklund

Bernsel

Skwark

; et al. SPOCTOPUS: A Combined Predictor of Signal Peptides and Membrane Protein Topology. Bioinformatics 2008, 24, 2928–2929.

72.

Fagerberg

Jonasson

von Heijne

; et al. Prediction of the Human Membrane Proteome. Proteomics 2010, 10, 1141–1149.

73.

Zeggini

Weedon

M. N.

Lindgren

C. M.

; et al. Replication of Genome-Wide Association Signals in UK Samples Reveals Risk Loci for Type 2 Diabetes. Science 2007, 316, 1336–1341.

74.

Saxena

Voight

B. F.

Lyssenko

; et al. Genome-Wide Association Analysis Identifies Loci for Type 2 Diabetes and Triglyceride Levels. Science 2007, 316, 1331–1336.

75.

McClurg

Janes

; et al. Genomewide Association Analysis in Diverse Inbred Mice: Power and Population Structure. Genetics 2007, 176, 675–683.

76.

Brown

K. J.

Seol

Pillai

D. K.

; et al. The Human Secretome Atlas Initiative: Implications in Health and Disease Conditions. Biochim. Biophys. Acta 2013, 1834, 2454–2461.

77.

Winter

E. M.

van Oorschot

A. A.

Hogers

; et al. A New Direction for Cardiac Regeneration Therapy: Application of Synergistically Acting Epicardium-Derived Cells and Cardiomyocyte Progenitor Cells. Circ. Heart Fail. 2009, 2, 643–653.

78.

Stastna

Van Eyk

J. E.

Investigating the Secretome: Lessons about the Cells That Comprise the Heart. Circ. Cardiovasc. Genet. 2012, 5, o8–o18.

79.

Smart

Dube

K. N.

Riley

P. R.

Epicardial Progenitor Cells in Cardiac Regeneration and Neovascularisation. Vascul. Pharmacol. 2013, 58, 164–173.

80.

Aurora

A. B.

Porrello

E. R.

Tan

; et al. Macrophages Are Required for Neonatal Heart Regeneration. J. Clin. Invest. 2014, 124, 1382–1392.

81.

Lien

C. L.

Schebesta

Makino

; et al. Gene Expression Analysis of Zebrafish Heart Regeneration. PLoS Biol. 2006, 4, e260.

82.

Temple

Gerhard

D. S.

Rasooly

; et al. The Completion of the Mammalian Gene Collection (MGC). Genome Res. 2009, 19, 2324–2333.

83.

McKenzie

E. A.

Abbott

W. M.

Expression of Recombinant Proteins in Insect and Mammalian Cells. Methods (San Diego, Calif.) 2018, 147, 40–49.

84.

Dyson

M. R.

Fundamentals of Expression in Mammalian Cells. Adv. Exp. Med. Biol. 2016, 896, 217–224.

85.

Croset

Delafosse

Gaudry

J. P.

; et al. Differences in the Glycosylation of Recombinant Proteins Expressed in HEK and CHO cells. J. Biotechnol. 2012, 161, 336–348.

86.

Fredriksson

Eriksson

The PDGF Family: Four Gene Products Form Five Dimeric Isoforms. Cytokine Growth Factor Rev. 2004, 15, 197–204.

87.

Hefzi

Ang

K. S.

Hanscho

; et al. A Consensus Genome-Scale Reconstruction of Chinese Hamster Ovary Cell Metabolism. Cell Syst. 2016, 3, 434–443.e8.

88.

Guler-Gane

Kidd

Sridharan

; et al. Overcoming the Refractory Expression of Secreted Recombinant Proteins in Mammalian Cells through Modification of the Signal Peptide and Adjacent Amino Acids. PLoS One 2016, 11, e0155340.

89.

Schwarz

Schmittner

Duschl

; et al. Residual Endotoxin Contaminations in Recombinant Proteins Are Sufficient to Activate Human CD1c+ Dendritic Cells. PLoS One 2014, 9, e113840.

90.

Silla

Haal

Geimanen

; et al. Episomal Maintenance of Plasmids with Hybrid Origins in Mouse Cells. J. Virol. 2005, 79, 15277–15288.

91.

Stearns

D. J.

Kurosawa

Sims

P. J.

; et al. The Interaction of a Ca2+-Dependent Monoclonal Antibody with the Protein C Activation Peptide Region. Evidence for Obligatory Ca2+ Binding to Both Antigen and Antibody. J. Biol. Chem. 1988, 263, 826–832.

92.

Atanasova

Whitty

Understanding Cytokine and Growth Factor Receptor Activation Mechanisms. Crit. Rev. Biochem. Mol. Biol. 2012, 47, 502–530.

93.

Gruber

B. L.

Marchese

M. J.

Kew

Angiogenic Factors Stimulate Mast-Cell Migration. Blood 1995, 86, 2488–2493.

94.

Yakymovych

Heldin

C. H.

Intracellular Trafficking of Transforming Growth Factor Beta Receptors. Acta Biochim. Biophys. Sin. 2018, 50, 3–11.

95.

El Ouaamari

Dirice

Gedeon

; et al. SerpinB1 Promotes Pancreatic Beta Cell Proliferation. Cell Metab. 2016, 23, 194–205.

96.

Mullican

S. E.

Lin-Schmidt

Chin

C. N.

; et al. GFRAL Is the Receptor for GDF15 and the Ligand Promotes Weight Loss in Mice and Nonhuman Primates. Nat Med 2017, 23, 1150–1157.

97.

Yang

Chang

C. C.

Sun

; et al. GFRAL Is the Receptor for GDF15 and Is Required for the Anti-Obesity Effects of the Ligand. Nat Med 2017, 23, 1158–1166.

98.

Yang

Padkjaer

S. B.

Wang

; et al. Construction of a Versatile Expression Library for All Human Single-Pass Transmembrane Proteins for Receptor Pairings by High Throughput Screening. J. Biotechnol. 2017, 260, 18–30.

99.

Emmerson

P. J.

Wang

; et al. The Metabolic Effects of GDF15 Are Mediated by the Orphan Receptor GFRAL. Nat. Med. 2017, 23, 1215–1219.

100.

Frei

A. P.

Moest

Novy

; et al. Ligand-Based Receptor Identification on Living Cells and Tissues Using TRICEPS. Nat. Protoc. 2013, 8, 1321–1336.

101.

Cannarile

M. A.

Weisser

Jacob

; et al. Colony-Stimulating Factor 1 Receptor (CSF1R) Inhibitors in Cancer Therapy. J. Immunother. Cancer 2017, 5, 53.

102.

Goswami

K. K.

Ghosh

; et al. Tumor Promoting Role of Anti-Tumor Macrophages in Tumor Microenvironment. Cell. Immunol. 2017, 316, 1–10.

103.

Sherr

C. J.

Rettenmier

C. W.

Sacca

; et al. The c-fms Proto-Oncogene Product is Related to the Receptor for the Mononuclear Phagocyte Growth Factor, CSF-1. Cell 1985, 41, 665–676.

104.

Wei

Nandi

Chitu

; et al. Functional Overlap but Differential Expression of CSF-1 and IL-34 in Their CSF-1 Receptor-Mediated Regulation of Myeloid Cells. J. Leukoc. Biol. 2010, 88, 495–505.

105.

Bellovin

Wondyfraw

Levin

; et al. cmFPA008, an Anti-Mouse CSF-1R Antibody, Combines with Multiple Immunotherapies to Reduce Tumor Growth in Nonclinical Models. J. Immunother. Cancer 2015, 3, P351.

106.

Barrow

A. D.

Martin

C. J.

Colonna

The Natural Cytotoxicity Receptors in Health and Disease. Front. Immunol. 2019, 10, 909.

107.

Cantoni

Bottino

Vitale

; et al. NKp44, a Triggering Receptor Involved in Tumor Cell Lysis by Activated Human Natural Killer Cells, Is a Novel Member of the Immunoglobulin Superfamily. J. Exp. Med. 1999, 189, 787–796.

108.

Lloyd-Jones

Adams

Carnethon

; et al. Heart Disease and Stroke Statistics—2009 Update: A Report from the American Heart Association Statistics Committee and Stroke Statistics Subcommittee. Circulation 2009, 119, e21–e181.

109.

Sharma

Mishra

Bigham

G. E.

; et al. A Deep Proteome Analysis Identifies the Complete Secretome as the Functional Unit of Human Cardiac Progenitor Cells. Circ. Res. 2017, 120, 816–834.

110.

Drowley

Koonce

Peel

; et al. Human Induced Pluripotent Stem Cell-Derived Cardiac Progenitor Cells in Phenotypic Screening: A Transforming Growth Factor-beta Type 1 Receptor Kinase Inhibitor Induces Efficient Cardiac Differentiation. Stem Cells Transl. Med. 2016, 5, 164–174.

111.

Huang

Tian

; et al. GATA4 Regulates Fgf16 to Promote Heart Repair after Injury. Development 2016, 143, 936–949.

112.

Ingolia

N. T.

Lareau

L. F.

Weissman

J. S.

Ribosome Profiling of Mouse Embryonic Stem Cells Reveals the Complexity and Dynamics of Mammalian Proteomes. Cell 2011, 147, 789–802.

113.

Zhang

Lian

; et al. A Hidden Human Proteome Encoded by ‘Non-Coding’ Genes. Nucleic Acids Res. 2019, 47, 8111–8125.

114.

Cheng

Benyajati

; et al. PEDF and Its Roles in Physiological and Pathological Conditions: Implication in Diabetic and Hypoxia-Induced Angiogenic Diseases. Clin. Sci. (Lond.) 2015, 128, 805–823.

115.

Salanti

Clausen

T. M.

Agerbaek

M. O.

; et al. Targeting Human Cancer by a Glycosaminoglycan Binding Malaria Protein. Cancer Cell 2015, 28, 500–514.