Biologically Relevant Heterogeneity: Metrics and Practical Insights

Abstract

Heterogeneity is a fundamental property of biological systems at all scales that must be addressed in a wide range of biomedical applications, including basic biomedical research, drug discovery, diagnostics, and the implementation of precision medicine. There are a number of published approaches to characterizing heterogeneity in cells in vitro and in tissue sections. However, there are no generally accepted approaches for the detection and quantitation of heterogeneity that can be applied in a relatively high-throughput workflow. This review and perspective emphasizes the experimental methods that capture multiplexed cell-level data, as well as the need for standard metrics of the spatial, temporal, and population components of heterogeneity. A recommendation is made for the adoption of a set of three heterogeneity indices that can be implemented in any high-throughput workflow to optimize the decision-making process. In addition, a pairwise mutual information method is suggested as an approach to characterizing the spatial features of heterogeneity, especially in tissue-based imaging. Furthermore, metrics for temporal heterogeneity are in the early stages of development. Example studies indicate that the analysis of functional phenotypic heterogeneity can be exploited to guide decisions in the interpretation of biomedical experiments, drug discovery, diagnostics, and the design of optimal therapeutic strategies for individual patients.

Keywords

heterogeneity, high-content screening, flow cytometry, drug discovery, cellular models, organs-on-chips, computational pathology, systems biology, quantitative systems pharmacology, precision medicine

Introduction: Biological Heterogeneity Is a Fundamental Property of Life

Heterogeneity is a fundamental property of biological systems that contributes to development,¹ differentiation,^2,3 immune-mediated responses,¹ and many other cellular, tissue, organ, and organism functions¹ as well as diseases and disease progression.^4–6 Figure 1 illustrates the different scales or levels of biological systems exhibiting heterogeneity that can be measured with the appropriate methods. This perspective will focus primarily on heterogeneity in populations of cells in vitro and in tissue sections, but much of the discussion, especially with reference to the need for standard metrics and their application to biomedical research, drug discovery, and diagnostics, can also be applied to populations at all scales.

Figure 1.

The multiple scales of biological heterogeneity detected in a population of organisms, as well as within organs, tissues, cells, molecules, pathways, and networks. (A) Individuals in a population exhibit heterogeneity in a variety of genomic and phenotypic measures. Heterogeneity can be detected (B) between and within organs and tissues; (C) between cells in terms of expression levels, genomics, and functions; and within cells in terms of (D) cellular constituents. (E) Combinations of molecules interact in time and space within and between cells as part of biological pathways that result in normal and abnormal cellular functions. (F) Computational or mathematical models of “systems,” including cellular pathways, organ, multiorgan, and organism, can be generated and used to predict responses that must incorporate heterogeneity of components in the models.

Heterogeneity results from genetic variation,⁷ nongenetic characteristics,¹ or a combination of these ( Fig. 2 ). Nongenetic heterogeneity can be driven by extrinsic factors (e.g., tissue microenvironment) and intrinsic factors (e.g., variation in protein expression).¹ Although heterogeneity is sometimes referred to as “noise” or as arising from “noise” in cellular networks, the presence of noise hinders information transfer, while the presence of heterogeneity provides information.

Figure 2.

Classification of the types of heterogeneity that can be exhibited by a population of cells (adapted from Huang¹). (A) Heterogeneity can be the result of genetic variations and/or nongenetic factors even in a clonal population. Nongenetic heterogeneity, also called phenotypic heterogeneity, can be driven by extrinsic factors, such as the microenvironment in a tissue that can influence, for example, the protein expression levels in surrounding cells. Extrinsic factors drive spatial heterogeneity often exhibited as macro-heterogeneity. Intrinsic heterogeneity can be detected even in a uniform environment and has been classified as macro- or micro-heterogeneity depending on the characteristics of the distribution. (B) Macro-heterogeneity refers to variations in one or more cellular traits that results in discrete phenotypes or subpopulations of cells and can be driven by both extrinsic and intrinsic factors. Micro-heterogeneity refers to random variations within a single phenotype that can include population “noise” resulting from variations in regulatory networks, for example, or temporal “noise” such as variation in protein synthesis over time. Highlighted in red are three important measurable components of the distribution of a cell feature.

Analysis of heterogeneity is expected to inform a wide range of biological applications, from biomedical research to medical diagnostics. Whether developing an assay for drug discovery, a therapy for cancer, or optimizing a protocol for stem cell differentiation, the prevalence of heterogeneity in biological systems suggests that more can be learned through analysis of the population distribution than merely evaluating the population average. In contrast, most cell experimentation currently assumes a normal distribution of data and uses the population average for the sake of speed and simplicity. However, it is becoming clear that heterogeneity is the rule rather than the exception, such that homogeneity in population data cannot be assumed when analyzing and interpreting data.

Measurement of heterogeneity most often involves methods with single-cell resolution ( Fig. 3 ), although population-based methods have also been used to detect heterogeneity. For example, experiments by Luria and Delbrück⁸ on populations of bacteria demonstrated in the 1940s that bacteria spontaneously mutated, forming a heterogeneous population in which predisposed subpopulations, harboring virus-resistance mutations, were selected as a result of viral infection. More commonly, though, heterogeneity is detected through examination of the phenotypes of the individuals in the population and is characterized by quantitation of the distributions of those phenotypes. In studies where cellular heterogeneity has been characterized, the methods and metrics have varied ( Table 1 ).^2,4,9–17 The lack of an accepted standard for measuring and reporting cellular heterogeneity makes it difficult to compare the degree of heterogeneity in different studies and biological systems. At present, therefore, only the methods and metrics themselves can be compared, although we will make suggestions on the application of metrics.

Figure 3.

Heterogeneity in populations of cells can be quantified by a variety of methods that permit cell-by-cell measurements. (A) Single-cell genomics, epigenomics, proteomics, and metabolomics (reprinted with permission from Spagnolo et al.²¹) and/or transcriptomics (from Saadatpour et al.²³⁵) to study heterogeneity use either ground-up tissue samples or single cells and can provide a comprehensive analysis of heterogeneity for a large number of cells. (B) High-content screening and digital pathology²¹ employ multiple fluorescent probes to capture a broad range of information, including expression levels and subcellular localization of molecules within and across individual cells. (C) Optical (from Hines et al.¹¹⁴) and mass cytometry (from Spitzer and Nolan¹⁰⁵) can provide information on expression levels of several molecules simultaneously as well as some morphological information in large populations but do not report spatial heterogeneity. (D) Mass spectrometry readouts expand the range of molecules that can be simultaneously detected in flow cytometry (mass cytometry) and can be used to image tissues and cells in imaging mass cytometry (from Giesen et al.⁹³) and imaging mass spectrometry (from Zavalin et al.¹⁰⁰).

Table 1.

Example Approaches to Quantifying Heterogeneity.

Approach	Examples	Characteristics
Univariate, Gaussian statistics	Mean,²³⁰ standard deviation,²³⁰ z score,²⁴ skew,²³¹ kurtosis,²³¹ moment²³⁰	Assumes normal distribution, insensitive to subpopulations, no information on type of heterogeneity
Entropy	Quadratic,^4,76,134 Shannon,²³² Simpson,²³² Renyi²³³	Established measures of diversity and information content, only established for univariate data
Nonparametric statistics	Kolmogorov-Smirnov statistic^14,145	Can improve accuracy of results, no assumptions on distribution, no information on distribution shape
Model functions	Gaussian mixture models^61,88	Assumes there is some number of normally distributed subpopulations, can be applied to multivariate data, normal model may not be appropriate
Combined metrics	Pittsburgh Heterogeneity Indices (PHI)^4,37	Model independent, descriptive of heterogeneity
Spatial methods	Fractal dimension,²³³ pointwise mutual information (PMI)²¹	No assumption of distribution, leverages spatial interactions, applies to multivariate data
Temporal methods	Temporal distance between robust centers of mass of two feature sets^13,234	Applies to multivariate data, method developed based on genomic data
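
As a rough illustration of the entropy row in Table 1, the sketch below computes three diversity measures from a binned, single-feature distribution. The bin edges, the example intensities, and the function names are hypothetical, and the use of bin distance |i - j| in the quadratic entropy is one illustrative choice among many; the cited studies may define these quantities differently.

```python
from collections import Counter
from math import log

def bin_fractions(values, lo, hi, n_bins):
    """Fraction of values in each of n_bins equal-width bins on [lo, hi]."""
    width = (hi - lo) / n_bins
    counts = Counter(min(int((v - lo) / width), n_bins - 1) for v in values)
    return [counts.get(b, 0) / len(values) for b in range(n_bins)]

def shannon_entropy(p):
    """Shannon entropy (natural log) of a list of bin fractions."""
    return sum(-pi * log(pi) for pi in p if pi > 0)

def gini_simpson(p):
    """One minus the Simpson index (sum of squared fractions)."""
    return 1.0 - sum(pi * pi for pi in p)

def quadratic_entropy(p):
    """Rao-style quadratic entropy with distance |i - j| between bins."""
    return sum(abs(i - j) * pi * pj
               for i, pi in enumerate(p) for j, pj in enumerate(p))

# Hypothetical normalized intensities for three small cell populations.
single_state = bin_fractions([0.52] * 8, 0.0, 1.0, 4)
adjacent_states = bin_fractions([0.1] * 4 + [0.3] * 4, 0.0, 1.0, 4)
distant_states = bin_fractions([0.1] * 4 + [0.9] * 4, 0.0, 1.0, 4)

for name, p in [("single", single_state),
                ("adjacent", adjacent_states),
                ("distant", distant_states)]:
    print(name, shannon_entropy(p), gini_simpson(p), quadratic_entropy(p))
```

In this toy case the Shannon and Gini-Simpson values are identical for the two two-state populations, whereas the quadratic entropy is larger when the two states are further apart, because it weights each pair of bins by their separation.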

Biologically relevant heterogeneity can be divided into three categories: population heterogeneity, spatial heterogeneity, and temporal heterogeneity ( Table 2 ). In each category, the heterogeneity can be characterized as micro- or macro-heterogeneity, depending on the nature of the distribution ( Fig. 2 ). Micro-heterogeneity refers to heterogeneity within an apparently uniform population (i.e., the variance of a single bell-shaped distribution), whereas macro-heterogeneity refers to the presence of distinct populations (i.e., multimodal).¹ Establishing standardized terminology, methods, and metrics will be essential to the routine extraction and communication of insights from biological heterogeneity.

Table 2.

Selected Definitions.

Term	Definition
Biologically relevant heterogeneity	General term for heterogeneity detected or measured at some scale (level) of a biological system (molecule, cell, tissue, organ, organism) after correcting for any instrumental “systems response” variations, as well as sample preparation.
Population heterogeneity	Variation in some phenotype(s) among individuals in a population at a single time point. Requires measurements of many individuals in a population.
Spatial heterogeneity	Variation in some variable(s) at different spatial locations within a sample. Requires a set of measurements at different spatial locations.
Temporal heterogeneity	Variation in some variable(s) measured as a function of time. Requires a set of measurements at different time points.
Quantitative systems pharmacology (QSP)	Determining the mechanism(s) of disease progression and mechanism(s) of action of drugs on multiscale systems through iterative and integrated computational and experimental methods to optimize the development of therapeutic strategies.
Precision medicine	Development and use of individual or combinations of features that tell clinicians about risk of disease, selection of the best treatment, and likely disease course, including response to treatment for a specific patient.
Pseudotime	Quantitative measures of biological progression.¹²⁷
Heteroscedasticity	Unequal variance in the distribution (with respect to an independent variable).

Detection of Biologically Relevant Heterogeneity

Biologically relevant heterogeneity can be detected and quantified with a variety of methods, provided they have sufficient fidelity over the population. One of the earliest indications of biological heterogeneity was in tumors, where morphological variations were noted by pathologists who examined fixed sections of animal and human cancers.¹⁸ However, manual cell-by-cell scoring limits the size of the regions and number of cells that can be analyzed as well as the objectivity of the analysis. Digital pathology now enables a more comprehensive and objective assessment of cellular phenotypes in tissues, allowing analysis of population and spatial heterogeneity of biomarkers and microenvironment components such as immune cells.^19–23 The detection of heterogeneity is currently most advanced in isolated cell systems: automated microscope imaging (e.g., high-content screening [HCS]) is used to extract multiple phenotypic features from many relatively large populations of adherent cells,^24–26 flow cytometry is used for bacterial^27,28 and suspension cell analysis,^29–32 and other single-cell methods, such as single-cell genomics and proteomics,^33–36 are progressing toward in situ analysis ( Fig. 3 ).

Distinguishing between biologically relevant heterogeneity and “system variability” resulting from sample preparation, data acquisition, and/or data processing requires a well-founded understanding of the sources of noise in the measurements, achieved by calibration and characterization of the systems response using appropriate standards or reference measures.^4,37–39 The importance of minimizing the “system variability” is critical to achieving consistent, quantitative measurements, as has been discussed in detail for high-content imaging and flow cytometry.^40–42 Flow cytometry has a long history and mature process for system calibration, characterization, and standardization, including published protocols⁴³ and an array of reference standards.^44–46 As a result, data can be generated and compared between different systems and in different labs. However, manual gating and segmentation of populations of cells can still be a source of variation in the results.⁴⁷ Recent progress on automated segmentation of cell populations shows some promise in addressing this source of variability.^48–51 Establishing standard methods and metrics for the characterization of system reproducibility is a key to more reliable detection and quantitation of biological heterogeneity.

The Need for Methods/Metrics to Detect, Quantify, and Characterize Heterogeneity in Biological Systems

Historically, “population average” metrics have dominated the measurement and interpretation of cellular data. Most cellular assays rely on whole-well measurements, such as total enzyme activity or total fluorescence intensity per well, making contributions from subpopulations or extreme outliers impossible to parse from the average response. This well average approach has also extended to high-content cellular assays, where the standard methods generally assume a normal data distribution to “save time” and to “simplify analysis” by producing a single value that is easily understood, even if not fully representative of the biology. In fact, when computational models based on these assumptions are employed and fail to explain observations or give variable results, investigators will often discover biologically relevant heterogeneity in the system they are studying.⁸

Population average measures are routinely used as assay readouts and to assess assay performance in chemical or biological library screens or structure-activity relationship (SAR) campaigns. Standard assay performance metrics such as the Z′ factor⁵² or the strictly standardized mean difference (SSMD)⁵³ only measure the degree of separation of the positive and negative control wells, based on the average and standard deviation (SD) of the assay readouts. The assumptions are that there is a normal distribution of the assay readouts across wells and that the assay readout adequately represents the biology in the well. However, population average metrics do not adequately reflect the distribution of the biology within the wells, which can lead to misinterpretation of assay consistency. This was recently illustrated by Gough et al.³⁷ in a retrospective analysis of a high-content assay where there was heterogeneity in the interleukin-6 (IL-6) activation of STAT3. They showed that even though the Z′ factor indicated a robust assay at the well level (Z′ ≥ 0.5) across all the plates, the fundamental biology on several plates was found to be quite different. Thus, to reliably assess the biology, it is necessary to establish quality control (QC) metrics for the distribution of the cell population within each well.
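
To make concrete what these well-level metrics do and do not see, the following minimal sketch computes the Z′ factor and the SSMD (in its form for independent groups) from control-well readouts. The readout values and function names are hypothetical. Note that the only inputs are the well-level means and SDs; nothing about the cell-level distribution inside each well enters either formula.

```python
from math import sqrt
from statistics import mean, stdev

def z_prime(pos, neg):
    """Z' factor: 1 - 3 * (sd_pos + sd_neg) / |mean_pos - mean_neg|."""
    return 1.0 - 3.0 * (stdev(pos) + stdev(neg)) / abs(mean(pos) - mean(neg))

def ssmd(pos, neg):
    """SSMD for independent groups: mean difference / sqrt(sum of variances)."""
    return (mean(pos) - mean(neg)) / sqrt(stdev(pos) ** 2 + stdev(neg) ** 2)

# Hypothetical well-average readouts (arbitrary units) for control wells.
positive_wells = [98.0, 101.0, 100.0, 102.0, 99.0]
negative_wells = [10.0, 12.0, 11.0, 9.0, 13.0]

zp = z_prime(positive_wells, negative_wells)
s = ssmd(positive_wells, negative_wells)
print(f"Z' = {zp:.3f}, SSMD = {s:.1f}")
```

Two plates could return the same values here while differing substantially in how the response is distributed across the cells in each well, which is the situation described above.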

Meaningful quantitation of heterogeneity requires selecting an appropriate set of metrics, while interpretation of heterogeneity requires a strategy for dissecting the inherent complexity of cellular distributions ( Fig. 2 ). In one approach, the distinction between homogeneous and heterogeneous data is defined by a measure of diversity in the sample. In a sample that exhibits heterogeneity, micro-heterogeneity is indicated by a normal distribution and macro-heterogeneity¹ by the degree of nonnormality, using a metric such as the Kolmogorov-Smirnov (KS) statistic.^4,54 Macro-heterogeneity requires the use of analytics that can characterize the distribution, visually or using model functions, as consisting of a number of discrete subpopulations (sometimes referred to as modality), a continuous and potentially complex distribution, or some combination.¹⁴
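
One simple way to express "degree of nonnormality" is the KS distance between the empirical distribution of a cell feature and a normal distribution with the same mean and SD. The sketch below is an illustration under that assumption only; the choice of a fitted normal as the reference, the simulated intensities, and the function names are not taken from the cited work, which may use a different reference distribution.

```python
import random
from math import erf, sqrt
from statistics import mean, stdev

def normal_cdf(x, mu, sigma):
    """Cumulative distribution function of a normal distribution."""
    return 0.5 * (1.0 + erf((x - mu) / (sigma * sqrt(2.0))))

def ks_vs_fitted_normal(values):
    """Largest gap between the empirical CDF and a normal CDF that has
    the sample's own mean and SD."""
    xs = sorted(values)
    n = len(xs)
    mu, sigma = mean(xs), stdev(xs)
    d = 0.0
    for i, x in enumerate(xs):
        f = normal_cdf(x, mu, sigma)
        # The empirical CDF steps from i/n to (i+1)/n at x; check both sides.
        d = max(d, abs(f - i / n), abs((i + 1) / n - f))
    return d

rng = random.Random(0)  # fixed seed so the simulated data are repeatable
# Simulated single-phenotype population (micro-heterogeneity only).
unimodal = [rng.gauss(100.0, 10.0) for _ in range(2000)]
# Simulated population with two discrete subpopulations (macro-heterogeneity).
bimodal = ([rng.gauss(60.0, 5.0) for _ in range(1000)]
           + [rng.gauss(140.0, 5.0) for _ in range(1000)])

ks_uni = ks_vs_fitted_normal(unimodal)
ks_bi = ks_vs_fitted_normal(bimodal)
print(f"KS unimodal = {ks_uni:.3f}, KS bimodal = {ks_bi:.3f}")
```

The statistic stays close to zero for the single bell-shaped population and becomes much larger for the two-subpopulation case, but it does not by itself say how many subpopulations are present; that still requires inspecting or modeling the distribution.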

In addition to population heterogeneity, it is also important to consider spatial heterogeneity. The detection and interpretation of spatial heterogeneity, using methods such as pointwise mutual information (PMI) or computational modeling, can be used to identify patterns of phenotypic heterogeneity that may be correlated with the microenvironment or potentially the result of intrinsic factors.²¹ The analysis of temporal heterogeneity is also important and presents some unique challenges, including deconvolution of cell cycle effects (which may also be a source of heterogeneity) and avoiding artifacts in monitoring cells over time.⁵⁵ However, there are examples of live-cell studies that have addressed these challenges, collecting large cell-level data sets to analyze and model the temporal changes and heterogeneity in live-cell phenotypes.^55–57 A systematic approach to the detection, quantitation, and characterization of heterogeneity will make it a source of insight, rather than simply an added burden to investigators.
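
The idea behind PMI can be sketched with a toy example: given phenotype labels for pairs of neighboring cells, PMI compares how often two phenotypes are found next to each other with how often they would be if phenotypes were placed independently. The neighbor list, the phenotype names, and the way marginal frequencies are estimated below are all illustrative assumptions, not the procedure of the cited study.

```python
from collections import Counter
from math import log2

def spatial_pmi(neighbor_pairs):
    """PMI (base 2) for each unordered phenotype pair among neighboring cells."""
    pair_counts = Counter(tuple(sorted(p)) for p in neighbor_pairs)
    total_pairs = sum(pair_counts.values())
    # Marginal phenotype frequencies, counted over both members of every pair.
    label_counts = Counter(label for pair in neighbor_pairs for label in pair)
    total_labels = sum(label_counts.values())
    pmi = {}
    for (a, b), count in pair_counts.items():
        p_ab = count / total_pairs
        p_a = label_counts[a] / total_labels
        p_b = label_counts[b] / total_labels
        # Probability of the unordered pair if phenotypes were independent.
        expected = p_a * p_b if a == b else 2.0 * p_a * p_b
        pmi[(a, b)] = log2(p_ab / expected)
    return pmi

# Hypothetical neighbor pairs, e.g. cells whose centroids lie within some radius.
pairs = ([("tumor", "tumor")] * 4
         + [("immune", "immune")] * 4
         + [("tumor", "immune")] * 2)

pmi = spatial_pmi(pairs)
for key in sorted(pmi):
    print(key, round(pmi[key], 3))
```

Positive values indicate phenotype pairs that are adjacent more often than expected, and negative values indicate pairs that are adjacent less often than expected; in this toy data set, like cells cluster and tumor-immune contacts are depleted.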

Even though researchers are more frequently detecting and investigating heterogeneity, additional attention must be given to the practical need for robust, generally applicable tools that can be implemented in high-throughput production environments, rather than continuing to introduce custom solutions that are intrinsically too narrow in scope to support integration of data sets. Ultimately, we need commonly understood metrics for heterogeneity, just as we use statistical concepts like mean and SD for normal distributions.

Potential Insights from the Analysis of Heterogeneity in Biology and Drug Discovery

Whether heterogeneity is inherent to a population of cells,⁵⁸ induced by the microenvironment,^59,60 or induced by compound or reagent treatment,^4,61–63 analysis of phenotypically similar cell subpopulations, derived from the analysis of heterogeneity, is expected to improve the accuracy of cellular measurements, better support the interpretation of the data, provide insights into the regulation of cellular networks, guide the computational modeling of the networks, guide the prioritization of compounds for development in drug discovery, and optimize the development of diagnostics for precision medicine and further basic biological knowledge.

Cell-to-cell variability is believed to be the result of deterministic molecular regulatory mechanisms that remain largely uncharacterized.^1,64,65 Subpopulations of cells with distinct phenotypes isolated from a macro-heterogeneous population have been demonstrated to revert to the original macro-heterogeneous phenotype distribution over time,^66,67 indicating that heterogeneity is a persistent characteristic of a population, reflecting transitions among distinct metastable cell states induced by cell-autonomous and non-cell-autonomous signaling rather than simply noise.⁶⁶ A recent study suggesting that heterogeneity can be decomposed into groups of biomarkers that are consistent with known signaling pathways also implies a mechanistic basis for the cell-to-cell variation.⁹ In other studies, it has been shown that patterns of signaling heterogeneity can distinguish cellular subpopulations with different drug sensitivities.^4,68 The differential sensitivity to drug treatment of subpopulations of cells may well provide an indication of compound mechanism(s) of action.^{1,5,9,64,65,68,69} Differential sensitivity measurements in vitro also provide insights into how effective a therapy might be in vivo. For example, if the half-maximal response represents all cells showing 50% inhibition, then treatment cycles in vivo may produce a different response rate than if the half-maximal response is a result of 100% inhibition in half of the cells. In the latter case, a significant survivor population among the unaffected cells may result in a treatment with poor efficacy in the clinic, despite apparently good efficacy in cell assays. In addition, cells treated with drugs of similar mechanism of action exhibited similar heterogeneity.⁶¹ Taken together, these findings suggest that there is an integral link between phenotypes, networks, drug sensitivity, and patterns of heterogeneity. The analysis of heterogeneity therefore provides a basis for the generation of hypotheses regarding regulatory networks, such as that suggested by Gascoigne and Taylor⁶² that the heterogeneity induced by drugs was the result of interacting networks.
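
The half-maximal response example above can be written out numerically. In the sketch below, the cell counts, the per-cell inhibition values, and the threshold used to call a cell "unaffected" are hypothetical; the point is only that two very different cell-level distributions give the same population average.

```python
from statistics import mean

n_cells = 1000

# Scenario A: every cell is inhibited by 50%.
graded = [0.5] * n_cells
# Scenario B: half of the cells are fully inhibited, half are unaffected.
all_or_none = [1.0] * (n_cells // 2) + [0.0] * (n_cells // 2)

def fraction_unaffected(inhibition, threshold=0.1):
    """Fraction of cells whose inhibition is below a (hypothetical) threshold."""
    return sum(x < threshold for x in inhibition) / len(inhibition)

for name, cells in [("graded", graded), ("all-or-none", all_or_none)]:
    print(name, "mean inhibition:", mean(cells),
          "unaffected fraction:", fraction_unaffected(cells))
```

Both scenarios report a mean inhibition of 0.5, but only the all-or-none case contains a subpopulation of unaffected cells that could act as survivors; a readout based on the average alone cannot separate the two.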

Implications of Heterogeneity for Precision Medicine

Because there is heterogeneity among individual patients, the challenges associated with improving the success rate in developing therapies may seem daunting. However, the solution may be in the development of precision therapies that address the heterogeneity exhibited in subpopulations of patients, as discussed by Stern et al.⁶ in a perspective on quantitative systems pharmacology. There is growing evidence that some heterogeneity enables physiological and evolutionary adaptation.^70,71 The association between cellular heterogeneity and adaptation suggests that ignoring heterogeneity in the in vitro cellular response to candidate therapeutics may lead to the selection of compounds to which cells will readily adapt, leading to a loss of efficacy.^72,73 On the other hand, an understanding of interclonal interactions that can lead to disease-specific phenotypic traits could provide novel therapeutic opportunities.^72,74

When heterogeneity is associated with dysregulated genetic-based and/or non-genetic-based functions, it can play a critical role in the progression of complex diseases such as cancer,⁷⁵ where intratumor heterogeneity poses a formidable challenge to the development of therapeutics,^5,65 as well as diagnostics.^5,21,22,76 Thus, identifying, quantifying, and characterizing heterogeneity in patient samples and disease-relevant models using validated cell-by-cell analysis methods^{5,21,73,75–78} addresses an important unmet need.

Methods for Single-Cell Evaluation in Cell Populations

There are many systems and methods for the evaluation of single cells in the context of a population, including high-content imaging methods such as high-content screening (HCS) and digital pathology, imaging mass spectrometry (IMS), imaging mass cytometry (IMC), flow cytometry, mass cytometry (MC), and single-cell “omics” ( Fig. 3 ). In general, each of these approaches delivers information with enough signal-to-noise at the single cell level and sufficient throughput at the population level to characterize the heterogeneity in cellular phenotypes. The metrics discussed below can be applied to all of these methods.

Optical High-Content Imaging/Digital Pathology

High-content imaging, such as HCS or digital pathology, when applied to multiple labeled targets, can provide data from large numbers of cells in large numbers of samples. HCS is commonly used to measure fixed or live cells in up to five dimensions (3D plus time and wavelength) using expressed fluorescent protein biosensors, a wide range of fluorescent probes, and transmitted light methods.^24,79 Digital pathology typically uses stains for transmitted light imaging and fluorescent antibodies and nucleic acid probes to label specific biomarkers in formalin-fixed, paraffin-embedded (FFPE) tissue sections. Both applications benefit from capturing a broad range of information about the population, including spatial distributions of tissue structures and molecules within each cell, within cellular compartments, and spatial relationships between cells. Live-cell imaging also provides temporal and direct functional readouts such as cell motility and division.^80–82 Light microscopic approaches range from low-magnification, large area images that contain hundreds to thousands of cells that are analyzed individually, to one-by-one serial evaluation of tens to hundreds of cells with high magnification, including super-resolution.^83–85 In addition to HCS applications,^37,86 a wide range of automated microscopy analyses are routinely used in research^4,9,68,87,88 and digital pathology.^{20,21,23,69,76,89}

Several light microscope imaging platforms have been developed to acquire multivariate information from images of large area tissue sections and tissue microarrays (TMAs) using DNA, RNA, and protein biomarkers.^22,90,91 Although typically limited to one to six labels per cell due to spectral overlap, recent technological advances have now enabled imaging of highly multiplexed (“hyperplexed”) biomarkers (>60) in many individual cells in situ in fixed tissues, with subcellular resolution that captures the spatial arrangement of many discrete cellular phenotypes (i.e., spatial heterogeneity).^{73,77,92–94} It is now possible to “map” the location of specific cell types, cell activation states, and cell biomarker expression levels, as well as extracellular constituents, in tissue sections and TMAs. The determination of spatial heterogeneity at subcellular resolution is still nascent, but it promises to help elucidate the cellular networks, as well as their cell-autonomous and heterotypic signaling interactions, involved in the regulation of both normal and disease processes. The importance of understanding the dynamic regulation of cellular heterogeneity is discussed below.

IMS and IMC

The application of mass spectrometry (MS) to image analysis has enabled a higher degree of multiplexing of a wider range of analytes that can be simultaneously imaged in cell and tissue samples at the single-cell level. There are basically three approaches to imaging that use MS: a label-free method, IMS, and two epitope tagging methods (IMC and multiplex ion beam imaging [MIBI]).

IMS is a label-free method that allows the visualization of ionizable species within a given mass range while retaining spatial information.⁹⁵ The technique can measure a range of molecular species from small-molecule drugs to full-length proteins in samples ranging from whole animals to single cells.^95–97 There are three basic ionization approaches for IMS: matrix-assisted laser desorption ionization (MALDI), secondary ion mass spectrometry (SIMS), and desorption electrospray ionization (DESI).⁹⁸ Each approach has advantages in terms of the types of analytes that can be measured and the spatial resolution. Lipids, peptides, and small molecules can be detected by all three, with MALDI also capable of measuring full-length proteins with a molecular mass of ~50 kDa. The spatial resolution of IMS is typically about 100 µm for DESI, 30 to 50 µm for MALDI, and 0.5 to 1 µm for SIMS,⁹⁵ although advances in MALDI technology have enabled subcellular resolution.^97,99,100 Furthermore, IMS can report molecular distributions in 3D volumes, thereby extending the spatial environment.¹⁰¹

Both IMC⁹³ and MIBI¹⁰² use antibodies that are tagged with nonbiological, unique rare earth metal reporters that are easily identified in MS. Samples are ionized with a laser or ion beam, the metal tags are quantified, and then the images are computationally reconstructed based on known raster positions of the laser or ion beams.^93,102,103 These approaches are still developing but have already enabled quantification of >40 parameters at the single-cell level^103–105 and have been used to detect heterogeneity in breast cancer tissues.^93,106

The power of IMS lies in its ability to quantitatively measure hundreds of analytes simultaneously, enabling the discernment of novel molecular species involved in specific biological contexts. IMS can be used in a targeted mode, looking at known molecular entities, or in a discovery mode, which requires no prior knowledge of the biology. This aspect has been successful in identifying intratumor heterogeneity at the molecular level in otherwise histomorphologically homogeneous tumor regions in primary gastric cancers.¹¹ In a more targeted approach, Mao et al.¹⁰⁷ used air flow–assisted ionization mass spectrometry to image the distribution of lipids in breast cancer tissues and demonstrated that various histological grades of invasive ductal carcinoma and ductal carcinoma in situ can be distinguished by the lipid profile. Other studies have reported the application of IMS to studying intratumor heterogeneity and differentiation of tumor/tissue types^108,109 as well as heterogeneous distribution of drugs in tissues.¹¹⁰ The ability of IMS to quantify metabolites enables a functional assessment of the biology not seen by other methods and enables a deeper understanding of the disease state as well as mechanisms of action of drugs.⁹⁶

Flow Cytometry

Flow cytometry is a standard method that rapidly evaluates many cells (up to ~10,000 cells/s) in a population one at a time. The application of flow cytometry to the analysis of heterogeneity in cellular systems is certainly not new.²⁷ Like the microscopy methods described above, cells can be labeled using expressed fluorescent proteins as well as with a wide range of fluorescent probes and antibodies. Highly multiplexed flow cytometry allows up to 17 fluorescent markers¹¹¹ per cell using photodetection or more than 36 mass markers per cell using mass cytometry detection.¹¹²

In flow cytometry, individual biomarkers are most often used for binary classification of cells, using either manual or automated gating to distinguish positive from negative cells, but the data collected from the samples include the distribution of the intensity of the labels and therefore can be used to identify and characterize the heterogeneity of the cells.¹¹³ Because cells must be suspended to be measured, flow cytometry is most often used for nonadherent cells but can be used for any cells that can be isolated and suspended in media.³⁰ By suspending cells in media, the spatial context of the cell is lost, as well as some of the subcellular spatial context, but the cells can be sorted based on the signal intensity from one or more markers, allowing the selection of live subpopulations of cells for further experiment. Sample preparation, especially when isolating cells from tissue, can lead to significant differences between samples and laboratories and therefore needs to be carefully controlled.¹¹⁴

Single-Cell “Omics”

Multidisciplinary technological advances in experimental design and computational analysis have now made it possible to measure global gene expression in thousands of individual cells in a single experiment to infer biochemical and genetic regulatory mechanisms.¹¹⁵ Single-cell RNA-seq (scRNAseq)^116,117 and its complementary single-cell-based platforms for epigenome (i.e., bisulfite sequencing^118,119 and DNase I hypersensitivity^120–122), proteome,^93,104,123 and metabolome¹²⁴ analyses have begun to provide an unprecedented view of cellular heterogeneity.¹¹⁵ The power of defining the spatial and temporal relationships among distinct subpopulations of cells circumvents the limitations of averaged readouts intrinsic to bulk analyses,¹²⁵ enabling the determination of the dynamics and regulation of cellular processes such as differentiation, tissue homeostasis, and complex disease progression.¹¹⁵ However, the single-cell measurements are often quite variable, requiring that novel normalization strategies be introduced into the experimental design to distinguish technical variability from genuine biological variability.¹²⁶ Furthermore, while variation in measurements (i.e., gene expression) linked to the cell cycle can provide important biological insights, this variation could also obscure more physiologically important differences among cells.³⁶ To address the potential confounding effects of cell cycle asynchrony and more generally discriminate among different sources of biological heterogeneity, single-cell latent variable models have been introduced.³⁶ This computational approach for analyzing cell-to-cell heterogeneity has enabled the identification of otherwise undetectable subpopulations of cells that, for example, have provided insights into the differentiation of naive T cells into T-helper cells.³⁶ Normalized single-cell data for which sources of heterogeneity have been addressed can be processed using unsupervised clustering algorithms to identify cell types, define stable states, and reconstruct transition paths (i.e., trajectories) between these stable states.¹¹⁵ Quantitative measures of biological progression (i.e., pseudotime; Table 2 ) through complex processes such as differentiation and oncogenic transformation can be generated using these algorithms that in turn provide valuable mechanistic insights.¹²⁷ For instance, Monocle has been designed to work with scRNAseq and, by analogy, Wanderlust¹⁰⁴ with high-dimensional cytometry for proteomic measures of pseudotime. We expect, for example, that the mechanistic insights gained from comprehensive network-based single-cell analysis of heterogeneity will be applied to circulating tumor cells for the early detection of rare resistant subpopulations to inform precision therapeutic strategies.^59,128

Need for a Standard Set of Heterogeneity Metrics

There have been many methods and metrics applied to the analysis of heterogeneity. Table 1 lists some of the major classes of metrics with their key characteristics. Most of the metrics are focused on characterization of population heterogeneity, while relatively few methods address the important spatial aspect of heterogeneity, and temporal heterogeneity^56,57 remains to be addressed.

Value in Establishing a Standard Set of Metrics

Although a single set of standard metrics for heterogeneity may not be optimal in all situations, it would provide a number of advantages. First, it would encourage integration into software packages such as Spotfire (Tibco Software, Boston, MA) and R,¹²⁹ as well as HCS and flow cytometry analysis packages. Second, it would facilitate communication and enable comparison of heterogeneity between systems and assays. Third, only after a method has been established through a peer-reviewed, transparent approach can it be routinely used in a scope beyond the focus of the investigator who developed it. As the formal quantification and analysis of heterogeneity become more common, there is a need both for tools that can be applied efficiently and for tools that provide insight into the system under study.

The most important characteristics of an optimal set of heterogeneity metrics are to facilitate interpretation of the biology and to produce clear communication of the results of the analysis. Heterogeneity measures need to describe the shape of the population distribution and should be as simple and clear as describing a normal (unimodal) distribution by the “mean,” “median,” “mode,” and “standard deviation.” A second key aspect of optimal metrics is a clear understanding of where they can be applied and why they are appropriate for a particular situation. Optimal metrics for heterogeneity, as they gain acceptance, will have more general or more specific applications.

Comparison of Published Metrics for Heterogeneity

Several types of metrics have been applied to the identification of heterogeneity in cell populations. Generally, the metrics characterize three aspects of the distribution: the overall extent or diversity, the shape or modality, and the tails. As a first pass, graphical methods, including histograms and the Q-Q plot, can be useful for visualization and detection of modality.¹⁴ Nonparametric statistics, such as interquartile range (IQR),¹³⁰ percent outliers,¹³¹ the KS statistic,⁵⁴ Shannon index,¹³² Simpson index,¹³³ and quadratic entropy,¹³⁴ have been used to describe the distribution of a population. Extent measures include the IQR and entropy measures. The IQR, defined to be the first quartile subtracted from the third quartile, is a measure of statistical dispersion¹³⁰ that can be applied to any distribution, but half the data falls outside the range and therefore the IQR is only sensitive to the central portion of the distribution. The Shannon entropy and Simpson indices have been used to describe the diversity of species in the ecological sciences. The disadvantage of both Shannon and Simpson indices is that they ignore the magnitude of the difference between species. The quadratic entropy incorporates a distance matrix to create a more robust measure of diversity by including the magnitude of the differences. Quadratic entropy has been applied to describe the diversity in cell populations.^4,37,76,134 Shape measures often use a normal distribution as a reference and make a qualitative or quantitative comparison with the data.
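
The contrast between the extent measures above can be made concrete with a small sketch. The code below is illustrative only and uses the Python standard library; the function names, the two toy populations, and the use of the absolute difference between bin indices as the distance are assumptions made for this example, not part of any published implementation.

```python
import math
from collections import Counter


def iqr(values):
    """Interquartile range (Q3 - Q1), with linear interpolation between order statistics."""
    xs = sorted(values)

    def quantile(q):
        pos = q * (len(xs) - 1)
        lo = int(math.floor(pos))
        hi = min(lo + 1, len(xs) - 1)
        return xs[lo] + (pos - lo) * (xs[hi] - xs[lo])

    return quantile(0.75) - quantile(0.25)


def proportions(labels):
    """Relative frequency of each phenotype class (e.g., a histogram bin)."""
    n = len(labels)
    return {k: c / n for k, c in Counter(labels).items()}


def shannon_index(labels):
    """Shannon entropy, in nats, of the class proportions."""
    return -sum(p * math.log(p) for p in proportions(labels).values())


def simpson_index(labels):
    """Gini-Simpson form: probability that two random cells fall in different classes."""
    return 1.0 - sum(p * p for p in proportions(labels).values())


def quadratic_entropy(labels, distance):
    """Rao's quadratic entropy: sum over class pairs of d(a, b) * p_a * p_b."""
    p = proportions(labels)
    return sum(distance(a, b) * p[a] * p[b] for a in p for b in p)


def bin_distance(a, b):
    """Assumed distance between classes: absolute difference of bin indices."""
    return abs(a - b)


# Two hypothetical populations, each split 50/50 between two intensity bins.
near = [0] * 50 + [1] * 50  # adjacent bins
far = [0] * 50 + [9] * 50   # widely separated bins

for name, pop in (("near", near), ("far", far)):
    print(name, shannon_index(pop), simpson_index(pop), quadratic_entropy(pop, bin_distance))
```

In this toy case the Shannon and Simpson values are identical for the two populations because both ignore how far apart the bins are, whereas the quadratic entropy is larger for the widely separated bins (4.5 versus 0.5 under the assumed distance).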

The KS statistic is a well-known method for quantifying the difference between two distributions. This can be used, for example, as a normality test when a sample distribution is compared to a normal distribution^4,14,37 or as a QC test to track the shape of the distribution in controls. Other statistical tests of normality, such as Anderson-Darling, also compare a sample distribution to a normal distribution, returning a numerical measure of the goodness of fit.¹³⁵ In selecting a test, it is important to consider the sample size, as some tests of normality work best for small sample sizes of 10 to 1000. Cellular assays may contain data from hundreds to many thousands of cells, and such tests may be too sensitive for these large populations and thus may overestimate the significance of small differences in heterogeneity. Finally, the tail of the distribution can be characterized by the outliers. The percent outliers in the population^4,37 can indicate whether the population has a normal or more heavy-tailed distribution.¹³¹
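
As a minimal sketch of the shape and tail measures discussed above, the following code computes a KS distance between a sample and a normal distribution with the same mean and standard deviation, and the percentage of box-plot outliers. The 1.5 × IQR fence, the function names, and the synthetic samples are assumptions chosen for illustration; a production analysis would more likely rely on an established statistics package.

```python
import math
import statistics


def normal_cdf(x, mu, sigma):
    """Cumulative distribution function of a normal distribution."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def ks_vs_normal(values):
    """KS distance between the empirical CDF and a normal with the sample mean and SD."""
    xs = sorted(values)
    n = len(xs)
    mu = statistics.fmean(xs)
    sigma = statistics.stdev(xs)
    d = 0.0
    for i, x in enumerate(xs):
        f = normal_cdf(x, mu, sigma)
        # Compare the fitted CDF with the empirical CDF just after and just before x.
        d = max(d, (i + 1) / n - f, f - i / n)
    return d


def percent_outliers(values, k=1.5):
    """Percentage of points beyond k * IQR from the quartiles (box-plot rule, assumed here)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    fence = k * (q3 - q1)
    n_out = sum(1 for x in values if x < q1 - fence or x > q3 + fence)
    return 100.0 * n_out / len(values)


# Synthetic, deterministic samples (hypothetical data).
base = [statistics.NormalDist().inv_cdf((i + 0.5) / 200) for i in range(200)]
bimodal = [z - 3 for z in base] + [z + 3 for z in base]
heavy_tailed = base + [-9.0, -8.0, 8.0, 9.0, 10.0]

for name, sample in (("base", base), ("bimodal", bimodal), ("heavy_tailed", heavy_tailed)):
    print(name, round(ks_vs_normal(sample), 3), round(percent_outliers(sample), 2))
```

The statistic itself, rather than a p-value, is used here because, as noted above, significance tests become overly sensitive when thousands of cells are measured.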

A simple pair of metrics to indicate a nonhomogeneous response is the measure of maximum effect (efficacy, E_max) and the Hill slope (HS), which can be observed even in population averaged measurements but only in a dose-response format. Maximal effects that plateau below 100% could be indicative of differential response to treatment by subpopulations that should be investigated further. In a study looking at the response of a panel of breast cancer cell lines to various anticancer compounds with different mechanisms of action, Fallahi-Sichani et al.¹³⁶ suggest that during drug development where the aim is to understand variability in patient response, E_max and HS are more informative than simply looking at potency. A shallow HS in the concentration-response curve was shown to be correlated with high cell-to-cell variability in target inhibition. This variability could be the result of fluctuations of target amount, activity, or other interactions of the target in different cells. Interestingly, in that study, it was noted that inhibitors of the mTOR pathway, which is subject to complex feedback regulation and potentially a high degree of heterogeneity, had the lowest HS values. While E_max and HS may be useful as indicators of heterogeneity, alone they provide no specific information about the nature of the heterogeneity.
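
To illustrate how population heterogeneity can surface in E_max and HS, the sketch below averages Hill-type responses over hypothetical subpopulations. It is a toy model under stated assumptions: the subpopulation fractions, EC₅₀ values, and slopes are invented, and the apparent slope is estimated from the standard Hill relation EC₉₀/EC₁₀ = 81^(1/HS) rather than by curve fitting.

```python
import math


def hill_response(conc, ec50, hill_slope, e_max=1.0):
    """Fractional effect of one homogeneous subpopulation under the Hill equation."""
    if conc <= 0:
        return 0.0
    return e_max * conc ** hill_slope / (ec50 ** hill_slope + conc ** hill_slope)


def population_response(conc, subpopulations):
    """Population-averaged effect; each entry is (fraction, ec50, hill_slope, e_max)."""
    return sum(f * hill_response(conc, ec50, hs, e_max)
               for f, ec50, hs, e_max in subpopulations)


def conc_at_fraction_of_plateau(fraction, subpopulations, lo=1e-9, hi=1e9):
    """Bisection in log space for the concentration giving `fraction` of the plateau."""
    plateau = sum(f * e_max for f, _, _, e_max in subpopulations)
    target = fraction * plateau
    for _ in range(200):
        mid = math.sqrt(lo * hi)
        if population_response(mid, subpopulations) < target:
            lo = mid
        else:
            hi = mid
    return math.sqrt(lo * hi)


def apparent_hill_slope(subpopulations):
    """Apparent slope of the averaged curve from its EC10 and EC90."""
    ec10 = conc_at_fraction_of_plateau(0.1, subpopulations)
    ec90 = conc_at_fraction_of_plateau(0.9, subpopulations)
    return math.log(81.0) / math.log(ec90 / ec10)


# Hypothetical populations (all parameter values are assumptions).
homogeneous = [(1.0, 1.0, 2.0, 1.0)]
partial_responder = [(0.7, 1.0, 2.0, 1.0), (0.3, 1.0, 2.0, 0.0)]  # 30% nonresponding cells
mixed_sensitivity = [(0.5, 0.1, 2.0, 1.0), (0.5, 10.0, 2.0, 1.0)]  # EC50s differ 100-fold

print(population_response(1e6, partial_responder))  # plateau of the averaged curve
print(apparent_hill_slope(homogeneous), apparent_hill_slope(mixed_sensitivity))
```

In this toy model a nonresponding subpopulation lowers the plateau of the averaged curve to the responding fraction, and a mixture of sensitivities yields an apparent slope shallower than that of either subpopulation. Neither value alone reveals which kind of heterogeneity is present, consistent with the caveat above.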

Another common approach to characterizing heterogeneity is the use of principal components analysis (PCA) to reduce the dimensionality of multiparameter data followed by segmentation of the population using a Gaussian mixture model (GMM).^61,68,88 When there are clear subpopulations, GMM can be a powerful approach to quantifying the relative size of subpopulations and the movement of cells between subpopulations in response to treatment. However, this approach is not conducive to automated, high-throughput applications.

An alternative is the direct analysis of the shape of the distributions of cellular phenotypes, without assuming some number of discrete subpopulations. In this method, the distributions are characterized and compared using three indices that describe the diversity, normality, and percent outliers in the distribution. Together, the quadratic entropy, the norm-KS test, and the percent outliers are referred to as the Pittsburgh heterogeneity indices (PHI) and can be used to quantify heterogeneity.^4,37 This approach is broadly applicable, can be used to compare data between laboratories and methods, can be incorporated in existing cell analysis software packages, and is able to identify differential sensitivity of individual cells to compound exposure. The University of Pittsburgh is presently working with one of the suppliers of data analysis packages to incorporate the PHI as a standard approach to the quantitation of population heterogeneity and will also provide an R-script to calculate the PHI on the University of Pittsburgh Drug Discovery Institute website.¹³⁷

QC Metrics for Characterizing the Reproducibility of Population Distributions

An important question in the analysis of heterogeneity is reproducibility from day to day, week to week, or even month to month. Analysis of heterogeneity in large-scale biology and drug discovery projects requires methods for validation of consistent cell-to-cell variability^4,37–39 and establishment of a quality control procedure to monitor reproducibility.³⁷ It is important to note that metrics such as the Z′ factor or the SSMD give no information about the consistency of the distributions in the wells.³⁷ Figure 4 illustrates a workflow for heterogeneity analysis that addresses the need for metrics and quality control. The suggested procedure follows the same principles used for quality control in screening and therefore integrates well with a standard screening protocol. The procedure adopts a new metric, the QC-KS (Fig. 4, steps 2 and 3), that uses the KS statistic to compare the distributions in the control wells on each plate to a set of reference distributions established during validation.³⁷ The QC-KS metric ensures that the shape of the control distributions is consistent throughout the project.
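
The idea of comparing control-well distributions with a reference can be sketched with a two-sample KS statistic, as below. This is an illustration rather than the published QC-KS procedure: the function names, the well labels, the synthetic data, and the acceptance threshold of 0.1 are all assumptions, and in practice the threshold would be set during assay validation.

```python
import bisect


def ks_two_sample(sample_a, sample_b):
    """Two-sample KS statistic: largest gap between the two empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    d = 0.0
    for x in a + b:
        fa = bisect.bisect_right(a, x) / len(a)
        fb = bisect.bisect_right(b, x) / len(b)
        d = max(d, abs(fa - fb))
    return d


def qc_ks_report(reference, control_wells, threshold):
    """Return {well: (ks_statistic, passed)} for each control well against the reference."""
    report = {}
    for well, values in control_wells.items():
        ks = ks_two_sample(reference, values)
        report[well] = (ks, ks <= threshold)
    return report


# Hypothetical reference distribution and two control wells from a later plate.
reference = list(range(100))
control_wells = {
    "A1": [v + 1 for v in reference],   # nearly unchanged distribution
    "A2": [v + 40 for v in reference],  # strongly shifted distribution
}

for well, (ks, passed) in qc_ks_report(reference, control_wells, threshold=0.1).items():
    print(well, round(ks, 2), "pass" if passed else "flag")
```

A well that fails such a check would prompt review of the plate before heterogeneity indices from its sample wells are interpreted.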

Figure 4.

A workflow for quantitation of heterogeneity. The quantitative analysis of biological heterogeneity requires assay validation and quality control similar to a screen but with the addition of quality control methods and metrics for ensuring the reproducibility of the population distributions. After establishing the assay SOP (1), one approach is to establish a reference distribution while characterizing assay performance (2). The reference distribution is used throughout the project (3) to track the population distributions in the control wells. Once the consistency of the assay has been established, heterogeneity metrics can be applied to dissect the heterogeneity (4) and interactive analysis and visualization tools used to examine filtered or clustered distributions (5). Selected distributions can then be analyzed with various models and used to guide interpretations or drive the next experiments (6). KS, Kolmogorov-Smirnov; QE, quadratic entropy; QC, quality control; S/B, signal to background; SOP, standard operating procedure.

Informatics Tools for Evaluating, Visualizing, and Comparing Population Distributions in Biological Data

The analysis of heterogeneity presents a major opportunity to enhance our understanding of biological systems. Extracting insights from the heterogeneity in cell-based experiments requires informatics tools to support visualization and analysis of population distributions. Visualization of the distribution of data is most often the initial evidence of heterogeneity in a set of measurements. However, the application of heterogeneity metrics is expected to be a more reliable, quantitative, and objective indication of heterogeneity. Selection of the optimal visualization tools often depends on the type of data or the data distribution. For example, histograms are useful for univariate data while scatterplot matrices or density plots are more useful for multivariate data.¹³⁸ Visualization not only provides some immediate understanding of the nature of the variation in phenotypes but guides the selection of analysis approaches. Informatics tools for heterogeneity analysis can be categorized as interactive visualization tools for “drilling down” into distributions; modeling tools for clustering, classification, and pathway modeling; general-purpose tools that combine visualization and modeling; and application-specific tools that are customized to the specific data source.

Drilling Down into the Distributions

Whatever the initial method for detecting heterogeneity, there is a need for data exploration tools that provide general mathematical and statistical functions along with interactive visualization. Optimally, these tools would also provide a means to incorporate heterogeneity metrics. Figure 5 illustrates how six different patterns of heterogeneity for a single phenotype might appear in some standard visualizations. Figure 5A illustrates the six patterns as they might appear in an image, where color saturation or pseudocolor could be used to indicate variations in the phenotype and where a few outlier cells (depicted as stars) might exhibit a more extreme phenotype. While heterogeneity can be directly observed in images, it is difficult to assess and compare the extent of the heterogeneity or the presence or absence of outliers, except perhaps for a few extremes. As an initial evaluation of a distribution, a histogram like the ones in Figure 5B might be used. However, although the overall shape of the distributions is clear, and it is fairly easy to see whether the distribution is uni- or multimodal and whether it is reasonably normal (micro-heterogeneous) or more complex (macro-heterogeneous), the presence and distribution of outliers are not easy to see. Figure 5C,D illustrates two plot types, the histo-box plot⁴ and the violin plot,¹³⁹ respectively, that combine the features of a histogram with a display of outliers similar to a box plot (Fig. 5E). Combining the histogram with the distribution of outliers provides a more detailed view of the heterogeneity in the sample data. Note that in the images and standard box plot, it is generally not possible to distinguish between micro- and macro-heterogeneity. Multidimensional scatterplots or density plots are also commonly used to visualize heterogeneity. It is relatively easy to visually pick out a cluster that represents a subpopulation in a scatterplot. Software tools for detailed analysis of distributions are available in a wide range of statistical and data visualization packages, including commercial and open-source packages described below.

Figure 5.

Visualization of patterns of heterogeneity in population samples. Patterns are described based on six general classes of heterogeneity on the horizontal axis. (A) Depiction of the various types of heterogeneity among cells as they might appear in an image. (B) Histograms with outliers depicted as individual points based on a standard box plot (“Histo-box plot”⁴). (C) Traditional histograms. (D) “Violin plots,”¹³⁹ essentially double-sided histograms. (E) A standard box plot.

General-Purpose Informatics Tools

Currently, many general-purpose data analysis tools can be used to implement metrics and visualizations for heterogeneity analysis. Commercial software like Matlab (Mathworks, Natick, MA) and open-source software like R are programmable and provide large archives of user-contributed functions. In addition, some commercial programs like Spotfire (Tibco Software), primarily a data visualization tool with some statistical analysis functions, provide an interface for incorporating R or Matlab scripts into the analysis.^4,37 Commercial statistical analysis packages such as SAS/JMP (SAS Institute, Cary, NC), SPSS (SPSS, Inc., an IBM Company, Chicago, IL), and Minitab (Minitab, State College, PA) all have many functions to characterize and visualize distributions of data.

The development of dedicated tools for assessing heterogeneity is motivated by two needs. First, the resources described above are very powerful and flexible but generally require some training to use. This raises the cost of adoption (in terms of the effort required to analyze data) and therefore limits acceptance and general use by researchers. Second, analyses built with these resources tend to become highly individualized solutions, resulting in numerous methods for quantifying heterogeneity and making comparisons across systems or studies difficult. In this regard, some universal definitions of heterogeneity and standard practices, such as the workflow in Figure 4, will help develop a general appreciation of, and consensus on, when heterogeneity analysis is suggested or even required for interpreting an experiment.

Machine Learning: Clustering Data and Classifying Subpopulations

Although implicit in much of the discussion above, it becomes important at this point to recognize that heterogeneity results from multiple signaling or metabolic effects.¹⁴⁰ Generally, these may be measured at the same time, thus providing some opportunity to explore the complex influence of heterogeneity and interactions between networks and signals. In this regard, combining multiparameter experimental and computational methods with detailed analysis of heterogeneity is necessary to understand the highly dynamic mechanisms that control cell plasticity and fate.¹⁴¹ Much of the work in this area incorporates methods for clustering and classifying multiparametric flow cytometry, HCS and transcriptional profiling data, and general methods for machine learning derived from ecology, business intelligence, and other fields.¹⁴²

Statistical measures such as KS distance can be used to quantitatively compare distributions of a single biomarker, for example, with respect to a reference distribution.^143–145 Although each cell can be simply described using the levels of one or more biomolecules, the abundance of data collected from phenotyping experiments allows much more detailed descriptions. Often biomarker levels are transformed into derived features, thereby amplifying the separation between distinct subpopulations that are identified using machine learning approaches. Image data allow calculating higher moments (variance, skewness, etc.) of intracellular biomarker levels, as well as morphological features, including shapes of cellular compartments or standard texture features such as Haralick or Zernike features.¹⁴⁶ A cell that is imaged using three-channel immunofluorescence (IF) can easily be described as a vector of hundreds of derived features,^61,68,147 and this space can be reduced using PCA, t-distributed stochastic neighbor embedding (t-SNE), or other methods.^61,147–149 Subpopulations within the selected feature space can be identified by clustering using standard methods like K-means¹⁵⁰ or hierarchical agglomerative^144,151 clustering or by fitting the data to distributions of known form, such as GMMs.^61,88 Quantitatively defined cellular phenotypes are useful for training classifiers¹⁵ and represent the first step toward constructing mechanistic models to explore the biochemical origins of heterogeneity.^152–154

Application-Specific Tools

Many data acquisition systems such as flow cytometry, mass cytometry, and HCS come with advanced but proprietary tools for visualization and analysis of the data. In some applications, third parties provide additional commercial and open-source software tools. The establishment of standard metrics for heterogeneity would encourage manufacturers to incorporate those metrics into their proprietary software tools, facilitating the analysis. Meanwhile, open-source software presents the most immediate opportunity for integration of heterogeneity metrics. For flow cytometry, open-source data analysis tools include the Bioconductor¹⁵⁵ packages iFlow¹⁵⁶ and OpenCyto,¹⁵⁷ as well as FlowCytometryTools,¹⁵⁸ a Python package. For HCS data analysis, open-source options include CellProfiler Analyst,¹⁵⁹ HCS-Analyzer,¹⁶⁰ KNIME,¹⁶¹ and OMERO.¹⁶² High dimensional data, such as that produced by mass cytometry and hyperplexed fluorescence imaging, present some unique challenges for visualization and heterogeneity analysis, for which tools are being developed, including viSNE,¹⁶³ which has been integrated into a workflow for discovery and characterization of cell subsets.¹⁶⁴

Current Application of Heterogeneity Analysis in Drug Discovery

Drug Discovery and Development

The development of disease-relevant models and assays begins with the analysis of disease and normal patient samples to identify suitable biomarkers and assay readouts, as well as to characterize the organization and heterogeneity profiles of the selected biomarkers. Physiologically relevant models of the disease state, such as 3D tissue models and organs-on-chips, should recapitulate the architecture of the normal and disease tissues, including multiple cell types, which optimally will also recapitulate the tissue heterogeneity.⁶

In a screening campaign to identify compounds for drug development, heterogeneity indices (HIs) would be reported alongside the compound potency and assay performance statistics, including a heterogeneity QC metric. Compound concentrations at which the HIs exceed thresholds established during assay development would be flagged as showing significant heterogeneity in the response. In drug development, compounds exhibiting macro-heterogeneity would need to be further studied, perhaps starting with histo-box plots for the dose series. Compounds exhibiting heterogeneity within a defined population (e.g., subpopulation of cells targeted for therapy development) present two options: (1) deprioritize in favor of compounds that modulate the cell population more uniformly or (2) select the compounds with complementary efficacy in subpopulations for use in a combination therapy strategy. The objective of monitoring heterogeneity in secondary assays should be to make more informed decisions in selecting compounds to advance through drug development by identifying potential differences in mechanism of action (MOA) among lead compounds. To the latter point, the distribution of cell responses affects the interpretation of drug activity.

The objective of phenotypic drug discovery is to identify compounds that can revert the disease phenotype to the normal phenotype. These clinical phenotypes are represented in the assay by the negative and positive control samples, respectively. Profiling the changes in the distributions with compound treatment in a screen provides insight into the MOA. This is illustrated in Figure 6A,B, where the concentration response profiles for the inhibition of STAT3 activation by pyridone-6 (a pan kinase inhibitor) and Stattic (an SH2 binding domain inhibitor) are different, consistent with their different MOA.

Figure 6.

The shape of a dose-response curve can be influenced by the underlying distributions of measurements at each dose. The distinctive transitions in the populations may indicate different biological processes. (A) Histo-box plots of pyridone-6 inhibition of interleukin-6 (IL-6)–activated STAT3 shows a gradual inhibition with increasing concentration indicating differential sensitivity of the cells. The mean (white bar) and median (black bar) are shown on the distributions. The negative control (red) and the positive control (green) are shown for reference. The green horizontal line is 3 standard deviations above the mean of the positive control representing the cutoff between cells with and without activated STAT3. The blue arrow indicates the conventional IC₅₀, while the red arrow indicates the concentration at which 50% of the cells are inhibited. (B) Histo-box plots of the inhibition by Stattic show a much steeper inhibition, indicating a more uniform population response, even though the cells at each dose show a variable sensitivity. (C) Dose-response curves for pyridone-6 inhibition calculated based on the population average (blue) or the percentage of cells that were inhibited (red).

Analysis of the distributions in response is also important in establishing an optimal assessment of compound activity. If the goal of the screen is to identify compounds that bring the population to a state equivalent to the positive control, then the distribution of the positive control should be used to establish relevant criteria for identifying cells that have reached that state. For example, cells within 3 SDs of the mean positive control response could be classified as positive. It is usually assumed that the IC₅₀ derived from the population average measures indicates the concentration at which the population has been induced (or inhibited) halfway to the positive control state. IC₅₀s calculated on well-averaged data represent the point at which the signal drops 50% between the negative and positive controls. This calculation does not indicate if the signal in all of the cells was reduced by 50% (which would be a homogeneous response) or, for example, if all of the signal in only half of the cells was reduced (which would be a heterogeneous response). Cell-level analysis allows for the detection of heterogeneity and an assessment of when 50% of the cells have reverted to the positive control state (such as within 3 SDs of the positive control population). The blue arrow in Figure 6A indicates the IC₅₀ calculated using the well-averaged signal, and the red arrow indicates the point at which 50% of the cells have reached the positive control state. This calculation considers heterogeneity in cell response. As shown in Figure 6C, analysis of the pyridone-6 dose dependence of the distribution of cells revealed that the concentration required to induce half of the cells into the positive control state (red curve), which may be a more relevant measure of the IC₅₀, is 2- to 10-fold higher than the population-averaged IC₅₀ (blue curve). Furthermore, the degree of rightward shift from the population-averaged IC₅₀ can vary depending on the complexity of the transition profile. In the case of Stattic, the steep dose response leads to similar results for the average and the percent inhibited, while shallower curves result in a significantly greater differential.
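
The distinction between a population-averaged IC₅₀ and the concentration at which half of the cells reach the positive control state can be illustrated with a toy simulation. Everything in the sketch below is assumed for illustration: each cell follows a simple Hill-type inhibition curve with its own sensitivity, the signal is scaled so that the negative control is 1 and the positive control is 0, and a cell is counted as inhibited when its signal falls to 0.1 or below (standing in for a cutoff derived from the positive control distribution).

```python
import math


def cell_signal(conc, k, hill=1.0):
    """Remaining signal of one cell (1 = negative control, 0 = positive control)."""
    return 1.0 / (1.0 + (conc / k) ** hill)


def mean_signal(conc, sensitivities, hill=1.0):
    """Well-averaged signal across all cells."""
    return sum(cell_signal(conc, k, hill) for k in sensitivities) / len(sensitivities)


def fraction_inhibited(conc, sensitivities, cutoff, hill=1.0):
    """Fraction of cells whose signal is at or below the positive-control cutoff."""
    hits = sum(cell_signal(conc, k, hill) <= cutoff for k in sensitivities)
    return hits / len(sensitivities)


def crossing(increasing_fn, target, lo=1e-6, hi=1e6, steps=200):
    """Bisection in log space for the lowest concentration where fn reaches target."""
    for _ in range(steps):
        mid = math.sqrt(lo * hi)
        if increasing_fn(mid) < target:
            lo = mid
        else:
            hi = mid
    return hi


# Hypothetical population: 101 cells with sensitivities spread over two decades.
sensitivities = [10 ** (-1 + 2 * i / 100) for i in range(101)]
cutoff = 0.1  # assumed threshold for the positive control state

ic50_average = crossing(lambda c: 1.0 - mean_signal(c, sensitivities), 0.5)
ic50_cells = crossing(lambda c: fraction_inhibited(c, sensitivities, cutoff), 0.5)

print(ic50_average, ic50_cells, ic50_cells / ic50_average)
```

With these assumed parameters the cell-based value lies roughly 9-fold to the right of the population-averaged IC₅₀. The size of the shift is a property of the toy parameters (per-cell slope and cutoff) and is not a prediction for any particular compound.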

Finally, it is important to follow the heterogeneity profile while investigating the SAR in the lead optimization stage to ensure that changes in the compound structure do not introduce additional or undesirable heterogeneity in the response, implying altered mechanisms of action. Furthermore, the heterogeneity profile can provide a more sensitive determination of changes in compound potency and therefore be used in combination with traditional measures of potency to help drive the SAR of a lead series toward a “normal” profile.⁴

Insights from Heterogeneity Analysis on Basic Biomedical Research and Drug Discovery

Cellular heterogeneity arises from biological networks and therefore provides insights into the network connectivity that can be used to guide selection of biomarkers.^9,165 Observations of individual cell behavior also provide information about the role of heterogeneity in cell differentiation, an essential component of tumor evolution,¹⁶⁶ as well as the transition from normal to disease cellular states.^88,166 Neglecting cell heterogeneity can lead to errors in disease classification.¹⁶⁶ When combined with computational models, the analysis of cell heterogeneity can be used to predict the responses of subpopulations of cells to drugs (e.g., cancer therapies).^9,61,65,167 For example, Johnston et al.¹⁶⁸ demonstrated using HCS that patient-specific and cell type–specific differences in the response of primary breast epithelial cell subpopulations to ionizing radiation were correlated with gene function. Furthermore, an analysis of fluctuations in the disease proteome, together with targeting of the proteins that contributed the most to the heterogeneity within a population, has been used to design combination therapy strategies.¹⁶⁹ These and other insights gained from heterogeneity analysis are expected to lead to a better understanding of the biology of disease and the design of more effective therapies.

Current Applications of Heterogeneity Analysis for Computational Pathology

Digital Pathology Enables Quantitative Analysis of Heterogeneity

Digital pathology typically uses transmitted light and/or fluorescence imaging for a comprehensive assessment of heterogeneity in tissues at the cellular and subcellular levels.^20,21,91 Recently, however, there has been increasing application of IMS to imaging tissue sections.^98,107,110 Subcellular resolution permits the identification of the activation state of specific biomarkers, such as translocation of transcription factors into the nucleus.⁵ In one study, quadratic entropy was used as a measure of diversity, called the HetMap, based on the pathologist’s scoring of individual cells in regions of interest in the tissue.⁷⁶ The HetMap was shown to be correlated with discordant scoring between pathologists and therefore useful to identify more complex tissues that required more detailed analysis. However, the dependence on manual cell scoring limited the extent of the regions that could be analyzed and the objectivity of the analysis. Digital pathology enables a more objective and comprehensive assessment of heterogeneity in the tissues²⁰ and has been used to identify population and spatial heterogeneity in the overall abundance or activation of biomarkers,¹⁷⁰ as well as various microenvironment components, including immune cells.¹⁹

Importance of the Spatial Aspect of Heterogeneity in Tissue

For many malignancies, molecular and cellular heterogeneity is a prominent feature among tumors from different patients, between different sites of neoplasia in a single patient and within a single tumor.¹⁷¹ Intratumor heterogeneity involves phenotypically distinct clonal cell subpopulations and distinct cell types that comprise the tumor microenvironment (TME) or “tumor tissue system,” including local and bone marrow–derived stromal stem and progenitor cells, subclasses of immune inflammatory cells that are either tumor promoting or tumor killing, cancer-associated fibroblasts, endothelial cells, and pericytes.^4,22,172,173 The TME can be viewed as an evolving ecosystem where cancer cells engage in heterotypic interactions with these other cell types and use available resources to proliferate and survive.^72,74 Consistent with this perspective, the spatial relationships among the cell types within the TME (i.e., spatial heterogeneity) appear to be one of the main drivers of disease progression and therapy resistance.^73,75,174 Thus, it is imperative to define the spatial heterogeneity within the TME to properly diagnose the specific disease subtype and identify the optimal course of therapy for individual patients.

Intratumor heterogeneity has been explored using three major approaches. The first approach is to take multiple core samples from specific regions of tumors and measure population heterogeneity within each core and spatial heterogeneity among the cores. The specific analyses include whole-exome sequencing,^175–179 epigenetics,¹⁸⁰ proteomics,^11,181 and metabolomics.¹¹ The second approach involves “single-cell analyses” using the above methods,^182,183 RNAseq,³³ microscope imaging,⁵⁷ or flow cytometry¹⁸⁴ following separation of the cells from the tissue. The third approach uses the spatial resolution of light microscope imaging or IMS, coupled with molecular-specific labels, to capture the spatial context of biomarkers in the cells.^21,22,185,186

Heterogeneity and Application of Image Statistics

A major challenge in digital pathology is to develop algorithms that quantify key spatial relationships (interactions or lack thereof) within the TME, based on images of panels of biomarkers. Figure 7A illustrates the spatial heterogeneity of cancer cells and stromal cells, including the migratory immune cells, within a tumor. Indeed, the spatial organization of cancer and noncancer cells in the TME has been hypothesized to be an important diagnostic¹⁸⁷ in addition to the expression level of the selected biomarkers.

Figure 7.

Canonical pointwise mutual information (PMI) maps depicting various forms of spatial intratumor heterogeneity. (A) Illustration of the heterogeneity in a tumor. (B) Cartoon representation of eight different cellular phenotypes based on high-dimensional biomarker intensity patterns acquired via pattern recognition algorithms. (C) A PMI map with strong diagonal entries and weak off-diagonal entries describes a globally heterogeneous but locally homogeneous tumor. In this example, the PMI map highlights locally homogeneous tumor microdomains containing cells of only one type each, phenotypes 2, 4, and 8, respectively. (D) On the contrary, a PMI map with strong off-diagonal entries describes a tumor that is locally heterogeneous. In this example, locally heterogeneous tumor microdomains exist, as portrayed by the off-diagonal entries. One domain contains phenotypes 1 and 5, another contains phenotypes 2 and 4, and yet another contains phenotypes 3 and 8. (E) PMI maps can also portray anti-associations (e.g., if phenotype 1 never occurs spatially near phenotype 3). The ensemble of associations and anti-associations of varying intensities along or off the diagonal represents the true complexity of tumor images in a format that can be summarized and interrogated. In this example, changing the distance threshold used in the PMI calculations has minor effects on the results. While increasing the distance tends to promote positive associations and decreasing the distance tends to increase negative associations, the effects are not significant and the overall conclusions regarding the heterogeneity remain the same. Figures B to E reprinted with permission from Spagnolo et al.²¹

To address this challenge, a method was developed to quantify intratumor spatial heterogeneity ( Fig. 7B–E ) of a single biomarker, as well as multiplexed or hyperplexed biomarkers. The method learns a set of dominant biomarker intensity patterns and maps the spatial distribution of the patterns with a network. The pairwise association statistics for the patterns are described using PMI^188,189 and visually represented as a 2D heat map. PMI is generalizable to spatial data from other in situ methods such as FISSEQ¹⁹⁰ and CyTOF⁹³ that sample multiple markers within the TME.
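The pairwise statistic can be illustrated with a minimal sketch. It is not the published implementation: it assumes that each cell has already been assigned an integer phenotype label, it replaces the network construction with a simple distance threshold, and it derives the marginal probabilities from the neighbor-pair counts. The function name `pmi_map` and the `radius` parameter are hypothetical, chosen here for illustration only.

```python
import numpy as np


def pmi_map(coords, labels, n_phenotypes, radius):
    """Pointwise mutual information between phenotypes of neighboring cells.

    coords: (n_cells, 2) positions; labels: integer phenotype per cell.
    radius: distance threshold defining "spatial neighbors" (an assumption here).
    """
    coords = np.asarray(coords, dtype=float)
    labels = np.asarray(labels, dtype=int)

    # Pairwise Euclidean distances by broadcasting (fine for small inputs).
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))

    # Ordered neighbor pairs (i, j), i != j, so the count matrix is symmetric.
    not_self = ~np.eye(len(coords), dtype=bool)
    i_idx, j_idx = np.nonzero((dist <= radius) & not_self)
    if i_idx.size == 0:
        raise ValueError("no neighboring cells within the given radius")

    counts = np.zeros((n_phenotypes, n_phenotypes))
    np.add.at(counts, (labels[i_idx], labels[j_idx]), 1)

    joint = counts / counts.sum()            # p(a, b) over neighbor pairs
    marginal = joint.sum(axis=1)             # p(a)
    expected = np.outer(marginal, marginal)  # p(a) * p(b) if independent

    # log2(0) gives -inf (pair never observed); 0/0 gives nan (phenotype absent).
    with np.errstate(divide="ignore", invalid="ignore"):
        return np.log2(joint / expected)


# Two spatially separated microdomains, each containing a single phenotype.
segregated = pmi_map(
    coords=[(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)],
    labels=[0, 0, 0, 1, 1, 1],
    n_phenotypes=2,
    radius=2.0,
)

# Two phenotypes alternating along a line, so every neighbor is of the other type.
intermixed = pmi_map(
    coords=[(0, 0), (1, 0), (2, 0), (3, 0)],
    labels=[0, 1, 0, 1],
    n_phenotypes=2,
    radius=1.0,
)
```

In the segregated example the diagonal entries are positive (1 bit) and the off-diagonal entries are negative infinity, the pattern described for a globally heterogeneous but locally homogeneous tumor. In the intermixed example the pattern is reversed, with positive off-diagonal entries, as described for a locally heterogeneous tumor.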

Other methods applied to the characterization of heterogeneity in tumors have used region of interest sampling but without a network-based approach or taking advantage of multiplexed data,⁷⁶ have characterized multiplexed cell phenotype associations within the tumor but not the underlying spatial organization,⁹ or have analyzed linear relationships between biomarkers in multiplexed/hyperplexed IF data without considering nonlinear associations or spatial information.¹⁹¹ The PMI method uses both the expression and spatial information of an entire tumor tissue section and/or spot in a TMA to characterize spatial associations of both major and minor subpopulations as a 2D heterogeneity map. The characterization of intratumor spatial heterogeneity by the PMI is expected to become an important diagnostic biomarker for cancer progression, proliferation, and response to therapy and to uncover key interactions in the TME that contribute to disease proliferation and progression.²¹

Insights from Spatial Heterogeneity Analysis in Pathology Samples

Non-cell-autonomous interactions often govern cell fate decisions and consequently play a major role in complex biological processes.^72,74 Spatial heterogeneity reflects these heterotypic signaling and extracellular matrix reorganization networks. Given the role that TME interactions have in tumorigenesis and metastasis, it may well be expected that spatial genetic heterogeneity can be correlated with poor long-term patient outcome, as exemplified in HER-2–positive breast cancer.⁷³ Several groups have developed computational strategies to infer spatial reconstruction of single-cell RNAseq data from dissociated cells by integrating single-cell expression data with in situ RNA patterns in developing mouse and zebrafish embryos.^192–194 By integrating these computational strategies with combinatorial fluorescence in situ hybridization approaches such as SeqFISH and MERFISH,^195,196 it may be possible to spatially reconstruct single-cell data derived from tissues such as tumors where, in contrast to embryos, there is no guarantee of reproducible spatial patterning.

Despite the valuable information that can be generated from these powerful approaches focused on single-cell analysis, they cannot account for perturbation of the signaling state of an individual cell or a biased recovery of specific cell types during single-cell dissociation from bulk tissue. In addition, analysis of cell lysates precludes resolution of subcellular spatial heterogeneity of RNAs and proteins and their associated complexes and networks. Platforms that integrate optical or mass spectrometry imaging with subcellular resolution have great potential to connect spatial and population heterogeneity with cell state, function, and communication.^77,92,93,197 These in situ approaches are compatible with FFPE biopsies and represent transformative computational pathology platforms aimed at optimizing diagnosis and treatment for individual patients.

Outlook for Heterogeneity Analysis in Biomedical Research

Basic Biology

The presence of heterogeneity in biological systems has been demonstrated and discussed in many publications, but the functional roles for heterogeneity are just beginning to be elucidated. As an example, recent technological advances in lineage tracing and specific subpopulation ablation, using inducible genetic labeling^198,199 in conjunction with in vivo imaging,²⁰⁰ have provided evidence for the role of dynamic cell population heterogeneity in the regulation of cell fate decisions intrinsic to processes, including differentiation, proliferation, and tumorigenesis.^3,17,71,72 Hyperplexed measurements with single-cell resolution using flow cytometry (e.g., transcriptome profiling, mass cytometry²⁰¹) coupled with machine learning algorithms have been used to circumvent averaging artifacts of bulk population measurements (i.e., Simpson’s paradox¹²⁵), enable the reconstruction of complex cellular hierarchies of differentiation, reveal rare cell states, and identify novel regulators.^113,127
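The averaging artifact referred to above can be made concrete with a small worked example. The marker values below are hypothetical and were chosen only so that the effect is easy to see: within each of two subpopulations the two markers are anticorrelated, yet the pooled population, which is what a bulk measurement reflects, shows a positive correlation.

```python
import numpy as np

# Hypothetical single-cell readouts of two markers in two subpopulations.
subpop_1 = {"x": np.array([1.0, 2.0, 3.0]), "y": np.array([3.0, 2.0, 1.0])}
subpop_2 = {"x": np.array([6.0, 7.0, 8.0]), "y": np.array([8.0, 7.0, 6.0])}


def pearson_r(x, y):
    """Pearson correlation coefficient between two 1D arrays."""
    return float(np.corrcoef(x, y)[0, 1])


r_within_1 = pearson_r(subpop_1["x"], subpop_1["y"])
r_within_2 = pearson_r(subpop_2["x"], subpop_2["y"])

# Pooling the cells, as a bulk measurement effectively does, mixes the two groups.
pooled_x = np.concatenate([subpop_1["x"], subpop_2["x"]])
pooled_y = np.concatenate([subpop_1["y"], subpop_2["y"]])
r_pooled = pearson_r(pooled_x, pooled_y)

print(f"within subpopulation 1: r = {r_within_1:+.2f}")
print(f"within subpopulation 2: r = {r_within_2:+.2f}")
print(f"pooled population:      r = {r_pooled:+.2f}")
```

Both within-subpopulation correlations are -1, whereas the pooled correlation is positive (about 0.8), so the trend seen in bulk is the opposite of the trend in every subpopulation.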

Pluripotent stem cells are a platform with tremendous potential for the development of patient-specific disease models, for modeling biological development, and for regenerative medicine. However, stem cells exhibit heterogeneity on several levels: in the functional capacity to differentiate, in messenger RNA (mRNA) expression profiles, and in epigenetic and genetic states.²⁰² Studies of differentiating stem cells have found that heterogeneity reflects the presence of an evolving mixture of phenotypically distinct subpopulations, consistent with a hypothesis that differentiating cells transit through multiple robust and discrete phenotypic states.^66,88,203 Improved understanding and manipulation of the differentiation of stem cells require tools to reliably characterize and monitor the evolution of these subpopulations and their associated phenotypes.

The maintenance and repair of cycling adult tissues usually rely on the turnover of a small population of adult stem cells that possess the ability to self-renew, giving rise to differentiated progeny while maintaining their number.^3,204,205 Tissue homeostasis can be achieved only when the rates of stem cell proliferation and differentiation are balanced. Fate asymmetry can occur at the level of a single stem cell involving asymmetric segregation of fate determinants during cell division, leading one cell to follow a differentiation pathway and the other to stay in the stem cell compartment.^3,204 Alternatively, fate asymmetry can be achieved at the population level where differentiation of one stem cell is compensated for by the symmetric division of a neighboring stem cell.^3,204 In this case, it is only the population that persists, whereas the life span of any individual stem cell is not defined. Although either of these alternative models can be induced by intrinsic and extrinsic factors, each nevertheless suggests distinct regulatory mechanisms and therefore a need to identify and monitor these subpopulations of cells.^3,205
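The population-level model can be sketched as a toy simulation. This is an illustration under stated assumptions, not a model taken from the cited studies: the niche is treated as a ring of fixed size, each event removes one randomly chosen stem cell by differentiation, and one of its two neighbors divides symmetrically to fill the vacancy. The function name and parameters are hypothetical.

```python
import random


def simulate_population_asymmetry(n_cells, n_events, seed=0):
    """Toy ring-shaped niche: each event replaces one lost stem cell
    with the daughter of a neighboring stem cell."""
    rng = random.Random(seed)
    niche = list(range(n_cells))        # clone label of the cell in each position
    clones_over_time = [len(set(niche))]
    for _ in range(n_events):
        lost = rng.randrange(n_cells)                        # this cell differentiates
        neighbor = (lost + rng.choice((-1, 1))) % n_cells    # this one divides symmetrically
        niche[lost] = niche[neighbor]
        clones_over_time.append(len(set(niche)))
    return niche, clones_over_time


niche, clones_over_time = simulate_population_asymmetry(n_cells=8, n_events=200)
print("stem cells in niche:", len(niche))
print("surviving clones:", clones_over_time[0], "->", clones_over_time[-1])
```

By construction the number of stem cells never changes, while the number of distinct clones can only stay constant or fall. The population persists even though no individual lineage is guaranteed to.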

Recent studies of intestinal maintenance,²⁰⁶ mammalian spermatogenesis,²⁰⁷ and hair follicle cycling²⁰⁸ employing genetic lineage tracing and in vivo imaging suggest a more flexible organization in which long-term self-renewal potential, fate, and proliferative activity may be modulated by location within specific microenvironments (e.g., stem cell niche) and by dynamic changes in transcriptional activity that are often induced epigenetically.^3,198,205,209,210 In this model, stem cells form a dynamically heterogeneous pool in which cells may transfer reversibly among states of variable survival and fate potential.^3,205 In addition, progenitors that are normally committed to differentiation may reacquire (through dedifferentiation) long-term self-renewal potential following exposure to niche factors. Such flexibility may strengthen the resilience of tissues to crisis and injury, enabling the population of differentiating progeny to function as a stem cell reserve.^3,205 Thus, the heterotypic signaling between stem cells and the niche, likely to be symbiotic,²⁰⁴ indicates the important regulatory role of spatial heterogeneity in tissue homeostasis.²⁰⁵ Several studies also suggest a reversible transfer of stem cells between an active and quiescent state.³ This manifestation of dynamic heterogeneity may provide a robust mechanism to maintain a stem cell pool such that the overall turnover rate of the tissue is steady but slow, particularly in the context of aging.³ Perhaps equally important, a dormant state within a cycling tissue may provide an insurance mechanism to protect the wider population from the stressful demands of active cell cycling, ensuring the long-term integrity of the tissue.³ Such behavior would mirror the strategy of phenotypic switching observed in bacterial populations.²¹¹

Study of Cell Signaling Networks/Pathways

Throughout this perspective, a recurring theme has been that heterogeneity both reflects and influences cellular networks and therefore encodes a wealth of basic biological information that can be extracted with systems modeling techniques. Only recently has there been a definitive push to understand phenotypic heterogeneity through systems modeling, revealing the role of “noise” and cell-to-cell variability in cellular systems organization.^212,213 Mechanistic models²¹⁴ that represent the chemical underpinnings of the cell are easy to interpret in terms of basic molecular principles, but the trade-off for these insights is the effort required to assemble and parameterize them.^215,216 Simply identifying the correct network topology poses a challenge, as network topology may vary by cell type²¹⁷ and inconsistencies exist among curated databases of molecular interactions.²¹⁸

Computational modeling studies have shown that phenotypic heterogeneity in apoptosis depends more on extrinsic factors than on intrinsic differences in cells.^213,219,220 Modeling also suggests that spatial heterogeneity influences tumor aggression. Heterogeneous environments may provide safe havens within which resistant tumors can flourish,²²¹ and spatial heterogeneity promotes immunosuppressive signaling in the TME.²²² Incorporating heterogeneity into models of cell signaling networks will be a key to understanding the details of how specific pathway activity drives cellular heterogeneity and how heterogeneity affects the regulation of the network.

Drug Discovery—Example of Cancer

Darwinian-like clonal evolution in tumors significantly contributes to the observed phenotypic diversity,^7,75,173 as do epigenetic changes^7,75,153 and heterotypic signaling in the TME.^173,223 This diversity and plasticity present a major challenge to the development of therapeutic regimens, as the targeting of a predominant tumor subpopulation often only provides transient benefit that will inevitably result in the emergence of resistant populations and relapse.²²⁴ However, recent studies suggest that knowledge of the tumor composition and the response of heterogeneous subpopulations to single drugs, in conjunction with computational and experimental modeling, can identify drug combinations that minimize the outgrowth of resistant subpopulations in tumors while enhancing tumor-free survival in mice.^225,226 Importantly, the experimentally validated simulations demonstrated that the prediction of the optimal drug combination required the analysis of multiple tumor subpopulations, not just a particular subpopulation. Over time, the role of these models in developing treatments will increase, and the “one target one drug” paradigm will be replaced by strategies driven by quantitative systems pharmacology (QSP), where development is focused on rationally designed drug combinations.^6,227
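The argument for analyzing multiple subpopulations can be illustrated with a deliberately simple toy model. It is not the model used in the cited studies: it assumes two independent subpopulations that grow or decline exponentially, additive drug effects, and hypothetical rate constants and drug names ("A" and "B") chosen only for illustration.

```python
import numpy as np

# Hypothetical per-day rates; subpopulation 0 responds to drug A, subpopulation 1 to drug B.
GROWTH_RATE = np.array([0.05, 0.05])
KILL_RATE = {"A": np.array([0.15, 0.00]), "B": np.array([0.00, 0.15])}


def total_burden(initial_cells, drugs, days):
    """Total tumor cell number on each day 0..days under constant treatment."""
    net_rate = GROWTH_RATE.copy()
    for drug in drugs:
        net_rate = net_rate - KILL_RATE[drug]   # assumes additive, independent drug effects
    t = np.arange(days + 1)
    per_subpop = np.asarray(initial_cells, dtype=float) * np.exp(np.outer(t, net_rate))
    return per_subpop.sum(axis=1)


initial = [990.0, 10.0]            # drug-A-sensitive cells dominate at diagnosis
only_a = total_burden(initial, ["A"], days=120)
combo = total_burden(initial, ["A", "B"], days=120)

print(f"drug A alone: nadir {only_a.min():.0f} cells, day 120 {only_a[-1]:.0f} cells")
print(f"A + B:        day 120 {combo[-1]:.3f} cells")
```

Under these assumptions, targeting only the predominant subpopulation shrinks the tumor transiently before the minor, drug-A-insensitive subpopulation outgrows the starting burden, whereas the combination drives both subpopulations down. A prediction based on the dominant subpopulation alone would miss the relapse.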

Precision Medicine—Example of Cancer

Intratumor genetic heterogeneity¹⁷¹ and its region-specific diversity,^175,179 reflecting genetic instability as an acquired hallmark of cancer,¹⁷³ have been well studied. Darwinian forces of evolution, however, act on heritable phenotypes and not genotypes per se.⁷² Although historically challenging to study in patients, recent studies employing genetic models and novel in situ single-cell imaging methods have begun to shed light on phenotypic heterogeneity that arises from environmental selection pressures in the tumor.^73,75 Positive interactions among distinct clonal subpopulations have been observed and can be thought of as one of the major drivers of persistent intratumor heterogeneity.^74,228 These types of collaborative interactions support the possibility that instead of a clonal population accumulating all the necessary mutations that enable it to acquire the hallmarks of cancer, a time-consuming and inefficient process, several cooperating partially transformed subclones may circumvent full transformation and thus accelerate tumor progression.^72,229 In situ single-cell analysis of primary tumors of HER-2–positive breast cancer patients receiving neoadjuvant chemotherapy indicated a therapy-induced spatial heterogeneity among clones that was associated with shorter disease-free survival following adjuvant therapy with trastuzumab.⁷³ In contrast, no such association was evident when the fraction of cells harboring resistance-conferring mutations (i.e., PIK3CA) or the overall cellular diversity changes before and after neoadjuvant treatment was considered.⁷³ Because long-term survival is largely defined by progression to metastatic disease, these results imply a potential role for spatial heterogeneity in the TME in selecting treatment-resistant cancer cells capable of migration and metastatic dissemination.⁷³

The results of the above study suggest that neoadjuvant chemotherapy prior to HER-2 targeted therapy may be contraindicated as it might promote treatment resistance. Furthermore, this study suggests the potential benefit of implementing in situ single-cell hyperplexing technologies with subcellular resolution^77,92,93 in conjunction with machine learning algorithms as a powerful diagnostic platform to identify targetable tumor dependencies resulting from heterotypic signaling networks (e.g., positively cooperating subclonal populations).⁷³ Thus, knowledge of functional phenotypic heterogeneity, in contrast to simply genetic heterogeneity, could be exploited to guide the design of precision therapeutic strategies.

Footnotes

Acknowledgements

We acknowledge the discussions with members of our laboratories, as well as other colleagues who influenced our thinking about heterogeneity.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported in part by funding from the University of Pittsburgh Cancer Institute (2P30 CA047904), the State of Pennsylvania (4100068731), and the National Institutes of Health (5UH3TR000503).

References

1. Huang. Non-Genetic Heterogeneity of Cells in Development: More Than Just Noise. Development 2009, 136, 3853–3862.

2. Gingold, J. A.; Coakley, E. S.; et al. Distribution Analyzer, a Methodology for Identifying and Clustering Outlier Conditions from Single-Cell Distributions, and Its Application to a Nanog Reporter RNAi Screen. BMC Bioinformatics 2015, 16, 225.

3. Krieger; Simons, B. D. Dynamic Stem Cell Heterogeneity. Development 2015, 142, 1396–1406.

4. Gough, A. H.; Chen; Shun, T. Y.; et al. Identifying and Quantifying Heterogeneity in High Content Analysis: Application of Heterogeneity Indices to Drug Discovery. PLoS One 2014, 9, e102678.

5. Gough; Lezon; Faeder, J. R.; et al. High-Content Analysis with Cellular and Tissue Systems Biology: A Bridge between Cancer Cell Biology and Tissue-Based Diagnostics. In The Molecular Basis of Cancer, Mendelsohn; Howley, P. M.; Israel, M. A.; et al., eds., Saunders/Elsevier: Philadelphia, PA, 2015, 4th ed., pp. 369–392.

6. Stern, A. M.; Schurdak, M. E.; Bahar; et al. A Perspective on Implementing a Quantitative Systems Pharmacology Platform for Drug Discovery and the Advancement of Personalized Medicine. J. Biomol. Screen. 2016, 21, 521–534.

7. Marusyk; Almendro; Polyak. Intra-Tumour Heterogeneity: A Looking Glass for Cancer? Nat. Rev. Cancer 2012, 12, 323–334.

8. Luria, S. E.; Delbruck. Mutations of Bacteria from Virus Sensitivity to Virus Resistance. Genetics 1943, 28, 491–511.

9. Steininger, R. J.; Rajaram; Girard; et al. On Comparing Heterogeneity across Biomarkers. Cytometry A 2015, 87, 558–567.

10. Ruiz; Luttgen, M. S.; et al. Limited Genomic Heterogeneity of Circulating Melanoma Cells in Advanced Stage Patients. Phys. Biol. 2015, 12, 016008.

11. Balluff; Frese, C. K.; Maier, S. K.; et al. De Novo Discovery of Phenotypic Intratumour Heterogeneity Using Imaging Mass Spectrometry. J. Pathol. 2015, 235, 3–13.

12. Shalek, A. K.; Satija; Shuga; et al. Single-Cell RNA-seq Reveals Dynamic Paracrine Control of Cellular Variation. Nature 2014, 510, 363–369.

13. Schwarz, R. F.; Trinh; Sipos; et al. Phylogenetic Quantification of Intra-Tumour Heterogeneity. PLoS Comput. Biol. 2014, 10, e1003535.

14. Haney, S. A. Rapid Assessment and Visualization of Normality in High-Content and Other Cell-Level Data and Its Impact on the Interpretation of Experimental Results. J. Biomol. Screen. 2014, 19, 672–684.

15. Loo, L. H.; Lin, H. J.; Steininger, R. J., III; et al. An Approach for Extensibly Profiling the Molecular States of Cellular Subpopulations. Nat. Methods 2009, 6, 759–765.

16. Spudich, J. L.; Koshland, D. E., Jr. Non-Genetic Individuality: Chance in the Single Cell. Nature 1976, 262, 467–471.

17. Bhang, H. E.; Ruddy, D. A.; Krishnamurthy Radhakrishna; et al. Studying Clonal Dynamics in Response to Cancer Therapy Using High-Complexity Barcoding. Nat. Med. 2015, 21, 440–448.

18. Rubin. Early Origin and Pervasiveness of Cellular Heterogeneity in Some Malignant Transformations. Proc. Natl. Acad. Sci. U. S. A. 1984, 81, 5121–5125.

19. Tan, K. L.; Scott, D. W.; Hong; et al. Tumor-Associated Macrophages Predict Inferior Outcomes in Classic Hodgkin Lymphoma: A Correlative Study from the E2496 Intergroup Trial. Blood 2012, 120, 3280–3287.

20. Heindl; Nawaz; Yuan. Mapping Spatial Heterogeneity in the Tumor Microenvironment: A New Era for Digital Pathology. Lab. Invest. 2015, 95, 377–384.

21. Spagnolo, D. M.; Gyanchandani; Al-Kofahi; et al. Pointwise Mutual Information Quantifies Intra-Tumor Heterogeneity in Tissue Sections Labeled with Multiple Fluorescent Biomarkers. J. Pathol. Inform., in press.

22. Critchley-Thorne, R. J.; Miller, S. M.; Taylor, D. L.; et al. Applications of Cellular Systems Biology in Breast Cancer Patient Stratification and Diagnostics. Comb. Chem. High Throughput Screen. 2009, 12, 860–869.

23. Prichard, J. W.; Davison, J. M.; Campbell, B. B.; et al. TissueCypher(™): A Systems Biology Approach to Anatomic Pathology. J. Pathol. Inform. 2015, 6, 48.

24. Mitchison, T. J. Small-Molecule Screening and Profiling by Using Automated Microscopy. Chembiochem 2005, 6, 33–39.

25. Giuliano, K. A.; DeBiasio, R. L.; Dunlay, R. T.; et al. High-Content Screening: A New Approach to Easing Key Bottlenecks in the Drug Discovery Process. J. Biomol. Screen. 1997, 2, 249–259.

26. Abraham, V. C.; Taylor, D. L.; Haskins, J. R. High Content Screening Applied to Large-Scale Cell Biology. Trends Biotechnol. 2004, 22, 15–22.

27. Kell, D. B.; Ryder, H. M.; Kaprelyants, A. S.; et al. Quantifying Heterogeneity: Flow Cytometry of Bacterial Cultures. Antonie van Leeuwenhoek 1991, 60, 145–158.

28. Davey, H. M.; Kell, D. B. Flow Cytometry and Cell Sorting of Heterogeneous Microbial Populations: The Importance of Single-Cell Analyses. Microbiol. Rev. 1996, 60, 641–696.

29. Edwards, B. S.; Sklar, L. A. Flow Cytometry: Impact on Early Drug Discovery. J. Biomol. Screen. 2015, 20, 689–707.

30. Keller, P. J.; Lin, A. F.; Arendt, L. M.; et al. Mapping the Cellular and Molecular Heterogeneity of Normal and Malignant Breast Tissues and Cultured Cell Lines. Breast Cancer Res. 2010, 12, R87.

31. Khan, I. A.; Lupi; Campbell; et al. Interoperability of Time Series Cytometric Data: A Cross Platform Approach for Modeling Tumor Heterogeneity. Cytometry A 2011, 79, 214–226.

32. Ambriz-Avina; Contreras-Garduno, J. A.; Pedraza-Reyes. Applications of Flow Cytometry to Characterize Bacterial Physiological Responses. Biomed. Res. Int. 2014, 2014, 461941.

33. Patel, A. P.; Tirosh; Trombetta, J. J.; et al. Single-Cell RNA-seq Highlights Intratumoral Heterogeneity in Primary Glioblastoma. Science 2014, 344, 1396–1401.

34. Wang; Bodovitz. Single Cell Analysis: The New Frontier in ‘Omics’. Trends Biotechnol. 2010, 28, 281–290.

35. Diercks; Kostner; Ozinsky. Resolving Cell Population Heterogeneity: Real-Time PCR for Simultaneous Multiplexed Gene Detection in Multiple Single-Cell Samples. PLoS One 2009, 4, e6326.

36. Buettner; Natarajan, K. N.; Casale, F. P.; et al. Computational Analysis of Cell-to-Cell Heterogeneity in Single-Cell RNA-Sequencing Data Reveals Hidden Subpopulations of Cells. Nat. Biotechnol. 2015, 33, 155–160.

37. Gough; Shun, T. Y.; Lansing Taylor; et al. A Metric and Workflow for Quality Control in the Analysis of Heterogeneity in Phenotypic Profiles and Screens. Methods 2016, 96, 12–26.

38. Singh; Bray, M. A.; Jones, T. R.; et al. Pipeline for Illumination Correction of Images for High-Throughput Microscopy. J. Microsc. 2014, 256, 231–236.

39. Bray, M.-A.; Fraser, A. N.; Hasaka, T. P.; et al. Workflow and Metrics for Image Quality Control in Large-Scale High-Content Screens. J. Biomol. Screen. 2012, 17, 266–274.

40. Wang, Y.-L.; Taylor, D. L. Fluorescence Microscopy of Living Cells in Culture, Part B: Quantitative Fluorescence Microscopy-Imaging and Spectroscopy, Academic Press: San Diego, 1989, Vol. 30.

41. Wang, Y.-L.; Taylor, D. L. Fluorescence Microscopy of Living Cells in Culture, Part A: Fluorescent Analogs, Labeling Cells, and Basic Microscopy, Academic Press: San Diego, 1988, Vol. 29.

42. Chakravarty; Bowman; Ecsedy, J. A.; et al. Developing Robust High Content Assays. In High Content Screening, John Wiley: New York, 2007, pp. 85–109.

43. Hoffman, R. A. Standardization, Calibration, and Control in Flow Cytometry. Curr. Protoc. Cytom. 2005, Chapter 1, Unit 13.

44. Schwartz; Marti, G. E.; Poon; et al. Standardizing Flow Cytometry: A Classification System of Fluorescence Standards Used for Flow Cytometry. Cytometry 1998, 33, 106–114.

45. Mittag; Tarnok. Basics of Standardization and Calibration in Cytometry: A Review. J. Biophotonics 2009, 2, 470–481.

46. Alvarez, D. F.; Helm; Degregori; et al. Publishing Flow Cytometry Data. Am. J. Physiol. Lung Cell Mol. Physiol. 2010, 298, L127–L130.

47. Aghaeepour; Finak; Hoos; et al. Critical Assessment of Automated Flow Cytometry Data Analysis Techniques. Nat. Methods 2013, 10, 228–238.

48. Aghaeepour; Jalali; O’Neill; et al. RchyOptimyx: Cellular Hierarchy Optimization for Flow Cytometry. Cytometry A 2012, 81, 1022–1030.

49. O’Neill; Jalali; Aghaeepour; et al. Enhanced flowType/RchyOptimyx: A BioConductor Pipeline for Discovery in High-Dimensional Cytometry Data. Bioinformatics 2014, 30, 1329–1330.

50. Brinkman, R. R.; Aghaeepour; Finak; et al. Automated Analysis of Flow Cytometry Data Comes of Age. Cytometry A 2016, 89, 13–15.

51. Van Gassen; Vens; Dhaene; et al. FloReMi: Flow Density Survival Regression Using Minimal Feature Redundancy. Cytometry A 2016, 89, 22–29.

52. Zhang, J.-H.; Chung, T. D. Y.; Oldenburg, K. R. A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays. J. Biomol. Screen. 1999, 4, 67–73.

53. Zhang, X. D. A Pair of New Statistical Parameters for Quality Control in RNA Interference High-Throughput Screening Assays. Genomics 2007, 89, 552–561.

54. Lilliefors, H. W. On the Kolmogorov-Smirnov Test for Normality with Mean and Variance Unknown. J. Am. Stat. Assoc. 1967, 62, 399–402.

55. Basak; Behar; Hoffmann. Lessons from Mathematically Modeling the NF-κB Pathway. Immunol. Rev. 2012, 246, 221–238.

56. Lee, R. E. C.; Walker, S. R.; Savery; et al. Fold Change of Nuclear NF-κB Determines TNF-Induced Transcription in Single Cells. Mol. Cell 2014, 53, 867–879.

57. Cohen, A. A.; Geva-Zatorsky; Eden; et al. Dynamic Proteomics of Individual Cancer Cells in Response to a Drug. Science 2008, 322, 1511–1516.

58. Sisan, D. R.; Halter; Hubbard, J. B.; et al. Predicting Rates of Cell State Change Caused by Stochastic Fluctuations Using a Data-Driven Landscape Model. Proc. Natl. Acad. Sci. U. S. A. 2012, 109, 19262–19267.

59. Chen; Zhuang; Lin; et al. New Horizons in Tumor Microenvironment Biology: Challenges and Opportunities. BMC Med. 2015, 13, 45.

60. Lloyd, M. C.; Cunningham, J. J.; Bui, M. M.; et al. Darwinian Dynamics of Intratumoral Heterogeneity: Not Solely Random Mutations but Also Variable Environmental Selection Forces. Cancer Res. 2016, 76, 3136–3144.

61. Slack, M. D.; Martinez, E. D.; Wu, L. F.; et al. Characterizing Heterogeneous Cellular Responses to Perturbations. Proc. Natl. Acad. Sci. U. S. A. 2008, 105, 19306–19311.

62. Gascoigne, K. E.; Taylor, S. S. Cancer Cells Display Profound Intra- and Interline Variation following Prolonged Exposure to Antimitotic Drugs. Cancer Cell 2008, 14, 111–122.

63. Toriello, N. M.; Douglas, E. S.; Thaitrong; et al. Integrated Microfluidic Bioprocessor for Single-Cell Gene Expression Analysis. Proc. Natl. Acad. Sci. U. S. A. 2008, 105, 20173–20178.

64. Snijder; Pelkmans. Origins of Regulated Cell-to-Cell Variability. Nat. Rev. Mol. Cell Biol. 2011, 12, 119–125.

65. Altschuler, S. J.; Wu, L. F. Cellular Heterogeneity: When Do Differences Make a Difference? Cell 2010, 141, 559–563.

66. Chang, H. H.; Hemberg; Barahona; et al. Transcriptome-Wide Noise Controls Lineage Choice in Mammalian Progenitor Cells. Nature 2008, 453, 544–547.

67. Tzanakakis, E. S. Deconstructing Stem Cell Population Heterogeneity: Single-Cell Analysis and Modeling Approaches. Biotechnol. Adv. 2013, 31, 1047–1062.

68. Singh, D. K.; Ku, C. J.; Wichaidit; et al. Patterns of Basal Signaling Heterogeneity Can Distinguish Cellular Populations with Different Drug Sensitivities. Mol. Syst. Biol. 2010, 6, 369.

69. Gerdes, M. J.; Sood; Sevinsky; et al. Emerging Understanding of Multiscale Tumor Heterogeneity. Front. Oncol. 2014, 4, 366.

70. Tawfik, D. S. Messy Biology and the Origins of Evolutionary Innovations. Nat. Chem. Biol. 2010, 6, 692–696.

71. Meacham, C. E.; Morrison, S. J. Tumour Heterogeneity and Cancer Cell Plasticity. Nature 2013, 501, 328–337.

72. Tabassum, D. P.; Polyak. Tumorigenesis: It Takes a Village. Nat. Rev. Cancer 2015, 15, 473–483.

73. Janiszewska; Liu; Almendro; et al. In Situ Single-Cell Analysis Identifies Heterogeneity for PIK3CA Mutation and HER2 Amplification in HER2-Positive Breast Cancer. Nat. Genet. 2015, 47, 1212–1219.

74. Marusyk; Tabassum, D. P.; Altrock, P. M.; et al. Non-Cell-Autonomous Driving of Tumour Growth Supports Sub-Clonal Heterogeneity. Nature 2014, 514, 54–58.

75. Almendro; Cheng, Y. K.; Randles; et al. Inference of Tumor Evolution during Chemotherapy by Computational Modeling and In Situ Analysis of Genetic and Phenotypic Cellular Diversity. Cell Rep. 2014, 6, 514–527.

76. Potts, S. J.; Krueger, J. S.; Landis, N. D.; et al. Evaluating Tumor Heterogeneity in Immunohistochemistry-Stained Breast Cancer Tissue. Lab. Invest. 2012, 92, 1342–1357.

77. Gerdes, M. J.; Sevinsky, C. J.; Sood; et al. Highly Multiplexed Single-Cell Analysis of Formalin-Fixed, Paraffin-Embedded Cancer Tissue. Proc. Natl. Acad. Sci. U. S. A. 2013, 110, 11982–11987.

78. Lawson, D. A.; Bhakta, N. R.; Kessenbrock; et al. Single-Cell Analysis Reveals a Stem-Cell Program in Human Metastatic Breast Cancer Cells. Nature 2015, 526, 131–135.

79. Abraham, V. C.; Samson; Lapets; et al. Automated Classification of Individual Cellular Responses across Multiple Targets. Preclinica 2004, 2, 349–355.

80. Stilwell, J. L.; Guan; Neve, R. M.; et al. Systems Biology in Cancer Research: Genomics to Cellomics. Methods Mol. Biol. 2007, 356, 353–365.

81. Shi; Orth, J. D.; Mitchison. Cell Type Variation in Responses to Antimitotic Drugs That Target Microtubules and Kinesin-5. Cancer Res. 2008, 68, 3269–3276.

82. McCann. Live Cell Imaging: An Industrial Perspective. Methods Mol. Biol. 2010, 591, 47–66.

83. Pereira, P. M.; Almada; Henriques. High-Content 3D Multicolor Super-Resolution Localization Microscopy. In Methods in Cell Biology, Ewa K. P., ed.; Academic Press: New York, 2015, pp. 95–117.

84. Legant, W. R.; Shao; Grimm, J. B.; et al. High-Density Three-Dimensional Localization Microscopy across Large Volumes. Nat. Methods 2016, 13, 359–365.

85. Taylor, D. L. Past, Present, and Future of High Content Screening and the Field of Cellomics. Methods Mol. Biol. 2007, 356, 3–18.

86. LaPan; Zhang; Pan; et al. Single Cell Cytometry of Protein Function in RNAi Treated Cells and in Native Populations. BMC Cell Biol. 2008, 9, 43.

87. Bright, G. R.; Whitaker, J. E.; Haugland, R. P.; et al. Heterogeneity of the Changes in Cytoplasmic pH upon Serum Stimulation of Quiescent Fibroblasts. J. Cell Physiol. 1989, 141, 410–419.

88. Loo, L. H.; Lin, H. J.; Singh, D. K.; et al. Heterogeneity in the Physiological States and Pharmacological Responses of Differentiating 3T3-L1 Preadipocytes. J. Cell Biol. 2009, 187, 375–384.

89. Racoceanu; Belhomme. Breakthrough Technologies in Digital Pathology. Comput. Med. Imaging Graph. 2015, 42, 1.

90. Nederlof; Watanabe; Burnip; et al. High-Throughput Profiling of Tissue and Tissue Model Microarrays: Combined Transmitted Light and 3-Color Fluorescence Digital Pathology. J. Pathol. Inform. 2011, 2, 50.

91. McCabe; Dolled-Filhart; Camp, R. L.; et al. Automated Quantitative Analysis (AQUA) of In Situ Protein Expression, Antibody Concentration, and Prognosis. J. Natl. Cancer Inst. 2005, 97, 1808–1815.

92. Lee, J. H.; Daugharthy, E. R.; Scheiman; et al. Highly Multiplexed Subcellular RNA Sequencing In Situ. Science 2014, 343, 1360–1363.

93. Giesen; Wang, H. A.; Schapiro; et al. Highly Multiplexed Imaging of Tumor Tissues with Subcellular Resolution by Mass Cytometry. Nat. Methods 2014, 11, 417–422.

94. Lin, J. R.; Fallahi-Sichani; Sorger, P. K. Highly Multiplexed Imaging of Single Cells Using a High-Throughput Cyclic Immunofluorescence Method. Nat. Commun. 2015, 6, 8390.

95. Weaver, E. M.; Hummon, A. B. Imaging Mass Spectrometry: From Tissue Sections to Cell Cultures. Adv. Drug Deliv. Rev. 2013, 65, 1039–1055.

96. Aichler; Walch. MALDI Imaging Mass Spectrometry: Current Frontiers and Perspectives in Pathology Research and Practice. Lab. Invest. 2015, 95, 422–431.

97. Lanni, E. J.; Rubakhin, S. S.; Sweedler, J. V. Mass Spectrometry Imaging and Profiling of Single Cells. J. Proteomics 2012, 75, 5036–5051.

98. Bodzon-Kulakowska; Suder. Imaging Mass Spectrometry: Instrumentation, Applications, and Combination with Other Visualization Techniques. Mass Spectrom. Rev. 2016, 35, 147–169.

99. Passarelli, M. K.; Ewing, A. G. Single-Cell Imaging Mass Spectrometry. Curr. Opin. Chem. Biol. 2013, 17, 854–859.

100. Zavalin; Todd, E. M.; Rawhouser, P. D.; et al. Direct Imaging of Single Cells and Tissue at Sub-Cellular Spatial Resolution Using Transmission Geometry MALDI MS. J. Mass Spectrom. 2012, 47, i.

101. Seeley, E. H.; Caprioli, R. M. 3D Imaging by Mass Spectrometry: A New Frontier. Anal. Chem. 2012, 84, 2105–2110.

102. Angelo; Bendall, S. C.; Finck; et al. Multiplexed Ion Beam Imaging of Human Breast Tumors. Nat. Med. 2014, 20, 436–442.

103. Bodenmiller. Multiplexed Epitope-Based Tissue Imaging for Discovery and Healthcare Applications. Cell Syst. 2016, 2, 225–238.

104. Bendall, S. C.; Davis, K. L.; Amir el, A. D.; et al. Single-Cell Trajectory Detection Uncovers Progression and Regulatory Coordination in Human B Cell Development. Cell 2014, 157, 714–725.

105. Spitzer, M. H.; Nolan, G. P. Mass Cytometry: Single Cells, Many Features. Cell 2016, 165, 780–791.

106. Levenson, R. M.; Borowsky, A. D.; Angelo. Immunohistochemistry and Mass Spectrometry for Highly Multiplexed Cellular Molecular Imaging. Lab. Invest. 2015, 95, 397–405.

107. Mao; et al. Application of Imaging Mass Spectrometry for the Molecular Diagnosis of Human Breast Tumors. Sci. Rep. 2016, 6, 21043.

108. Jones, E. A.; van Remoortere; van Zeijl, R. J. M.; et al. Multiple Statistical Analysis Techniques Corroborate Intratumor Heterogeneity in Imaging Mass Spectrometry Datasets of Myxofibrosarcoma. PLoS One 2011, 6, e24913.

109. Tata; Zheng; Ginsberg, H. J.; et al. Contrast Agent Mass Spectrometry Imaging Reveals Tumor Heterogeneity. Anal. Chem. 2015, 87, 7683–7689.

110. Thompson, C. G.; Bokhart, M. T.; Sykes; et al. Mass Spectrometry Imaging Reveals Heterogeneous Efavirenz Distribution within Putative HIV Reservoirs. Antimicrob. Agents Chemother. 2015, 59, 2944–2948.

111.

Perfetto

S. P.

Chattopadhyay

P. K.

Roederer

Seventeen-Colour Flow Cytometry: Unravelling the Immune System. Nat. Rev. Immunol. 2004, 4, 648–655.

112.

Bendall

S. C.

Nolan

G. P.

Roederer

. A Deep Profiler’s Guide to Cytometry. Trends Immunol. 2012, 33, 323–332.

113.

Bodenmiller

Zunder

E. R.

Finck

. Multiplexed Mass Cytometry Profiling of Cellular States Perturbed by Small-Molecule Regulators. Nat. Biotechnol. 2012, 30, 858–867.

114.

Hines

W. C.

Kuhn

. Sorting Out the FACS: A Devil in the Details. Cell Rep. 2014, 6, 779–781.

115.

Trapnell

Defining Cell Types and States with Single-Cell Genomics. Genome Res. 2015, 25, 1491–1498.

116.

Nakamura

Yabuta

Okamoto

. SC3-seq: A Method for Highly Parallel and Quantitative Measurement of Single-Cell Gene Expression. Nucleic Acids Res. 2015, 43, e60.

117.

Macosko

E. Z.

Basu

Satija

. Highly Parallel Genome-Wide Expression Profiling of Individual Cells Using Nanoliter Droplets. Cell 2015, 161, 1202–1214.

118.

Guo

Zhu

. Single-Cell Methylome Landscapes of Mouse Embryonic Stem Cells and Early Embryos Analyzed Using Reduced Representation Bisulfite Sequencing. Genome Res. 2013, 23, 2126–2135.

119.

Smallwood

S. A.

Lee

H. J.

Angermueller

. Single-Cell Genome-Wide Bisulfite Sequencing for Assessing Epigenetic Heterogeneity. Nat. Methods 2014, 11, 817–820.

120.

Jin

Tang

Wan

. Genome-Wide Detection of DNase I Hypersensitive Sites in Single Cells and FFPE Tissue Samples. Nature 2015, 528, 142–146.

121.

Cusanovich

D. A.

Daza

Adey

. Multiplex Single-Cell Profiling of Chromatin Accessibility by Combinatorial Cellular Indexing. Science 2015, 348, 910–914.

122.

Buenrostro

J. D.

Litzenburger

U. M.

. Single-Cell Chromatin Accessibility Reveals Principles of Regulatory Variation. Nature 2015, 523, 486–490.

123.

Xue

Eisele

M. R.

. Highly Multiplexed Profiling of Single-Cell Effector Functions Reveals Deep Functional Heterogeneity in Response to Pathogenic Ligands. Proc. Natl. Acad. Sci. U. S. A. 2015, 112, E607–E615.

124.

Onjiko

R. M.

Moody

S. A.

Nemes

Single-Cell Mass Spectrometry Reveals Small Molecules That Affect Cell Fates in the 16-Cell Embryo. Proc. Natl. Acad. Sci. U. S. A. 2015, 112, 6545–6550.

125.

Simpson

E. H.

The Interpretation of Interaction in Contingency Tables. J. Roy. Stat. Soc. B. 1951, 13, 238–241.

126.

Stegle

Teichmann

S. A.

Marioni

J. C.

Computational and Analytical Challenges in Single-Cell Transcriptomics. Nat. Rev. Genet. 2015, 16, 133–145.

127.

Trapnell

Cacchiarelli

Grimsby

. The Dynamics and Regulators of Cell Fate Decisions Are Revealed by Pseudotemporal Ordering of Single Cells. Nat. Biotechnol. 2014, 32, 381–386.

128.

Schissler

A. G.

Chen

J. L.

. Analysis of Aggregated Cell-Cell Statistical Distances within Pathways Unveils Therapeutic-Resistance Mechanisms in Circulating Tumor Cells. Bioinformatics 2016, 32, i80–i89.

129.

R Core Team. R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing: Vienna, Austria, 2015.

130.

Yule

G. U.

An Introduction to the Theory of Statistics. Charles Griffin and Company: London, UK, 1911.

131.

Barnett

Lewis

Outliers in Statistical Data, 3rd ed.; John Wiley: New York, 1994.

132.

Shannon

C. E.

Weaver

The Mathematical Theory of Communication. University of Illinois Press: Urbana, 1949.

133.

Simpson

E. H.

Measurement of Diversity. Nature 1949, 163, 688.

134.

Rao

C. R.

Diversity and Dissimilarity Coefficients—A Unified Approach. Theor. Popul. Biol. 1982, 21, 24–43.

135.

Razali

N. M.

Wah

Y. B.

Power Comparisons of Shapiro-Wilk, Kolmogorov-Smirnov, Lilliefors and Anderson-Darling Tests. J. Stat. Model. Anal. 2011, 2, 13.

136.

Fallahi-Sichani

Honarnejad

Heiser

L. M.

. Metrics Other Than Potency Reveal Systematic Variation in Responses to Cancer Drugs. Nat. Chem. Biol. 2013, 9, 708–714.

137.

University of Pittsburgh Drug Discovery Institute (UPDDI). upddi.pitt.edu. Accessed July 8, 2016.

138.

Dinov

I. D.

Methodological Challenges and Analytic Opportunities for Modeling and Interpreting Big Healthcare Data. Gigascience 2016, 5, 12.

139.

Hintze

J. L.

Nelson

R. D.

Violin Plots: A Box Plot-Density Trace Synergism. Am. Stat. 1998, 52, 181–184.

140.

Kiviet

D. J.

Nghe

Walker

. Stochasticity of Metabolism and Growth at the Single-Cell Level. Nature 2014, 514, 376–379.

141.

Spiller

D. G.

Wood

C. D.

Rand

D. A.

. Measurement of Single-Cell Dynamics. Nature 2010, 465, 736–745.

142.

Haney

S. A.

Factoring and Clustering High Content Data. In An Introduction to High Content Screening, John Wiley: New York, 2014, pp. 211–229.

143.

Giuliano

K. A.

Chen

Y.-T.

Taylor

D. L.

High-Content Screening with siRNA Optimizes a Cell Biological Approach to Drug Discovery: Defining the Role of P53 Activation in the Cellular Response to Anticancer Drugs. J. Biomol. Screen. 2004, 9, 557–568.

144.

Giuliano

K. A.

Cheung

W. S.

Curran

D. P.

. Systems Cell Biology Knowledge Created from High Content Screening. Assay Drug Dev. Technol. 2005, 3, 501–514.

145.

Perlman

Z. E.

Slack

M. D.

Feng

. Multidimensional Drug Profiling by Automated Microscopy. Science 2004, 306, 1194–1198.

146.

Boland

M. V.

Murphy

R. F.

A Neural Network Classifier Capable of Recognizing the Patterns of All Major Subcellular Structures in Fluorescence Microscope Images of HeLa Cells. Bioinformatics 2001, 17, 1213–1223.

147.

Loo

L. H.

Wu

L. F.

Altschuler

S. J.

Image-Based Multivariate Profiling of Drug Responses from Single Cells. Nat. Methods 2007, 4, 445–453.

148.

van der Maaten

Hinton

. Visualizing Data Using t-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605.

149.

van der Maaten

. Accelerating t-SNE Using Tree-Based Algorithms. J. Mach. Learn. Res. 2014, 15, 3221–3245.

150.

Low

Huang

Blosser

. High-Content Imaging Characterization of Cell Cycle Therapeutics through In Vitro and In Vivo Subpopulation Analysis. Mol. Cancer Ther. 2008, 7, 2455–2463.

151.

Naik

A. W.

Kangas

J. D.

Sullivan

D. P.

. Active Machine Learning-Driven Experimentation to Determine Compound Effects on Protein Patterns. Elife 2016, 5, e10047.

152.

Harrison

P. M.

Badel

Wall

M. J.

. Experimentally Verified Parameter Sets for Modelling Heterogeneous Neocortical Pyramidal-Cell Populations. PLoS Comput. Biol. 2015, 11, e1004165.

153.

Gupta

P. B.

Fillmore

C. M.

Jiang

. Stochastic State Transitions Give Rise to Phenotypic Equilibrium in Populations of Cancer Cells. Cell 2011, 146, 633–644.

154.

Hasenauer

Heinrich

Doszczak

. A Visual Analytics Approach for Models of Heterogeneous Cell Populations. EURASIP J. Bioinform. Syst. Biol. 2012, 2012, 4.

155.

Gentleman

R. C.

Carey

V. J.

Bates

D. M.

. Bioconductor: Open Software Development for Computational Biology and Bioinformatics. Genome Biol. 2004, 5, R80.

156.

Lee

Hahne

Sarkar

. iFlow: A Graphical User Interface for Flow Cytometry Tools in Bioconductor. Adv. Bioinformatics 2009, 2009, 103839.

157.

Finak

Frelinger

Jiang

. OpenCyto: An Open Source Infrastructure for Scalable, Robust, Reproducible, and Automated, End-to-End Flow Cytometry Data Analysis. PLoS Comput. Biol. 2014, 10, e1003806.

158.

Friedman

Yurtsev

FlowCytometryTools v0.4.5, a Python Package for Visualization and Analysis of High-Throughput Flow Cytometry Data. http://eyurtsev.github.io/FlowCytometryTools/. Accessed June 14, 2016.

159.

Jones

T. R.

Kang

I. H.

Wheeler

D. B.

. CellProfiler Analyst: Data Exploration and Analysis Software for Complex Image-Based Screens. BMC Bioinformatics 2008, 9, 1–16.

160.

Ogier

Dorval

HCS-Analyzer: Open Source Software for High-Content Screening Data Correction and Analysis. Bioinformatics 2012, 28, 1945–1946.

161.

Stoter

Niederlein

Barsacchi

. CellProfiler and KNIME: Open Source Tools for High Content Screening. Methods Mol. Biol. 2013, 986, 105–122.

162.

Allan

Burel

J. M.

Moore

. OMERO: Flexible, Model-Driven Data Management for Experimental Biology. Nat. Methods 2012, 9, 245–253.

163.

Amir

E.-a. D.

Davis

K. L.

Tadmor

M. D.

. viSNE Enables Visualization of High Dimensional Single-Cell Data and Reveals Phenotypic Heterogeneity of Leukemia. Nat. Biotechnol. 2013, 31, 545–552.

164.

Diggins

K. E.

Ferrell

P. B., Jr.

Irish

J. M.

Methods for Discovery and Characterization of Cell Subsets in High Dimensional Mass Cytometry Data. Methods 2015, 82, 55–63.

165.

Burrell

R. A.

McGranahan

Bartek

. The Causes and Consequences of Genetic Heterogeneity in Cancer Evolution. Nature 2013, 501, 338–345.

166.

Batchelor

Kann

M. G.

Przytycka

T. M.

Raphael

B. J.

Wojtowicz

eds. Modeling Cell Heterogeneity: From Single-Cell Variations to Mixed Cells. Pacific Symposium on Biocomputing 2013; 2013; Kohala Coast, HI.

167.

Caie

P. D.

Walls

R. E.

Ingleston-Orme

. High-Content Phenotypic Profiling of Drug Response Signatures across Distinct Cancer Cells. Mol. Cancer Ther. 2010, 9, 1913–1926.

168.

Johnston

R. L.

Wockner

McCart Reed

A. E.

. High Content Screening Application for Cell-Type Specific Behaviour in Heterogeneous Primary Breast Epithelial Subpopulations. Breast Cancer Res. 2016, 18, 18.

169.

Niepel

Spencer

S. L.

Sorger

P. K.

Non-Genetic Cell-to-Cell Variability and the Consequences for Pharmacology. Curr. Opin. Chem. Biol. 2009, 13, 556–561.

170.

Chung

G. G.

Zerkowski

M. P.

Ghosh

. Quantitative Analysis of Estrogen Receptor Heterogeneity in Breast Cancer. Lab. Invest. 2007, 87, 662–669.

171.

Alizadeh

A. A.

Aranda

Bardelli

. Toward Understanding and Exploiting Tumor Heterogeneity. Nat. Med. 2015, 21, 846–853.

172.

Balkwill

F. R.

Capasso

Hagemann

The Tumor Microenvironment at a Glance. J. Cell Sci. 2012, 125, 5591–5596.

173.

Hanahan

Weinberg

R. A.

Hallmarks of Cancer: The Next Generation. Cell 2011, 144, 646–674.

174.

Waclaw

Bozic

Pittman

M. E.

. A Spatial Model Predicts That Dispersal and Cell Turnover Limit Intratumour Heterogeneity. Nature 2015, 525, 261–264.

175.

Gerlinger

Rowan

A. J.

Horswell

. Intratumor Heterogeneity and Branched Evolution Revealed by Multiregion Sequencing. N. Engl. J. Med. 2012, 366, 883–892.

176.

Kumar

Boyle

E. A.

Tokita

. Deep Sequencing of Multiple Regions of Glial Tumors Reveals Spatial Heterogeneity for Mutations in Clinically Relevant Genes. Genome Biol. 2014, 15, 530.

177.

Govindan

Cancer. Attack of the Clones. Science 2014, 346, 169–170.

178.

Bashashati

Tone

. Distinct Evolutionary Trajectories of Primary High-Grade Serous Ovarian Cancers Revealed through Spatial Mutational Profiling. J. Pathol. 2013, 231, 21–34.

179.

Yates

L. R.

Gerstung

Knappskog

. Subclonal Diversification of Primary Breast Cancer Revealed by Multiregion Sequencing. Nat. Med. 2015, 21, 751–759.

180.

Rivenbark

A. G.

O’Connor

S. M.

Coleman

W. B.

Molecular and Cellular Heterogeneity in Breast Cancer: Challenges for Personalized Medicine. Am. J. Pathol. 2013, 183, 1113–1124.

181.

Sugihara

Taniguchi

Kushima

. Laser Microdissection and Two-Dimensional Difference Gel Electrophoresis Reveal Proteomic Intra-Tumor Heterogeneity in Colorectal Cancer. J. Proteomics 2013, 78, 134–147.

182.

Navin

Kendall

Troge

. Tumour Evolution Inferred by Single-Cell Sequencing. Nature 2011, 472, 90–94.

183.

Wang

Waters

Leung

M. L.

. Clonal Evolution in Breast Cancer Revealed by Single Nucleus Genome Sequencing. Nature 2014, 512, 155–160.

184.

Irish

J. M.

Hovland

Krutzik

P. O.

. Single Cell Profiling of Potentiated Phospho-Protein Networks in Cancer Cells. Cell 2004, 118, 217–228.

185.

Camp

R. L.

Chung

G. G.

Rimm

D. L.

Automated Subcellular Localization and Quantification of Protein Expression in Tissue Microarrays. Nat. Med. 2002, 8, 1323–1328.

186.

Chung

G. G.

Zerkowski

M. P.

Ghosh

. Quantitative Analysis of Estrogen Receptor Heterogeneity in Breast Cancer. Lab. Invest. 2007, 87, 662–669.

187.

Salo

Vered

Bello

I. O.

. Insights into the Role of Components of the Tumor Microenvironment in Oral Carcinoma Call for New Therapeutic Approaches. Exp. Cell Res. 2014, 325, 58–64.

188.

Church

K. W.

Hanks

Word Association Norms, Mutual Information, and Lexicography. Comp. Linguistics 1990, 16, 22–29.

189.

Role

F.

Nadif

M.

Handling the Impact of Low Frequency Events on Co-Occurrence-Based Measures of Word Similarity: A Case Study of Pointwise Mutual Information. In International Conference on Knowledge Discovery and Information Retrieval, Paris, France, 2011.

190.

Lee

J. H.

Daugharthy

E. R.

Scheiman

. Fluorescent In Situ Sequencing (FISSEQ) of RNA for Gene Expression Profiling in Intact Cells and Tissues. Nat. Protoc. 2015, 10, 442–458.

191.

Clarke

G. M.

Zubovits

J. T.

Shaikh

K. A.

. A Novel, Automated Technology for Multiplex Biomarker Imaging and Application to Breast Cancer. Histopathology 2014, 64, 242–255.

192.

Durruthy-Durruthy

Gottlieb

Hartman

B. H.

. Reconstruction of the Mouse Otocyst and Early Neuroblast Lineage at Single-Cell Resolution. Cell 2014, 157, 964–978.

193.

Achim

Pettit

J. B.

Saraiva

L. R.

. High-Throughput Spatial Mapping of Single-Cell RNA-seq Data to Tissue of Origin. Nat. Biotechnol. 2015, 33, 503–509.

194.

Satija

Farrell

J. A.

Gennert

. Spatial Reconstruction of Single-Cell Gene Expression Data. Nat. Biotechnol. 2015, 33, 495–502.

195.

Lubeck

Cai

Single-Cell Systems Biology by Super-Resolution Imaging and Combinatorial Labeling. Nat. Methods 2012, 9, 743–748.

196.

Chen

K. H.

Boettiger

A. N.

Moffitt

J. R.

. RNA Imaging: Spatially Resolved, Highly Multiplexed RNA Profiling in Single Cells. Science 2015, 348 (6233). dx.doi.org/10.1126/science.aaa6090.

197.

Stahl

P. L.

Salmen

Vickovic

. Visualization and Analysis of Gene Expression in Tissue Sections by Spatial Transcriptomics. Science 2016, 353, 78–82.

198.

Tetteh

P. W.

Farin

H. F.

Clevers

Plasticity within Stem Cell Hierarchies in Mammalian Epithelia. Trends Cell Biol. 2015, 25, 100–108.

199.

Barker

van Es

J. H.

Kuipers

. Identification of Stem Cells in Small Intestine and Colon by Marker Gene Lgr5. Nature 2007, 449, 1003–1007.

200.

Ritsma

Ellenbroek

S. I.

Zomer

. Intestinal Crypt Homeostasis Revealed at Single-Stem-Cell Level by In Vivo Live Imaging. Nature 2014, 507, 362–365.

201.

Bendall

S. C.

Nolan

G. P.

From Single Cells to Deep Phenotypes in Cancer. Nat. Biotechnol. 2012, 30, 639–647.

202.

Cahan

Daley

G. Q.

Origins and Implications of Pluripotent Stem Cell Variability and Heterogeneity. Nat. Rev. Mol. Cell Biol. 2013, 14, 357–368.

203.

Huang

Ingber

D. E.

A Non-Genetic Basis for Cancer Progression and Metastasis: Self-Organizing Attractors in Cell Regulatory Networks. Breast Dis. 2006, 26, 27–54.

204.

Simons

B. D.

Clevers

Strategies for Homeostatic Stem Cell Self-Renewal in Adult Tissues. Cell 2011, 145, 851–862.

205.

Greulich

Simons

B. D.

Dynamic Heterogeneity as a Strategy of Stem Cell Self-Renewal. Proc. Natl. Acad. Sci. U. S. A. 2016, 113, 7509–7514.

206.

Clevers

The Intestinal Crypt, a Prototype Stem Cell Compartment. Cell 2013, 154, 274–284.

207.

Hara

Nakagawa

Enomoto

. Mouse Spermatogenic Stem Cells Continually Interconvert between Equipotent Singly Isolated and Syncytial States. Cell Stem Cell 2014, 14, 658–672.

208.

Rompolas

Greco

Stem Cell Dynamics in the Hair Follicle Niche. Semin. Cell Dev. Biol. 2014, 25–26, 34–42.

209.

Scadden

D. T.

The Stem-Cell Niche as an Entity of Action. Nature 2006, 441, 1075–1079.

210.

Barker

Adult Intestinal Stem Cells: Critical Drivers of Epithelial Homeostasis and Regeneration. Nat. Rev. Mol. Cell Biol. 2014, 15, 19–33.

211.

Balaban

N. Q.

Merrin

Chait

. Bacterial Persistence as a Phenotypic Switch. Science 2004, 305, 1622–1625.

212.

Munsky

Neuert

van Oudenaarden

Using Gene Expression Noise to Understand Gene Regulation. Science 2012, 336, 183–187.

213.

Ooi

H. K.

Modeling Heterogeneous Responsiveness of Intrinsic Apoptosis Pathway. BMC Syst. Biol. 2013, 7, 65.

214.

De Smet

Marchal

. Advantages and Limitations of Current Network Inference Methods. Nat. Rev. Microbiol. 2010, 8, 717–729.

215.

Borisov

N. M.

Markevich

N. I.

Hoek

J. B.

. Signaling through Receptors and Scaffolds: Independent Interactions Reduce Combinatorial Complexity. Biophys. J. 2005, 89, 951–966.

216.

Schaber

Klipp

Model-Based Inference of Biochemical Parameters and Dynamic Properties of Microbial Signal Transduction Networks. Curr. Opin. Biotechnol. 2011, 22, 109–116.

217.

Taylor

I. W.

Linding

Warde-Farley

. Dynamic Modularity in Protein Interaction Networks Predicts Breast Cancer Outcome. Nat. Biotechnol. 2009, 27, 199–204.

218.

Kirouac

D. C.

Saez-Rodriguez

Swantek

. Creating and Analyzing Pathway and Protein Interaction Compendia for Modelling Signal Transduction Networks. BMC Syst. Biol. 2012, 6, 29.

219.

Eydgahi

Chen

W. W.

Muhlich

J. L.

. Properties of Cell Death Models Calibrated and Compared Using Bayesian Approaches. Mol. Syst. Biol. 2013, 9, 644.

220.

Spencer

S. L.

Gaudet

Albeck

J. G.

. Non-Genetic Origins of Cell-to-Cell Variability in TRAIL-Induced Apoptosis. Nature 2009, 459, 428–432.

221.

Nowak

M. A.

Bonhoeffer

Spatial Heterogeneity in Drug Concentrations Can Facilitate the Emergence of Resistance to Cancer Therapy. PLoS Comput. Biol. 2015, 11, e1004142.

222.

Wells

D. K.

Chuang

Knapp

L. M.

. Spatial and Functional Heterogeneities Shape Collective Behavior of Tumor-Immune Networks. PLoS Comput. Biol. 2015, 11, e1004181.

223.

Li

H. J.

Reinhardt

Herschman

H. R.

. Cancer-Stimulated Mesenchymal Stem Cells Create a Carcinoma Stem Cell Niche via Prostaglandin E2 Signaling. Cancer Discov. 2012, 2, 840–855.

224.

Garraway

L. A.

Janne

P. A.

Circumventing Cancer Drug Resistance in the Era of Personalized Medicine. Cancer Discov. 2012, 2, 214–226.

225.

Zhao

Pritchard

J. R.

Lauffenburger

D. A.

. Addressing Genetic Tumor Heterogeneity through Computationally Predictive Combination Therapy. Cancer Discov. 2014, 4, 166–174.

226.

Pritchard

J. R.

Bruno

P. M.

Gilbert

L. A.

. Defining Principles of Combination Drug Mechanisms of Action. Proc. Natl. Acad. Sci. U. S. A. 2013, 110, E170–E179.

227.

Dawson

J. C.

Carragher

N. O.

Quantitative Phenotypic and Pathway Profiling Guides Rational Drug Combination Strategies. Front. Pharmacol. 2014, 5, 118.

228.

Inda

M. M.

Bonavia

Mukasa

. Tumor Heterogeneity Is an Active Process Maintained by a Mutant EGFR-Induced Cytokine Circuit in Glioblastoma. Genes Dev. 2010, 24, 1731–1745.

229.

Eirew

Steif

Khattra

. Dynamics of Genomic Clones in Breast Cancer Patient Xenografts at Single-Cell Resolution. Nature 2015, 518, 422–426.

230.

de Smith

Statistical Analysis Handbook. http://www.statsref.com/HTML/index.html. Accessed September 17, 2016.

231.

NIST/SEMATECH e-Handbook of Statistical Methods. http://www.itl.nist.gov/div898/handbook/eda/section3/eda35b.htm. Accessed September 17, 2016.

232.

Almendro

Kim

H. J.

Cheng

Y. K.

. Genetic and Phenotypic Diversity in Breast Tumor Metastases. Cancer Res. 2014, 74, 1338–1348.

233.

Rose

C. J.

Mills

S. J.

O’Connor

J. P.

. Quantifying Spatial Heterogeneity in Dynamic Contrast-Enhanced MRI Parameter Maps. Magn. Reson. Med. 2009, 62, 488–499.

234.

Schwarz

R. F.

Ng

C. K.

Cooke

S. L.

. Spatial and Temporal Heterogeneity in High-Grade Serous Ovarian Cancer: A Phylogenetic Analysis. PLoS Med. 2015, 12, e1001789.

235.

Saadatpour

Guo

Orkin

S. H.

. Characterizing Heterogeneity in Leukemic Cells Using Single-Cell Gene Expression Analysis. Genome Biol. 2014, 15, 525.