Abstract
Compassion has received increasing societal and scientific interest in recent years. The science of compassion requires a tool that can offer valid and reliable measurement of the construct to allow examination of its causes, correlates, and consequences. The current studies developed and examined the psychometric properties of new self-report measures of compassion for others and for the self, the 20-item Sussex-Oxford Compassion for Others Scale (SOCS-O) and 20-item Sussex-Oxford Compassion for the Self Scale (SOCS-S). These were based on the theoretically and empirically supported definition of compassion as comprising five dimensions: (a) recognizing suffering, (b) understanding the universality of suffering, (c) feeling for the person suffering, (d) tolerating uncomfortable feelings, and (e) motivation to act/acting to alleviate suffering. Findings support the five-factor structure for both the SOCS-O and SOCS-S. Scores on both scales showed adequate internal consistency, interpretability, floor/ceiling effects, and convergent and discriminant validity.
Compassion is considered to be an innate, evolved capacity (Darwin, 1871; de Waal, 2009; Gilbert, 2005) and has long been emphasized to be a core human virtue in major contemplative and religious traditions (Dahlsgaard, Peterson, & Seligman, 2005). Recently, there has been a surge in scientific interest in compassion and increased recognition of the importance of both compassion for others and compassion for the self across multiple sectors of society, including health care, education, and the justice system (e.g., American Medical Association, 2001; Compassion in Education Foundation, 2016; Department of Health, 2013; Norko, 2005). Compassion is associated with a range of adaptive and prosocial characteristics and outcomes, such as greater well-being (Davidson & Schuyler, 2015), happiness (Mongrain, Chin, & Shapira, 2011), and reduced depressive symptoms (López, Sanderman, Ranchor, & Schroevers, 2018), and there is growing evidence that greater compassion can be cultivated through compassion-based interventions (CBIs; Kirby, Tellegen, & Steindl, 2017). The science of compassion requires a tool that can offer valid and reliable measurement of the construct to allow examination of its causes, correlates, and consequences (Strauss et al., 2016). This article reports on the development and psychometric properties of parallel measures of compassion for others and compassion for the self.
While there are many definitions of compassion, there has been a lack of consensus on its key defining features. The Oxford English Dictionary defines compassion solely in terms of an emotional response to suffering (“sympathetic pity and concern for the sufferings or misfortunes”). In psychological literature, compassion has been conceptualized as comprising recognition and awareness of suffering, emotionally connecting with that suffering, and the desire to act or acting to ease the suffering (Jazaieri et al., 2013; Kanov et al., 2004). Other definitions also highlight the integral role of distress tolerance, the ability to stay with difficult emotions when faced with suffering (Dalai Lama, 2002; Gilbert, 2010; Wispe, 1991), and common humanity, understanding that suffering is a universal human experience (Feldman & Kuyken, 2011; Neff, 2003; Pommier, 2010).
In the context of increasing and widespread interest in compassion and how it can be cultivated, there is a need to consolidate the range of theoretical conceptualizations of compassion into one comprehensive, operational definition. A recent position paper reviewed and consolidated a range of conceptualizations of compassion into one multifaceted, operational definition in an attempt to provide the clarity necessary to advance compassion research (Strauss et al., 2016). The definition conceptualizes compassion as a cognitive, affective, and behavioral process consisting of the following five elements: (a) recognizing suffering; (b) understanding the universality of suffering in human experience; (c) feeling for the person suffering and emotionally connecting with their distress; (d) tolerating any uncomfortable feelings aroused in response to the suffering (e.g., fear, disgust, distress) so that we remain accepting of and open to the person suffering; and (e) acting or being motivated to act to alleviate the suffering. As well as encompassing these elements, a key feature of compassion that distinguishes it from related states (e.g., empathy, kindness, sympathy) is that it arises specifically in response to suffering (Strauss et al., 2016). Consistent with theory that the process of compassion is broadly the same whether it is directed at the self or at others (Feldman & Kuyken, 2011; Gilbert, 2009, 2014), this five-element definition applies to both. That is to say recognizing suffering, and its universality, being able to tolerate elicited feelings, and acting to alleviate suffering can be directed equally to the self or others.
As well as being grounded in theory, this five-element conceptualization of compassion has also received empirical support. Gu, Cavanagh, Baer, and Strauss (2017) conducted a series of factor analytic studies to empirically examine the underlying conceptual structure of compassion using items from existing self-report compassion measures. Their findings showed support for a five-factor hierarchical structure of compassion consistent with Strauss et al.’s (2016) definition, with the five elements captured under an overarching compassion factor.
Although factor analyses of existing items support the five-element definition of compassion, existing measures of compassion fail to capture the breadth of all five elements. In addition to their review of conceptualizations of compassion, Strauss et al. (2016) also systematically reviewed nine existing questionnaire measures of self- and other-compassion 1 and found that none of the scales comprehensively captured the construct of compassion, with some items from measures worded in conflict with the response scale, containing the word “compassion” and relying on respondents to define this term, appearing to tap into related constructs such as empathy and kindness, and not being related to suffering and thus arguably not capturing compassion. Many existing measures also have poor or inadequately tested psychometric properties, namely poor internal consistency and insufficient evidence for factor structure, with none of the reviewed measures scoring over 50% on the quality rating tool.
Given the lack of valid and reliable measures which comprehensively capture compassion, there is a need to develop new robust measures of compassion for the self and others in order to progress scientific investigation. Continued use of measures which are limited both in whether they fully capture the nature of compassion and their psychometric properties could lead to erroneous research findings which would be counterproductive for compassion research and practice. Key areas of research and clinical practice which would benefit from new robust measures of compassion include evaluating the causes and consequences of compassion and examining whether psychological interventions developed to explicitly or implicitly enhance people’s capacity for compassion for themselves and other people (e.g., Mindfulness-Based Stress Reduction: Kabat-Zinn, 1982; Mindfulness-Based Cognitive Therapy: Segal, Williams, & Teasdale, 2002, 2013; Compassion Focused Therapy: Gilbert, 2014; Mindful Self-Compassion: Neff & Germer, 2013) work through their hypothesized mechanism of action (i.e., improved compassion).
The Current Program of Research
The current program of research aimed to address the lack of robust compassion measures by developing and psychometrically evaluating two parallel self-report measures of compassion based on Strauss et al.’s (2016) theoretically and empirically supported five-element definition of compassion: the Sussex-Oxford Compassion for Others Scale (SOCS-O) and the Sussex-Oxford Compassion for the Self Scale (SOCS-S). Self and other versions of the scale were developed in parallel in keeping with the theoretical literature on compassion which does not distinguish between the two (e.g., Feldman & Kuyken, 2011; Gilbert, 2009, 2014; Strauss et al., 2016). Developing compassion for self and other scales in parallel has the potential to empirically test this theory and to enhance understanding of the nature of the relationship between compassion for the self and compassion for others (Gu et al., 2017). Parallel scales will clarify the facets underlying compassion for self and others (theory would predict that the factor structure of both scales will mirror each other) and will also enable empirical examination of the overlap between the experience of compassion for self and others.
Development and validation of the SOCS-O and SOCS-S comprised four stages: (a) item generation and review through consultation with both experts and nonexperts, (b) item reduction using data from a sample of health care staff, (c) validation of the factor structure of measures and evaluation of their psychometric properties in a sample of health care staff, and (d) cross-validation of their factor structure and evaluation of their psychometric properties in a sample of University students. Health care staff were recruited in Stages 2 and 3 for a number of reasons. First, they represent a well-defined sample for whom compassion for self and others may be particularly salient on a daily basis, given their experience of providing care to others while working in an emotionally demanding profession. Second, in response to increasing research and societal interest in compassion in health care contexts; there has been a particular emphasis on creating a culture of compassion in the health care sector (e.g., American Medical Association, 2001; Department of Health, 2013; NHS England, 2017). This is linked to research indicating improvements in patient outcomes associated with increased compassionate care (e.g., Epstein et al., 2005; Sanghavi, 2006), acknowledgement of self-care as an integral part of providing effective care to others (NHS England, 2017), and reports of diminishing compassion for self and others in cases of work-related burnout (Joinson, 1992). Last, recruiting health care staff allowed for empirical testing of key research questions in this sample, including whether compassion for the self is related to providing compassionate care to others and whether enhanced compassion is linked to reduced work-related burnout. These questions were addressed in Stage 3 of this program of research.
The four stages followed best practice guidelines for measure development in terms of generating items in relation to a theoretically informed, operational definition and in consultation both with experts in the topic and nonexperts from a population likely to complete the measures in future research, assessing the content validity of items, reducing item pools to remove redundant items and create scales of manageable length, validating factor structures in independent samples to confirm a prespecified model for the measures, and assessing other psychometric properties, such as internal consistency and convergent and discriminant validity (e.g., Byrne, 2005; Costello & Osborne, 2005; Furr, 2011; Hinkin, 1998). All four stages received ethical approval from the Sciences & Technology Cross-Schools Research Ethics Committee of the University of Sussex. The method and results for each stage are presented in turn.
Stage 1: Item Generation and Review
The aim of this stage was to generate and review items through consultation with both experts and nonexperts. To maximize content validity, we used the five-element definition of compassion to formulate items that closely related to each dimension. Items relating to self-compassion and other-compassion were generated in parallel. Items were generated and revised in consultation with experts in contemplative approaches purposively sampled to represent different cultural contexts across the globe and reviewed by nonexperts representative of the populations likely to complete the measure.
Item Generation
Method
Participants and procedure
A total of 22 English-speaking experts in contemplative approaches (72.7% female;
Interviews with experts were conducted by the first author over telephone or Skype. At least 24 hours prior to the interviews, experts were provided with an information sheet detailing the five elements of compassion, the interview procedure, scale instructions, preferred item characteristics (e.g., chosen response scale, response period, items worded as statements, both positively phrased and negatively phrased items), and good practice guidelines for formulating items (e.g., avoiding double-barreled items, keeping item wording concise, excluding frequency terms such as “often” and “sometimes”; DeVellis, 2016; Terwee et al., 2007). The intention was to develop measures which could be used widely, in many adult populations, and we therefore aimed to develop items which were succinct, clear, and understandable and in the scale instructions, defined the use of more ambiguous terms such as “suffering” (see the copy of the resulting scales in the supplementary materials available online, for scale instructions). The information sheet also informed experts of the intention to develop measures of both self- and other-compassion. Experts were asked to generate up to three parallel items that they thought best described each element of compassion for self and others.
Results
Altogether, experts generated 155 other-compassion items and 101 self-compassion items. All authors reviewed all generated items and came to a consensus regarding the set of items through the following iterative process. To retain as many generated items as possible, items were removed only if they were semantic duplicates and if they did not conceptually capture a particular element of compassion. Some items were also reworded to fit the response scale and parallel items were generated where these were lacking (e.g., generating an other-compassion version of an item which had only a self-compassion form). All universality of suffering items could be applied to both the self and others (e.g., “I understand that everyone experiences suffering at some point in their lives”) and were included in both self- and other-compassion item pools. The authors also compared all generated items with existing compassion measures included in Strauss et al.’s (2016) review to ensure good coverage of generated items; no items were added, removed, or changed based on this comparison. Following the iterative review by authors, the pool of items was reduced to 60 compassion for others items and 60 compassion for the self items.
Item Review
Method and Results
Fifteen of the experts in contemplative approaches who contributed to the generation of the initial pool of items and 15 nonexperts (60.0% female;
An anonymous online survey on Bristol Online Surveys (www.onlinesurveys.ac.uk) containing the 60 other-compassion and 60 self-compassion items, displayed under their relevant element, was administered to participants. The survey for experts asked them to consider whether each item adequately represents its relevant element and respond accordingly by selecting “yes” or “no.” The survey for nonexperts asked them to consider whether the wording of each item is clear and understandable (“yes” or “no”). It was agreed a priori that an item would be removed if more than 50% of experts responded “no,” indicating that it does not adequately represent its relevant element, or if more than 50% of nonexperts responded “no,” indicating that it is not clearly worded.
None of the items were removed based on the review by nonexperts. Three other-compassion items and two self-compassion items were deemed by experts to not adequately represent their relevant elements. These five items were removed, leaving 57 other-compassion items and 58 self-compassion items for Stage 2.
Stage 2: Item Reduction
Stage 2 aimed to reduce the pool of self- and other-compassion items generated in Stage 1. To do this, we applied the theoretically and empirically supported five-factor model separately on the pool of self- and other-compassion items using confirmatory factor analysis (CFA) and selected items with the highest loadings on each factor.
Method
Participants and Procedure
Participants were 1,017 health care staff working in a role that involves at least 1 day a week of direct contact with patients. Staff were recruited from public health care organizations in the South of the United Kingdom. Participation was voluntary. Of the 1,017 participants, 859 completed demographic questions, with the exception of age, which was completed by 843 participants. The mean age of the sample was 42.37 years (
Measures
Compassion items
This consisted of the 57 other-compassion items and 58 self-compassion items derived from Stage 1. The self- and other-compassion items appeared separately and their order was counterbalanced, such that for around half of participants, other- or self-compassion scales appeared first. Items were arranged such that they alternated among the five elements. Participants were instructed to indicate how true each statement was of them using a 5-point Likert-type scale ranging from 1 (
Along with the compassion items, other measures of mindfulness, compassion, empathy, well-being, depression, anxiety, and stress were administered in the survey as part of a larger study and data on these are not reported on here.
Planned Data Analysis
Two five-factor CFA models, with items loading on respective factors from the five-element conceptualization of compassion (Strauss et al., 2016), were applied; one to the pool of other-compassion items and one to the pool of self-compassion items. Models used maximum-likelihood estimation with robust standard errors conducted in M
Results
Compassion for Others
Indeed, 932 staff completed other-compassion items and were included in the item selection for this scale. There were no missing item-level data; all 932 staff completed all other-compassion items. Table S1 (see supplementary materials available online) shows the standardized loadings of items on respective factors. The four highest loading items for each factor were selected for the SOCS-O and these are preceded by an asterisk. All standardized loadings were significant (
Compassion for the Self
A total of 947 participants completed self-compassion items and were included in the item selection for this scale. There were no missing item-level data; all 947 staff completed all self-compassion items. Table S2 (see supplementary materials available online) presents the standardized item loadings on respective factors. The four highest loading items for each factor were retained for the SOCS-S; these are preceded by an asterisk. All standardized loadings were significant (
Stage 3: Validating Factor Structures Using CFA
Stage 3 applied CFA to data from a large, independent sample of health care staff to confirm the factor structures of the SOCS-O and SOCS-S. We hypothesized that the theoretically derived five-element model of compassion, which conceptualizes a hierarchical structure, whereby the five related components (recognizing, universality, feeling, tolerating, and acting) are elements of an overarching compassion factor, would be a good fit to data for both the SOCS-O and SOCS-S. CFA is the recommended approach for confirming, and testing hypotheses about, a preconceived factor structure (Costello & Osborne, 2005).
This stage also tested other psychometric properties of these scales, namely convergent and discriminant validity (the degree to which scales were related to other measures in ways consistent with theoretically derived hypotheses), interpretability (the extent to which qualitative meaning can be attached to quantitative scores; tested by comparing scale scores in subgroups of participants), internal consistency of total scale and subscale items (the extent to which items in a scale or subscale are correlated), and floor and ceiling effects (the percentage of respondents achieving the highest and lowest possible scores on scales).
For the criterion of interpretability to be met, Terwee et al. (2007) requires scale scores to be compared in at least four subgroups of participants. We examined whether SOCS-O and SOCS-S scores differed in relation to gender, length of previous meditation experience (four subgroups: no previous experience, less than a year, 1 to 5 years, over 5 years), level of education (five subgroups: no formal qualifications, GCSE/equivalent, A-level/equivalent, degree/equivalent, and higher degree/equivalent), and marital status (four subgroups: single, married/civil partnership/cohabiting/long-term relationship, separated/divorced, widowed). We predicted that there would be a significant gender difference in SOCS-O scores only, with females scoring higher than males, consistent with previous research showing that females score significantly higher on measures of compassion for others than males, but no gender difference for self-compassion (e.g., Neff & Pommier, 2013; Pommier, 2010). Based on previous research demonstrating that meditators reported significantly higher levels of both other-compassion and self-compassion compared with nonmeditator samples (e.g., Neff & Pommier, 2013), we also hypothesized that length of previous meditation experience would have a significant effect on SOCS-O and SOCS-S scores, with subgroups with more meditation experience scoring significantly higher on these scales compared with those with less meditation experience. Due to lack of research and compelling reasons to expect differences in scale scores in relation to level of education and marital status, we did not make any predictions for these variables, but explored their findings.
For the criterion of convergent and discriminant validity to be met, Terwee et al. (2007) requires prespecified hypotheses to be made, at least three quarters of results to be consistent with hypotheses, and in relation to convergent validity, at least two of the correlations to be large (
Method
Participants and Procedure
An independent sample of 1,319 health care staff completed an anonymous online survey on Qualtrics containing self-report measures. Staff were recruited from public health care organizations in the south of the United Kingdom. Participation was voluntary. Indeed, 1,132 to 1,137 participants completed demographic questions, with the exception of age, which was completed by 1,123 participants. The mean age of the sample was 44.83 years (
Measures
With the exception of the SOCS-O and SOCS-S, the below measures were selected because they are theoretically expected to be related in particular ways to self- and/or other-compassion.
Sussex-Oxford Compassion for Others Scale and Sussex-Oxford Compassion for the Self Scale
The 20-item SOCS-O and 20-item SOCS-S derived from Stage 2 appeared separately, either at the start or the end of the survey, and their order was counterbalanced. For each scale, items were arranged such that they alternated among the five elements. Participants were instructed to indicate how true each statement was of them using a 5-point Likert-type scale ranging from 1 (
Five-Facet Mindfulness Questionnaire (FFMQ) 15-item version (Baer, Carmody, & Hunsinger, 2012)
The 15-item FFMQ (FFMQ-15) is a short form of the 39-item FFMQ (FFMQ-39) and measures the general tendency to be mindful in everyday life. It includes the same five facets as the long form: observing, describing, acting with awareness, nonjudging of inner experience, and nonreactivity to inner experience. The factor structure of the FFMQ-15 is consistent with that of the FFMQ-39, there are large correlations between total facet scores of the short and long forms, and the two FFMQ versions do not differ significantly from each other in terms of convergent validity (Gu et al., 2016). Previous research (Baer, Smith, Hopkins, Krietemeyer, & Toney, 2006; Gu et al., 2016; Williams, Dalgleish, Karl, & Kuyken, 2014) found that in nonmeditator samples, a four-factor hierarchical structure without the “observing” facet provided a superior fit compared with a five-factor hierarchical structure. As it is likely that our current sample has little or no previous meditation experience, “observing” items were excluded. FFMQ-15 items were rated on a 5-point Likert-type scale ranging from 1 (
Self-Compassion Scale–Short form (SCS-12; Raes, Pommier, Neff, & Van Gucht, 2011)
This 12-item measure is a short form of the original 26-item scale (Neff, 2003). The SCS-12 was found to have the same factor structure as the long form, with six factors loading on a higher order self-compassion factor: self-kindness, self-judgment, common humanity, isolation, mindfulness, and overidentification (Raes et al., 2011). Items were rated on a 5-point Likert-type scale ranging from 1 (
Santa Clara Brief Compassion Scale (SCBCS; Hwang, Plante, & Lackey, 2008)
The 5-item SCBCS is a short form of the 21-item Compassionate Love Scale (Sprecher & Fehr, 2005) and measures compassion toward strangers and humankind at large. Responses to items were given on a 7-point Likert-type scale ranging from 1 (
Interpersonal reactivity index (IRI; Davis, 1980)
The 28-item IRI is a multidimensional measure of dispositional empathy, with four subscales: perspective taking, fantasy, empathic concern, and personal distress. Responses were given on a 5-point Likert-type scale from 1 (
Depression, Anxiety, and Stress Scale–Short form (DASS; Henry & Crawford, 2005)
The 21-item shortened version of the DASS measures the severity of core symptoms associated with depression, anxiety, and stress. Participants were asked to indicate the presence of each symptom over the past week. Responses were given on a 4-point Likert-type scale ranging from 0 (
Short Warwick-Edinburgh Mental Well-Being Scale (SWEMWBS; Stewart-Brown et al., 2009)
The seven-item SWEMWBS is a measure of positive mental well-being. This measure involves rating items on a 5-point Likert scale ranging from 1 (
Maslach Burnout Inventory–Human Services Survey (MBI-HSS; Maslach, Jackson, & Leiter, 1981)
The 22-item MBI-HSS was designed to measure work-related burnout in professionals working in the human services such as health care and consists of three distinct subscales: emotional exhaustion, depersonalization, and personal accomplishment. Participants were asked about the frequency with which they have certain experiences and items were answered on a 7-point Likert-type scale ranging from 0 (
Planned Data Analysis
Three CFA models were tested for the 20-item SOCS-O and 20-item SOCS-S: (a) a one-factor model in which all items are direct indicators of a single compassion factor; (b) a five-factor correlated model, with items loading on respective factors from the five-element definition of compassion (Strauss et al., 2016); and (c) a five-factor hierarchical model, where the five factors load on an overarching compassion factor. All CFA models used maximum likelihood estimation with robust standard errors conducted in M
The following five fit indices were used to indicate model-data fit: the comparative fit index (CFI; Bentler, 1990), root mean square error of approximation (RMSEA; Steiger, 1990), nonnormed fit index (NNFI; Bentler & Bonett, 1980), standardized root mean square residual (SRMR), and Akaike information criterion (AIC; Akaike, 1974). Rules of thumb cutoff criteria for determining adequate fit using these indices can be arbitrary and affected by numerous factors such as sample size, data distribution, and model complexity and specifications (e.g., Chen, Curran, Bollen, Kirby, & Paxton, 2008; Marsh, Hau, & Wen, 2004), such that a model may fit the data even when one or more indices suggest inadequate fit (Schermelleh-Engel, Moosbrugger, & Müller, 2003). Consequently, researchers do not recommend their use as absolute, universally applied rules for assessing fit (e.g., Hu & Bentler, 1999; Marsh et al., 2004).
Given these considerations, following Williams et al. (2014), we used both liberal and conservative cutoff points for acceptable fit for the CFI, RMSEA, NNFI, and SRMR: the CFI and NNFI should be close to or greater than .90 (liberal) or .95 (conservative), RMSEA should be .10 or less (liberal) or .06 or less (conservative), and SRMR should be less than .10 (liberal) or .05 (conservative). We also considered the significance of factor intercorrelations and loadings when interpreting model fit. The AIC was used to compare the fit of the models, with lower values indicating superior fit. Although the chi-square test of model fit was reported, the significance of this statistic was not used to indicate model fit because of its hypersensitivity (e.g., to nonnormality, large sample sizes, large correlations between variables; R, B. Kline, 2015).
Internal consistency of total scale and subscale items on the SOCS-O and SOCS-S was assessed using both Cronbach’s alpha and omega total coefficients; values greater than or equal to .70 indicate good internal consistency (Terwee et al., 2007), although for psychological constructs, values below .70 are acceptable (P. Kline, 1999). Cronbach’s alphas were computed using SPSS version 24 (IBM, 2016) and omega total estimates were computed using McNeish’s (2017) Excel spreadsheet using standardized item loadings from the CFA model with superior fit. Omega total estimates were calculated alongside Cronbach’s alpha given well-documented methodological limitations of the latter, such as overly rigid assumptions which are commonly violated and poor performance when compared with alternative measures such as omega total (e.g., McNeish, 2017).
Floor and ceiling effects of the SOCS-O and SOCS-S were examined by calculating the percentage of respondents achieving the highest and lowest possible scores; less than 15% of the sample should receive the highest or lowest score (Terwee et al., 2007). Interpretability was tested by conducting independent
Convergent and discriminant validity were tested by examining whether each scale correlated with the measures detailed in the measures subsection in line with the below predictions. We predicted that the SOCS-O and SCBCS, both scales measuring compassion for others, would be significantly correlated at
Consistent with research which found significant correlations, ranging in size from small-moderate to large, between self-compassion and mindfulness, positive mental health, and well-being (e.g., Durkin, Beaumont, Martin, & Carson, 2016; López et al., 2018; Neff, 2003; Pommier, 2010), but no such relationships between compassion for others and these constructs (e.g., Durkin et al., 2016; López et al., 2018; Pommier, 2010), we predicted that there would be significant correlations between the SOCS-S and the FFMQ-15, SWEMWBS, and all subscales of the DASS small-moderate in size (positive for the FFMQ-15 and SWEMWBS and negative for DASS subscales). Findings supporting these predictions would provide evidence for convergent and discriminant validity for the SOCS-S. It is possible that the lack of significant correlations between compassion for others and mindfulness, well-being, and mental health was due to limitations of existing compassion measures (Strauss et al., 2016) and we therefore explored these findings but did not make specific predictions about the relationships between the SOCS-O and these constructs. Similarly, research has found a moderate-large, negative, significant correlation between self-compassion and burnout but no such relationship between compassion for others and burnout (e.g., Durkin et al., 2016). We therefore expected significant, moderate-large correlations between the SOCS-S and subscales of the MBI-HSS (negative for emotional exhaustion and depersonalization and positive for personal accomplishment) but did not make predictions for the SOCS-O and MBI-HSS.
Moreover, self- and other-compassion are theoretically overlapping constructs and the process of compassion is the same whether it is directed at the self or at others. However, research into the relationship between self- and other-compassion has found no more than a small relationship between these constructs (Durkin et al., 2016; López et al., 2018; Neff & Pommier, 2013; Pommier, 2010). It is currently unclear whether the little or no empirical overlap between self- and other-compassion is due to limitations of the measures used in these studies (e.g., Strauss et al., 2016; Williams et al., 2014) or indicates that these two forms of compassion are largely distinct. Thus, no specific hypotheses were made regarding the correlation between the SOCS-O and SOCS-S, but these findings were explored. Last, to test discriminant validity, none of the relationships between the SOCS-O or SOCS-S and other measures were expected to correlate so highly (
Results
Confirmatory Factor Analysis
Compassion for others
However, 1,242 health care staff completed the SOCS-O and were included in the CFA. There were no missing item-level data; all 1,242 participants completed all SOCS-O items. Table 1 shows the fit indices for the three CFA models. Almost all fit indices indicated poor fit of the one-factor model to the data, suggesting that items are not direct indicators of an overarching compassion factor. All fit indices indicated good fit of the five-factor and five-factor hierarchical models according to both liberal and conservative criteria. All loadings of items on factors in these two models were significant. All factor intercorrelations in the five-factor model were significant. In the five-factor hierarchical model, all loadings of factors on the overarching compassion factor were significant, suggesting that the five factors are elements of an overall compassion for others construct. Based on both the fit indices and significance of factor loadings, the five-factor hierarchical model can be interpreted as best fitting the data. Table S3 (see supplementary materials available online) presents the standardized loadings of items on to factors in the five-factor hierarchical model for the SOCS-O and Table S4 (see supplementary materials available online) the standardized factor loadings in the five-factor hierarchical model. Table S5 (see supplementary materials available online) shows the correlations between total scale and subscale scores on the SOCS-O in the health care staff validation sample.
Fit Indices for Compassion Models Tested in Both Validation Samples (Stages 3 and 4).
Five-factor hierarchical refers to a model in which all five factors load on an overarching compassion factor.
Compassion for the self
A total of 1,216 health care staff completed the SOCS-S and were included in the CFA. There were no missing item-level data; all 1,216 participants completed all SOCS-S items. Table 1 presents the fit indices for the three CFA models. All indices suggested poor fit of the one-factor model but adequate fit of the five-factor and five-factor hierarchical models. All item loadings in the two five-factor models were significant. All factor intercorrelations in the five-factor model were significant and all factor loadings in the five-factor hierarchical model were significant, suggesting that the five factors are related and are elements of an overall compassion for the self-construct. Based on both the fit indices and significance of factor loadings, the five-factor hierarchical model can be seen as best fitting the data. Table S6 (see supplementary materials available online) displays the standardized item loadings in the five-factor hierarchical model for the SOCS-S and Table S4 (see supplementary materials available online) the standardized factor loadings in the five-factor hierarchical model. Table S5 (see supplementary materials available online) presents the correlations between total scale and subscale scores on the SOCS-S in the staff validation sample.
Internal consistency
Omega total estimates, calculated using standardized item loadings from five-factor hierarchical models, ranged from .76 to .97 for total SOCS-O scale and subscale items and from .74 to .97 for total SOCS-S scale and subscale items (Table 2). Cronbach’s alphas ranged from .74 to .94 for total SOCS-O scale and subscale items and from .75 to .93 for total SOCS-S scale and subscale items (Table 2). These values are considered adequate for measures of psychological constructs (P. Kline, 1999; Terwee et al., 2007).
Cronbach’s Alpha and Omega Total Coefficients for SOCS-O and SOCS-S Scale and Subscale Items in Both Validation Samples (Stages 3 and 4).
Floor and Ceiling Effects
Less than 15% of the sample received the highest score (100) or lowest score (20) on the SOCS-O and SOCS-S; 0.1% and 0.2% of participants received the lowest possible score on the SOCS-O and SOCS-S, respectively, and 1.6% and 0.3% of participants received the highest possible score on the SOCS-O and SOCS-S, respectively, suggesting that both scales capture variability in responses.
Interpretability
Table 3 displays the means and standard deviations of total SOCS-O and SOCS-S scores across subgroups of participants. As predicted, females scored significantly higher on the SOCS-O compared with males,
Means and Standard Deviations of Total SOCS-O and SOCS-S Scores for all Participants and Participant Subgroups Using Available Data From Both Validation Samples (Stages 3 and 4).
Convergent and Discriminant Validity
Table 4 shows the correlation coefficients between total scale and subscale scores on the SOCS-O and SOCS-S and other constructs. Consistent with predictions, the SOCS-O had significant and large correlations with the SCBCS and empathic concern and perspective taking subscales of the IRI, and the SOCS-S had a significant and large correlation with the SCS-12. The SOCS-O was also significantly and negatively related to the IRI personal distress subscale. Additionally, the SOCS-S was significantly correlated in expected directions with the FFMQ-15, SWEMWBS, and DASS subscales, with correlations ranging from moderate-large to large in size. We also found significant, small-moderate correlations between the SOCS-O and the FFMQ-15 and SWEMWBS, and small, but significant, negative relationships between the SOCS-O and stress and depression (DASS). As predicted, the SOCS-S had significant and moderate-large correlations in expected directions with all subscales of the MBI-HSS. Although we did not make specific predictions for the SOCS-O, it was found to have significant, small-moderate correlations with MBI-HSS depersonalization (negative direction) and personal accomplishment (positive direction). Taken together, at least three quarters of results were consistent with predictions, at least two correlations were large (
Correlation Coefficients Between Total Scores on the SOCS-O and SOCS-S and Other Constructs in Both Health Care Staff (Nonitalicized values) and Student (Values in Italics) Validation Samples (Stages 3 and 4).
Items from the observing subscale were excluded from the total FFMQ-15 score. bThe MBI-HSS was administered to a subset of the Stage 3 health care staff sample (
Relationship Between Compassion for the Self and Others
Health care staff scored significantly higher on the SOCS-O compared with the SOCS-S,
Stage 4: Cross-Validating Factor Structures Using CFA
Stage 4 applied CFA to data from a sample of university students to cross-validate the factor structures of the SOCS-O and SOCS-S. As in Stage 3, we hypothesized that the five-element model of compassion, which conceptualizes a hierarchical structure, would be a good fit to data for both the SOCS-O and SOCS-S.
Method
Participants and Procedure
A sample of 371 undergraduate university students completed an anonymous online survey on Qualtrics containing self-report measures. Students were studying psychology in a university in the south of the United Kingdom. The mean age of the sample was 19.63 years (
Measures
The measures used in Stage 3 were administered to students, with the exception of the MBI-HSS. Cronbach’s alphas for these measures were as follows: .80 (FFMQ-15 without observing items), .87 (SCS-12), .91 (SCBCS), .81 (IRI perspective taking), .78 (IRI empathic concern), .80 (IRI personal distress), .84 (DASS stress), .82 (DASS anxiety), .89 (DASS depression), and .86 (SWEMWBS). The SOCS-O and SOCS-S appeared separately, either at the start or end of the survey, and their order was counterbalanced.
Planned Data Analysis
Analyses in this stage mirrored those in Stage 3, with the exception of interpretability; in this student sample, data on level of education was not obtained. The same criteria and/or predictions were used to determine model fit and adequate psychometric properties.
Results
Confirmatory Factor Analysis
Compassion for others
All 371 students completed all SOCS-O items and were included in the CFA. Table 1 displays the fit indices for the three CFA models tested on the SOCS-O in this sample. As with the CFA findings from Stage 3, fit indices indicated poor fit of the one-factor model but good fit of the five-factor and five-factor hierarchical models. All item loadings in these two models were significant. Factor intercorrelations in the five-factor model were significant and all loadings of factors on the overarching compassion factor in the five-factor hierarchical model were significant, indicating that the five factors are related and elements of an overall other-compassion construct. Based on both the fit indices and significance of factor loadings, the five-factor hierarchical model can be interpreted as best fitting the data. Tables S3 and S4 (see supplementary materials available online) show standardized SOCS-O item loadings and factor loadings, respectively, in the five-factor hierarchical model. Table S7 (see supplementary materials available online) shows correlations between total SOCS-O scale and subscale scores.
Compassion for the self
All 371 students completed all SOCS-S items and were included in the CFA. All fit indices showed acceptable fit of five-factor and five-factor hierarchical models, but poor fit of the one-factor model (Table 1). All item loadings in the two five-factor models were significant. All factor intercorrelations in the five-factor model were significant and factor loadings in the five-factor hierarchical model were significant, indicating that the factors are related and elements of an overall self-compassion construct. Based on both the fit indices and significance of factor loadings, the five-factor hierarchical model can be interpreted as providing the best fit. Tables S6 and S4 (see supplementary materials available online) show standardized SOCS-S item and factor loadings, respectively, in the five-factor hierarchical model. Table S7 (see supplementary materials available online) shows correlations between total SOCS-S scale and subscale scores.
Internal Consistency
In Table 2, both omega total estimates, calculated using standardized item loadings from five-factor hierarchical models, and Cronbach’s alphas indicate acceptable internal consistency for both the SOCS-O and SOCS-S in this sample.
Floor and Ceiling Effects
None of the students received the lowest possible score on the SOCS-O/SOCS-S (20) and 0% and 0.3% received the highest possible score on the SOCS-O and SOCS-S (100), respectively.
Interpretability
Table 3 shows the means and standard deviations of total SOCS-O and SOCS-S scores across subgroups of students. As expected, females scored significantly higher on the SOCS-O compared with males,
Convergent and Discriminant Validity
Table 4 shows correlations between total and subscale scores on the SOCS-O and SOCS-S and other constructs. As predicted, the SOCS-O was significantly correlated with the SCBCS and empathic concern subscale of the IRI, and the SOCS-S was significantly correlated with the SCS-12, at
Relationship Between Compassion for the Self and Others
Students scored significantly higher on the SOCS-O compared with the SOCS-S,
Discussion
The aim of this program of research was to develop and evaluate the psychometric properties of two new self-report measures of compassion: the SOCS-O and SOCS-S. Findings from Stages 1 and 2 yielded the 20-item SOCS-O and SOCS-S and findings from Stages 3 and 4 support the factor structures and demonstrate robust psychometric properties of both scales.
For both scales, in both health care staff and student samples, a five-factor hierarchical model, with items loading on respective factors from the five-element compassion definition and factors loading on an overarching compassion factor (Strauss et al., 2016), was found to fit the data well as predicted. Internal consistency of total SOCS-O and SOCS-S scale and subscale items was adequate and the scales showed no indication of floor and ceiling effects. We also facilitated the interpretability of scores on both scales. For example, in both samples, females scored significantly higher on the SOCS-O compared with males, and in health care staff only, SOCS-O scores also significantly differed based on length of previous meditation experience, with those with more meditation experience scoring significantly higher compared with those with little or no meditation experience. In both samples, SOCS-S scores significantly differed based on previous meditation experience, with those with more meditation experience scoring significantly higher compared with those with little or no meditation experience.
The SOCS-O and SOCS-S also showed evidence of convergent and discriminant validity. Consistent with predictions, in both samples, the SOCS-O was significantly correlated with scales measuring compassion for others and empathy and the SOCS-S significantly correlated with an existing Self-Compassion Scale, with correlations large in size, but not so large as to indicate the SOCS-O and SOCS-S are redundant. As hypothesized, the SOCS-S significantly correlated in expected directions with measures of mindfulness, well-being, stress, anxiety, depression, and burnout with correlations ranging from moderate-large to large in size, but not so large as to suggest that they are measuring the same construct. We also found significant small-moderate correlations between the SOCS-O and measures of mindfulness, well-being, and burnout, and significant, small correlations between the SOCS-O and mental health. However, there were key differences between the SOCS-O and SOCS-S in terms of their patterns of association with mental health outcomes. Whereas SOCS-S scale and subscales were in general significantly and negatively correlated with stress, anxiety, and depression, the relationship between the SOCS-O and mental health outcomes was more variable; only the universality and tolerating subscales showed significant, negative relationships which were largely consistent across both samples, and total SOCS-O was significantly correlated with just stress and depression in the health care sample and just depression in the student sample.
Our findings on the relationship between the SOCS-S and related variables support previous research, but current findings on the SOCS-O contrast with previous research which found no relationship between compassion for others and mindfulness, mental health, well-being, and burnout (e.g., Durkin et al., 2016; López et al., 2018; Pommier, 2010). This suggests that the lack of relationship between other-compassion and these constructs may be partly attributable to limitations of existing compassion measures around content validity, item wording, and psychometric properties (Strauss et al., 2016). We also found, for both health care staff and students, a significant and small-moderate to moderate correlation between the SOCS-O and SOCS-S. This is at odds with previous research which at best have found small correlations between self- and other-compassion in nonmeditator and student samples (López et al., 2018; Neff & Pommier, 2013; Pommier, 2010) and at worst found a small-moderate and negative, but nonsignificant, correlation between self- and other-compassion in students (Durkin et al., 2016). Previous findings of little or no empirical overlap between the two may be in part due to issues with the previous measures of compassion used in these studies.
Taken together, current findings support the multidimensional conceptualization of compassion proposed by Strauss et al. (2016) and present the SOCS-O and SOCS-S as new, psychometrically robust self-report measures which overcome limitations of previous compassion scales. Key limitations of previous compassion measures addressed include those relating to content validity (e.g., making sure items are related to suffering and do not tap related constructs), item wording (e.g., making sure items are worded in line with the response scale and frequency terms are omitted), and psychometric properties (e.g., demonstrating adequate internal consistency of scales and subscales and evidence for factor structure).
Implications
Current findings have theoretical implications for our understanding of compassion constructs, and how they relate to each other and to psychological outcomes, and research and clinical implications. We found that greater compassion for the self and others were related to increased mindfulness and well-being, and decreased burnout, stress, depression, and anxiety. This provides initial support for the cultivation of compassion (e.g., through CBIs) to improve psychological functioning. These findings are particularly valuable given that previous studies have failed to find links between compassion for others and these processes and outcomes (e.g., Durkin et al., 2016; López et al., 2018; Pommier, 2010).
We also found that the same model fit both self-compassion and other-compassion data well, and found small-moderate to moderate, and significant, correlations between the SOCS-O and SOCS-S. This contributes to the discourse on the relationship between self- and other-compassion and has implications for future research examining this association. Our findings are consistent with the notion that compassion refers to a process that can orient both to the self or others and indicate that self- and other-compassion are overlapping constructs. This contrasts to previous research which found little or no empirical overlap between compassion for the self and others (Durkin et al., 2016, López et al., 2018; Neff & Pommier, 2013; Pommier, 2010). Future research should therefore not be deterred by initial findings indicating no relationship between self- and other-compassion and it would be valuable to explore this relationship further.
Moreover, the current program of research addressed an important omission in compassion research and practice by developing valid and reliable measures of compassion. Being able to measure compassion using robust tools is necessary for the growth of this field. We anticipate that these scales will prove valuable in progressing key research areas, including building an evidence base for CBIs by facilitating evaluation of their effectiveness and underlying mechanisms.
Limitations and Future Research
The SOCS-O and SOCS-S require further testing. Some psychometric properties were not assessed as these were beyond the scope of the current program of research. These include test–retest reliability, sensitivity to change over the course of a CBI or other interventions which would theoretically cultivate compassion, and further tests of convergent and discriminant validity with additional theoretically related and unrelated constructs. Although a total score can be derived from each scale, as these were designed to be multidimensional measures, it would also be important for future studies to examine in greater detail how the five elements of self/other-compassion interact with one another and how they relate independently, and collectively, to outcomes.
Given that compassion is a culturally valued construct, it would be beneficial for future research to examine the extent to which compassion, as measured by the SOCS, overlaps with social desirability, as the basis for taking any social desirability into account when interpreting SOCS scores and findings.
Due to the anonymous nature of the online surveys administered to health care staff in Stages 2 and 3, we were not able to ensure that the two samples were entirely independent. In both Stages, study adverts and survey titles were the same, which would have minimized chances of health care staff completing both sets of surveys. However, future research should try to employ measures to ensure complete independence of validation samples.
Although the current program of research validated the SOCS-O and SOCS-S in health care staff and student samples, research in this field has also recruited from other populations (e.g., clinical populations, meditators, general population) and the scales would benefit from cross-validation in such populations to further support their use and inform understanding of compassion across different groups. Complementing item development through consultation with experts from six continents, and given that the dominant ethnicity of both health care staff and student samples in this study was White and both were U.K. samples, future research should also cross-validate the factor structures of the SOCS-O and SOCS-S in samples from other cultures and countries. As part of this line of research, the compassion scales could be translated into different languages which would enable investigation of research questions such as whether there are cross-cultural differences in the strength of the relationship between self- and other-compassion, and compassion and psychological functioning. Alongside cross-validation of the SOCS-O and SOCS-S in different populations, future studies should also evaluate measurement invariance, and if this holds, latent mean differences across groups. For example, given that the current samples mainly consisted of white females, future studies could assess measurement invariance by gender and culture.
Our findings are consistent with the suggestion that interventions designed to cultivate compassion could improve emotional health outcomes (Kirby et al., 2017). However, we used a cross-sectional design and direction of effects cannot be determined. Future research evaluating the effectiveness of CBIs using the SOCS-O and SOCS-S would provide a more robust test of this possibility. Similarly, the relationship between compassion and burnout is consistent with the observation of diminishing compassion in cases of work-related burnout in the health care sector (Joinson, 1992), although the direction of effect cannot be determined from our findings. Using longitudinal designs would also help address any potential common method variance associated with collecting self-report data from the same respondents at a single point in time (Podsakoff, MacKenzie, Lee, & Podsakoff, 2003).
Self-report methods are not without their limitations and are likely to provide only a partial picture of compassion. It would be beneficial for future research to explore whether the SOCS-O and SOCS-S can be triangulated with nonself-report methods of assessing compassion. For example, baseline SOCS-O and SOCS-S scores and/or change in scores over intervention could be correlated with baseline performance and/or change in performance over intervention on behavioral tasks assessing compassion, such as prosocial games (e.g., the Zurich Prosocial Game; Leiberg, Klimecki, & Singer, 2011). Research which uses both self-report and alternative methods of assessing compassion would also help address any common method variance. However, challenges remain in developing behavioral tasks that can clearly distinguish compassion from distinct but related constructs such as prosocial behavior, empathy, and altruism. With this in mind, the SOCS-O and SOCS-S have the advantage of accessing the private cognitive and emotional motivations that are part of the compassion construct. They may also be helpful in developing and refining behavioral measures which specifically capture compassion.
Conclusion
Progress in the field of compassion requires robust measures that comprehensively capture compassion for others and compassion for the self. The current program of research developed new theoretically informed and psychometrically robust self-report measures of compassion: the SOCS-O and SOCS-S. Findings support the factor structures of both scales in health care staff and student samples. Both the SOCS-O and SOCS-S consist of the following five subscales which can be seen as elements of an overall self- or other-compassion construct: (a) recognizing suffering, (b) understanding the universality of suffering, (c) feeling for the person suffering and emotionally connecting with their distress, (d) tolerating uncomfortable feelings aroused so that we remain open to and accepting of them in their suffering, and (e) acting or being motivated to act to alleviate suffering. Findings also support the psychometric properties of both scales in terms of their internal consistency, interpretability, floor and ceiling effects, and convergent and discriminant validity. Taken together, the rigorous development process employed in the current research program and emergent psychometric properties of the SOCS-O and SOCS-S support their use in compassion research and practice.
Supplemental Material
Supplementary_materials_(clean)_r1 – Supplemental material for Development and Psychometric Properties of the Sussex-Oxford Compassion Scales (SOCS)
Supplemental material, Supplementary_materials_(clean)_r1 for Development and Psychometric Properties of the Sussex-Oxford Compassion Scales (SOCS) by Jenny Gu, Ruth Baer, Kate Cavanagh, Willem Kuyken and Clara Strauss in Assessment
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported by the Economic and Social Research Council (ESRC) and Sussex Partnership NHS Foundation Trust in a joint Doctoral Training Scholarship awarded to the first author (Grant ES/J500173/1).
Supplemental Material
Supplemental material for this article is available online.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
