Abstract
Several measures that assess loneliness have been developed for adults. Across three studies, we investigated psychometric features of scores of different versions of the Rasch-Type Loneliness Scale, the University of California Los Angeles Loneliness Scale, and three single-item measures. In Study 1 (N = 697 self-ratings, N = 282 informant-ratings of 160 targets) and Study 2 (N = 1,216 individuals from 608 couples), we investigated convergent validity, self-informant agreement, and nomological nets of the item scores using correlates related to demographic aspects, personality, satisfaction, and network characteristics. In Study 3 (N = 411), we estimated a reliability of
Keywords
Humans are social beings, and they rely and depend on each other (Axelrod & Hamilton, 1981; Baumeister & Leary, 1995). Interpersonal relationships are among the most powerful sources for well-being and health (Baumeister & Leary, 1995; S. Cohen, 2004; Hartup & Stevens, 1997; House et al., 1988). When interpersonal relationships are perceived as deficient regarding quantitative (e.g., network size, number of friends) and/or qualitative aspects (e.g., relationship closeness, trust, intimacy), individuals experience loneliness (Ernst & Cacioppo, 1999; Perlman & Peplau, 1981).
The large body of literature on loneliness that has accumulated is very consistent in demonstrating the negative effects of loneliness on health-related outcomes (for reviews, see Cacioppo et al., 2015; Ernst & Cacioppo, 1999; Hawkley & Cacioppo, 2010; Hawkley & Capitanio, 2015; Heinrich & Gullone, 2006; Holt-Lunstad et al., 2015) and interpersonal outcomes (Cacioppo et al., 2017; Lodder et al., 2017; Mund et al., 2022; Mund & Johnson, 2021; Spithoven et al., 2018). This consistency is to be highlighted given the multitude of loneliness measures that have been developed over the last decades, including unidimensional and multidimensional multi-item scales, various abbreviated versions thereof, and a variety of single-item measures. Although issues of validity have been investigated in prior research, these studies are limited in that they (a) have examined either convergent validity between item scores, but not their correlations with external variables (e.g., Cacioppo, Hawkley, et al., 2006; Grygiel et al., 2013; Iecovich, 2013; Maes et al., 2017; D. Russell et al., 1980), (b) examined the nomological net of scores of only one or two measures at once (Cacioppo, Hawkley, et al., 2006; Shiovitz-Ezra & Ayalon, 2012; Victor et al., 2005), and, when doing so, (c) mostly only considered demographic variables such as age, gender, and education as external correlates of loneliness (Borys & Perlman, 1985; Shiovitz-Ezra & Ayalon, 2012; Victor et al., 2005). Thus, it is unclear whether item scores of different loneliness measures for adults tap into different aspects of loneliness or capture a common core.
In this study, we extend prior work in four ways. First, we simultaneously examined the convergent validity among and the nomological net of scores of the most widely used measures of loneliness in adulthood (Buecker et al., 2020, 2021; Maes et al., 2019; Mund, Freuding, et al., 2020; Pinquart & Sörensen, 2001): the Rasch-Type Loneliness Scale (RTLS) and its facets emotional and social loneliness (de Jong Gierveld & Kamphuis, 1985; de Jong Gierveld & van Tilburg, 2006), the University of California Los Angeles Loneliness Scale (UCLA-LS) in its full (D. Russell et al., 1978, 1980; D. W. Russell, 1996) and two abbreviated versions (Hawkley et al., 2015; Hays & DiMatteo, 1987), and three single-item measures of loneliness. Second, we examined the convergence between scores of self-ratings and informant-ratings of loneliness to extend the assessment of the validity beyond self-reports (McCrae et al., 2004; Podsakoff et al., 2003). Third, in addition to demographic correlates of loneliness, we investigated the nomological net of scores of the various measures using a wide variety of constructs (e.g., Big Five personality traits, domain and life satisfaction, social support, depressiveness, and shyness) to pinpoint similarities and differences between item scores of the selected measures. Fourth, we examined the reliability of scores of three single-item measures to investigate the often-raised criticism that such measures lack reliability (D. Russell, 1982).
To achieve these goals, we conducted three independent studies. Studies 1 and 2 were designed to investigate the validity of item scores and agreement between self-ratings and ratings by close others and to examine the nomological nets of scores of different loneliness measures. In Study 3, we examined the reliability of scores of three single-item measures of loneliness using a procedure developed by Heise (1969) in a study extending over three measurement occasions, each 2 weeks apart.
Measures of Loneliness
Given its subjective nature (Ernst & Cacioppo, 1999; Perlman & Peplau, 1981), loneliness has most often been assessed using self-reports (Marangoni & Ickes, 1989; D. Russell, 1982). The various self-report measures that have been developed in recent decades primarily differ regarding three aspects (D. Russell, 1982): (a) the dimensionality of the instrument (unidimensional vs. multidimensional), (b) the number of items (i.e., single-item measures vs. multi-item scale), and (c) whether the instrument assesses loneliness in a direct or an indirect way.
Unidimensional and Multidimensional Measures
Measurement instruments to assess loneliness can be differentiated according to whether they were developed based on the conceptualization of loneliness as a unidimensional construct or as a construct with multiple interrelated facets. Instruments such as the UCLA-LS (D. Russell et al., 1978, 1980; D. W. Russell, 1996) have been designed to assess global loneliness. This notion has been challenged by several studies showing the UCLA-LS scores to be multidimensional and to be accounted for by two (Cramer & Barry, 1999; Knight et al., 1988; Maes et al., 2017; Wilson et al., 1992; Zakahi & Duran, 1982), three (Cacioppo, Hawkley, et al., 2006; Hawkley et al., 2005; McWhirter, 1990; Shevlin et al., 2015), or even five factors (Cacioppo, Hawkley, et al., 2006). By contrast, other studies support the notion of the UCLA-LS scores being essentially unidimensional (Dodeen, 2015; Hartshorne, 1993; Lasgaard, 2007; McDanal et al., 2021; D. Russell et al., 1980; D. W. Russell, 1996). Despite this controversy, we decided to use the UCLA-LS scores as unidimensional in the present work for two main reasons. First, there is no consensual multidimensional solution of UCLA-LS item scores in the literature, and even studies reporting the same number of underlying factors differ markedly in which items are allocated to which factor and how those factors are interpreted. Second, in most research contexts, scores on the UCLA-LS seem to be treated as unidimensional. Specifically, several recent meta-analyses refrained from considering specific aspects of loneliness because there were not enough studies using the UCLA-LS reporting on those facets (Buecker et al., 2020, 2021; Maes et al., 2019; Mund, Freuding, et al., 2020).
Other scales such as the RTLS (de Jong Gierveld & Kamphuis, 1985; de Jong Gierveld & van Tilburg, 2006) have been designed to capture multiple aspects of loneliness such as emotional and social loneliness. Multidimensional approaches offer the opportunity to measure loneliness in a more nuanced fashion and to better understand differential predictors and consequences of those facets. However, when taking the RTLS as an example, surprisingly few studies seem to build on its multidimensionality (for meta-analyses, see Buecker et al., 2020; Mund, Freuding, et al., 2020). This might be due to relatively high correlations between scores of the two facets of the RTLS (Grygiel et al., 2013; Iecovich, 2013; Mund & Neyer, 2016), possibly indicating that the two facets capture actually the same aspects of loneliness. In this study, we will examine both global loneliness as measured by scores of the RTLS and its facets emotional and social loneliness.
Single Items and Multi-Item Scales
Single-item measures of loneliness have been used frequently (Mund, Freuding, et al., 2020; Pinquart & Sörensen, 2001) and have been attested substantial content validity (D. Russell, 1982). However, their usage has been discouraged for several reasons (Marangoni & Ickes, 1989; D. Russell, 1982). For example, the validity of scores of single-item measures has been questioned due to possible socially desirable responding and lack of standardization (Marangoni & Ickes, 1989; D. Russell, 1982). That is, there is a plethora of single-item measures of loneliness that differ in wording and response format so that it is not fully clear whether scores of all these measures tap into the same construct. Studies comparing scores of single-item with multi-item measures have found a lower prevalence of loneliness when single items are used (Eccles et al., 2020; Victor et al., 2005, but see Shiovitz-Ezra & Ayalon, 2012, for a contrasting finding). Moreover, it seems that the individuals who are classified as “lonely” based on cut-off scores on multi-item scales are not the same individuals who are classified as “lonely” based on a cut-off score applied to single-item measures (Eccles et al., 2020; Shiovitz-Ezra & Ayalon, 2012). Thus, it might be questioned whether scores of single items capture the complexity of the construct in a way scores of multi-item scales do. Apart from prevalence rates, however, the results obtained with scores of single-item measures converge very well with those obtained with scores of multi-item scales regarding demographic aspects (Shiovitz-Ezra & Ayalon, 2012; von Soest et al., 2020), aspects of mean-level and rank-order stability (Mund, Freuding, et al., 2020; Mund, Lüdtke, & Neyer, 2020), personality correlates (Buecker et al., 2020), and health outcomes (Beutel et al., 2017; Eccles et al., 2020), including early mortality (Holt-Lunstad et al., 2015; Shiovitz-Ezra & Ayalon, 2010).
It has also been argued that scores of single items are unreliable and capture more noise than substantial interindividual differences (Marangoni & Ickes, 1989; D. Russell, 1982). In line with this reasoning, a study employing STARTS models (Kenny & Zautra, 2001) found that scores of single-item measures capture more (potentially unreliable) state variance (≈50% state variance) than scores of longer scales such as a six-item version of the RTLS (≈25% state variance; Mund, Lüdtke, & Neyer, 2020). However, it has been shown for scores of single-item measures of subjective well-being that state variance can contain a substantial amount of reliable variance (Lucas & Donnellan, 2012). Similarly, Robins et al. (2001) demonstrated high reliability of scores of a single-item measure of self-esteem (
Direct and Indirect Measures of Loneliness
In direct measures of loneliness, the respondents are confronted with the target construct insofar as words such as “lonely” are contained in the questions (e.g., “I am lonely”; “How often have you felt very lonely?”). Indirect measures, by contrast, avoid such references to loneliness (D. Russell, 1982). One item from the UCLA-LS, for example, asks participants to indicate “How often do you feel left out?,” and an item from the RTLS asks participants to rate to what extent they agree with the statement “I experience a general sense of emptiness” (indicative of emotional loneliness).
The reasoning for using indirect measures is to disguise the researchers’ interest in loneliness, thereby avoiding the activation of negative stereotypes and circumventing socially desirable responding (D. Russell, 1982). Previous studies have shown that gender differences in the prevalence of loneliness can be observed when scores of direct measures are used, with men reporting lower levels of loneliness on average than women (Borys & Perlman, 1985). Those gender differences disappear or even reverse when scores of indirect measures are used (Maes et al., 2019). It should be noted that direct measures of loneliness are often single items, which, necessarily, are unidimensional. By contrast, most multi-item measures are indirect measures of loneliness (for an exception, see Rubenstein & Shaver, 1980), and some of them are multidimensional.
Convergent Validity, Self-Informant Agreement, and Nomological Nets
Previous studies comparing measures of loneliness for adults usually included no more than two measures at once and primarily considered their association with demographic variables (Borys & Perlman, 1985; Cacioppo, Hawkley, et al., 2006; Hughes et al., 2004; Iecovich, 2013; Maes et al., 2017; Shiovitz-Ezra & Ayalon, 2012; Victor et al., 2005; von Soest et al., 2020). Regarding aspects of validity, we extend prior research by (a) considering scores of multiple measures, and their facets, of loneliness for adults, (b) investigating correlations among self-reports and the agreement between self-ratings and informant-ratings on item scores of those measures, and (c) considering a broad range of demographic and psychological variables to investigate similarities and differences in the nomological nets of scores of those measures.
Although several studies have investigated the psychometric properties of scores of multiple loneliness measures for children and adolescents (Cole et al., 2021; Goossens & Beyers, 2002; Maes et al., 2017), studies on scores of loneliness measures for adults have only compared scores of two measures at once (Borys & Perlman, 1985; Cacioppo, Hawkley, et al., 2006; Hughes et al., 2004; Iecovich, 2013; Shiovitz-Ezra & Ayalon, 2012; Victor et al., 2005; von Soest et al., 2020), with a particular focus on the UCLA-LS. However, examining the convergent validity of item scores of multiple measures is important to ensure that scores of all those measures capture interindividual differences in the same construct (Cronbach & Meehl, 1955).
The correlation between self-reports and informant-reports is an important aspect of construct validity (Connelly & Ones, 2010; McCrae et al., 2004; Podsakoff et al., 2003). If scores of an instrument merely capture noise instead of reliable interindividual differences, the correlation with informant-ratings would be low (McCrae, 2015; Podsakoff et al., 2003). Thus, self-informant agreement is indicative of the substance of the measured construct (McCrae et al., 2004). Using a nine-item version of the UCLA-LS, Luhmann et al. (2016) found self-informant agreements for partnered individuals of
As another facet of self-informant agreement, it is also possible to examine mean-level differences in self-ratings and informant-ratings of loneliness. While the correlation between self-ratings and informant-ratings pertains to the detectability of interindividual differences, comparing mean levels provides an evaluation of the exact agreement between self-reports and informant-reports (Kim et al., 2019). Previous studies on mean-level differences in loneliness measured using the UCLA-LS have found that parents and friends, but not partners, tend to underestimate the degree of target’s loneliness (Lee & Ko, 2018; Luhmann et al., 2016). No studies have examined mean-level differences between self-ratings and informant-ratings in scores of measures other than the UCLA-LS so far.
The construct validity of the scores of measures investigated in the present work is further examined using a comprehensive nomological net (Cronbach & Meehl, 1955). Previous studies comparing the nomological nets of scores of loneliness measures for adults (Borys & Perlman, 1985; Cacioppo, Hawkley, et al., 2006; Hughes et al., 2004; Iecovich, 2013; Shiovitz-Ezra & Ayalon, 2012; Victor et al., 2005; von Soest et al., 2020) were restricted to a few, mostly demographic aspects. In the present work, we investigated broad nomological nets of scores of the selected measures including demographic aspects, personality characteristics, domain-specific and life satisfaction, and aspects of social networks. Large differences in the nomological nets across item scores would indicate that researchers and practitioners need to choose their measures carefully to make sure that the obtained item scores measure the intended aspects.
The Present Work
In this article, we present a series of three studies. In Study 1, we investigated the validity (convergence, self-other agreement, and nomological nets) of scores of six measures of loneliness in a sample of young adults. In Study 2, we examined the validity of scores of four partly different measures of loneliness than those used in Study 1. Furthermore, data in Study 2 were collected as part of a larger project on partner relationships; accordingly, informants in Study 2 were romantic partners. In Study 3, we investigated the reliability of scores of three single-item measures of loneliness by designing a short-term prospective study with three measurement occasions each 2 weeks apart. The single studies were not preregistered. Analysis scripts and data used in the analyses are accessible at https://osf.io/7gsfw/.
Study 1: Validity of Scores of Six Loneliness Measures
Method
Sample
The study was conducted in May 2018 as a cross-sectional online study using the formr survey framework (Arslan et al., 2020). The study was announced on several university mailing lists and on dedicated websites for people interested in psychological studies. The study protocol was approved by the Institutional Review Board of the Friedrich-Schiller-Universität Jena, Germany (FSV 18/03). A total of 697 German-speaking individuals participated in the self-rating part of the study. Among those, 72.17% were female. The majority of the participants were students (73.39%), 15.88% of the participants were employed or self-employed, 2.43% were enrolled in nonacademic education (e.g., vocational learning, traineeship), and 1% were fulfilling a civil service; 7.30% of the participants were unemployed, unable to work because of ill-health, or not working for other reasons (e.g., retired, parental leave). On average, participants were 26.06 years (SD = 9.84, Mdn = 23) old, ranging from 18 to 99 years. After completion of the survey, participants received personalized feedback on their personality and well-being, and could sign up for a voucher lottery.
Very early in the survey, participants were asked to provide contact details of up to six individuals who could provide ratings of the targets’ personality and well-being. Those contacts were then sent an email and asked to fill in a brief questionnaire. In this way, we obtained 282 informant-ratings on 160 targets. On average, each of the 160 participants was rated by 1.68 informants (SD = 1.03). Most of the targets (60.3%) were rated by one, 21.8% were rated by two, 9.9% by three, 6.2% by four, 1.2% by five, and 0.6% by six informants, respectively. On average, the informants were 28.25 years old (SD = 13.58, Mdn = 22), ranging from 18 to 70 years. Among the informants, 52% were friends of the targets, 23% were kin (e.g., parents, siblings), 18% were the target’s romantic partner, 3% were acquaintances, 3% were colleagues, and 1% indicated to be in another relationship with the target. Given the uneven distribution of types of informants, we aggregated the ratings of all informants in the analyses.
Informants were asked to indicate how well they knew the target using a scale ranging from 0 (not well) to 10 (very well). On average, informants stated to know the targets very well (8.43, SD = 1.54, Mdn = 9). Upon completion of the survey, informants were offered the option to sign up for a voucher lottery. Informants were also invited to take part in the self-rating. In this way, it was possible that self-raters also served as informants for other targets. Furthermore, it is possible that some individuals provided informant-ratings for more than one person. The primary data used for the analysis are available at https://osf.io/7gsfw/.
Measures
Details on internal consistency, sample items, response formats, descriptive statistics, and zero-order correlations are provided in Supplemental Tables S1 to S3.
Loneliness
The loneliness measures were presented in the self-rating and informant-rating parts of the study.
Rasch-Type Loneliness Scale
We used the 11-item version of the RTLS (de Jong Gierveld & Kamphuis, 1985) to assess global loneliness and the facets emotional and social loneliness. Internal consistency (coefficient
UCLA Loneliness Scale (UCLA-LS20Items)
We used the German version of the 20-item UCLA-LS (Döring & Bortz, 1993; D. Russell et al., 1980). Internal consistency was high for self-ratings (
Three-Item Version of the UCLA Loneliness Scale (UCLA-LS3Items)
We employed a three-item version of the UCLA-LS that has been developed for use in large panel studies (Hawkley et al., 2015; Luhmann & Hawkley, 2016). Internal consistency was satisfactory in self-ratings (
Direct Single-Item Measure (SI Direct)
As a direct single-item measure of loneliness, we asked the participants to what extent the statement “I feel lonely” applies to them.
Indirect Single-Item Measure (SI Indirect)
As an indirect single-item measure of loneliness, we asked participants to indicate their agreement with the statement “I feel alone.”
Direct Single-Item Frequency Measure (SI Direct frequency )
Adapting the SI Direct, we also asked participants about the frequency of loneliness (i.e., “How often do you feel lonely?”).
Correlates of Loneliness
All correlates were assessed only in the self-rating part of the study. We examined correlates from the domains of (a) demography, (b) personality, (c) satisfaction, and (d) network characteristics.
Demographic Variables
Participants were asked to indicate their age, gender, educational status, and whether they currently have a partner or not.
Personality
The Big Five personality traits were assessed using a 15-item version of the Big Five Inventory (Hahn et al., 2012). Self-esteem was assessed using a three-item version adapted from the Rosenberg Self-Esteem Scale that has been used previously in a large panel study (Huinink et al., 2011). Depressiveness was measured using the 10-item State-Trait Depression Scale (Spaderna et al., 2002). The explicit affiliation motive was assessed using four items from the Unified Motive Scales (Schönbrodt & Gerstenberg, 2012). We also assessed need satisfaction of the affiliation motive by two items from the Basic Need Satisfaction and Frustration Scale (Chen et al., 2015). A product interaction between both variables was calculated to create an index of frustration of the affiliation motive. Social desirability was assessed using the 17-item Social Desirability Scale (Stöber, 2001). Shyness and sociability were assessed using the Shyness and Sociability Scales for adults (Asendorpf & Wilpers, 1998).
Satisfaction
Participants were asked to indicate their overall satisfaction with (a) life, (b) education, (c) leisure, (d) friends and social contacts, (e) family, and (f) partner relationship using items taken from larger panel studies (Huinink et al., 2011; Siedler et al., 2008).
Network Characteristics
Using single-item measures, participants were asked to indicate the number of (a) close friends, (b) friends on Facebook, (c) persons with whom the participant would be ready to discuss personal or occupational problems (i.e., emotional support), (d) persons who could be contacted for practical help (e.g., when relocating; i.e., instrumental support), and (e) persons the participant would ask for advice. Furthermore, participants were asked to indicate the frequency of (f) contact with their closest friend(s) and (g) joint activities with their friends.
Analytic Procedure
Convergent validity, operationalized as the correlation between scores of the six loneliness measures, was assessed separately for self-ratings and informant-ratings. Furthermore, for scores of each measure, we examined the correlation between self-ratings and informant-ratings. Nomological nets of the item scores were examined by calculating the correlations with the external variables (e.g., demography, personality).
To investigate whether scores of the selected loneliness measures differ in their associations with external variables, we conducted pairwise model comparisons. In the first step of this approach, we entered all item scores and allowed unrestricted correlations between the scores of the loneliness measures and the external variables, resulting in a saturated model. In the next step, we constrained the correlation between scores of two loneliness measures (e.g., RTLS
Total
and UCLA-LS20Items) and a correlate (e.g., shyness) to be equal. We then compared the fit of this constrained model with the unconstrained model via a χ2-difference test with 1 degree of freedom. A significant decrease in model fit in this setup indicated that scores of the two loneliness measures differed significantly in their association with the external variable. Due to the large number of pairwise comparisons, we only consider differences significant when the p-value of the χ2-difference test was lower than .001. To facilitate the interpretation of the findings, we computed the average absolute correlation (
To estimate differences in the correlation patterns between self-ratings and informant-ratings as well as between Study 1 and Study 2, we calculated mean absolute deviations between correlation matrices (
To examine the similarity of correlation profiles, we calculated the double-entry intraclass correlation (
Results and Discussion
Preliminary Analyses
Before turning to the key analyses, we conducted some preliminary analyses. First, we examined whether loneliness would be associated with the number of informant-ratings. Across all measures, loneliness was correlated with the number of informant-reports such that individuals with higher scores on loneliness were rated by fewer individuals. On average, this correlation amounted to
Convergent Validity
The intercorrelation between scores of the different measures of loneliness is displayed in Table 1 separately for the self-ratings and aggregated informant-ratings. Overall, the correlations were high for self-ratings (
Study 1: Convergent Validity of Loneliness Scores in Self-Rating and Informant-Rating.
Note. Self-rating (N = 657–679) is below the diagonal and informant-rating (
Within the self-ratings, the correlations between scores of RTLS
Social
and the three single items were lower than other associations and varied between
Taken together, the findings converge with prior research showing high intercorrelations between scores of different measures of loneliness for adults (Iecovich, 2013; D. Russell et al., 1980). Furthermore, we extended prior research by simultaneously examining the convergent validity of scores of multi-item and single-item instruments, and by demonstrating that the high convergence between item scores also extends to informant-ratings. Thus, we conclude that all the instruments included in Study 1 are appropriate to assess between-person differences in loneliness from an internal (i.e., self-ratings) and an external (i.e., informant-ratings) perspective.
Self-Informant Agreement
Table 2 displays the correlations between self-ratings and aggregated informant-ratings. Self-informant agreement on scores of the same scale (shown in the diagonal of Table 2) varied across instruments (.49 ≤ r ≤ .61). The highest self-informant agreement was found for scores of the RTLS
Total
(
Study 1: Self-Informant Agreement of Scores of Different Loneliness Measures.
Note. N = 159–160 pairs between self-ratings and aggregated informant-ratings. All correlations are statistically significant at
To our knowledge, this is the first study that investigated the self-informant agreement on scores of multiple measures of loneliness. The results show that the overall convergence between self-ratings and informant-ratings is high within and across measures. In fact, the amount of self-informant agreement found in the present and previous studies (Lee & Ko, 2018; Luhmann et al., 2016; Mearns et al., 2009) is similar to self-informant agreement in broader personality characteristics such as the Big Five traits (for a meta-analysis, see Connelly & Ones, 2010). Furthermore, the mean differences between self-ratings and informant-ratings are, overall, negligible, indicating that the aggregated informant-ratings provide a relatively accurate description of the target’s loneliness (for similar findings regarding broader personality characteristics, see Hofstee, 1994; Kim et al., 2019; Kolar et al., 1996).
The Nomological Nets of Loneliness Scores
Table 3 displays the correlations between scores of the different loneliness measures and a wide variety of correlates pertaining to demography, personality, satisfaction, and aspects of social networks. All correlations were compared pairwise via a model comparison with 1 degree of freedom. We will not go into detail on these model comparisons but instead highlight the overall pattern of results. To facilitate the inter-pretation, we computed scale-wise average absolute correlations (
Study 1: Nomological Net of Scores of Different Measures of Loneliness (Self-Reports).
Note. Different subscripts indicate a significantly different (
Due to the high correlation between the total score on the RTLS and its subfacets emotional and social loneliness, we ran different batches of models. In one batch, we compared the correlations between scores of the full RTLS and scores of all other scales; in the second batch, we compared the correlations between scores of the emotional and social loneliness facets with scores of all other scales. Hence, we could not test whether correlations differ between the full RTLS and its subfacets. bGender was coded as 1 = female and 0 = male/other. cAverage correlations were calculated based on the absolute z-transformed correlations. After averaging, z-scores were retransformed to
Study 1: Profile Correlations for the Association Between Loneliness Scores and Correlates.
Note. The table shows the correlations of the profiles of different loneliness scores with regard to the correlates displayed in Table 3. Profile correlations are based on z-scores and were calculated using the double-entry ICC. RTLS = Rasch-Type Loneliness Scale; UCLA-LS = University of Los Angeles California Loneliness Scale; SI = single item; ICC = intraclass correlation.
Demography
Item scores of all measures were only modestly related to demographic aspects. Average absolute correlations ranged from
Personality
The average absolute correlations ranged from
Satisfaction With Domains of Life
The average absolute correlations between loneliness scores and domains of satisfaction ranged from
Network Characteristics
Across all item scores, loneliness was associated with smaller friendship and support networks. All scores were uncorrelated with the self-reported number of friends on Facebook. The average absolute correlations ranged from
Profile Similarity
The correlation between the profiles of the loneliness scores and the correlates examined in Study 1 are displayed in Table 4. On average, the profiles of the item scores were very similar to each other, indicating that their nomological nets overlap to a large extent. Nevertheless, it might be noted that the profile correlation between scores of the single items and scores of the RTLS
Total
(.87 ≤ ICCDE ≤ .89), RTLS
Social
(.82 ≤ ICCDE ≤ .87) and UCLA-LS20Items (.80 ≤ ICCDE ≤ .84) was relatively low. By contrast, the profile correlations were substantially higher with the RTLS
Emotional
(
Summary and Conclusion
Overall, the results of Study 1 show that (a) all scores of measures of loneliness are highly correlated in the self-reports and informant-reports, (b) self-informant agreement was high for scores of all measures, and (c) scores of all measures have a similar nomological net. These findings suggest that scores of all measures tap into the same underlying latent construct and that there are no strong qualitative differences between the measures. It should be noted, though, that the nomological net of the scores obtained using single-item measures and—to a lesser extent—scores of the UCLA-LS3Items overlapped less strongly with related phenomena pertaining to personality (e.g., extraversion, sociability) and social networks (e.g., contact frequency). It could be argued that scores of these measures provide purer assessments of loneliness. However, it could also be argued that these lower correlations are due to the lower reliability of scores of these measures (Marangoni & Ickes, 1989; D. Russell, 1982). Indeed, the UCLA-LS3Items had a lower internal consistency in the present study (
Study 2: Validity of Scores of Four Loneliness Measures
Method
Sample
The data used in Study 2 were taken from a larger online study on personality and partner relationships (1 citation removed for masked review). The study protocol was approved by the Institutional Review Board of the Friedrich Schiller University Jena (FSV 18/47). The larger study combines features of longitudinal and diary methods in that couples are asked every 3 months about aspects of their own and their partner’s personality, evaluations of the partner relationship, and aspects related to partner perception and communication patterns. In addition, participants are invited in February and March each year to participate in a diary period extending over 28 days in total, separated by a 3-week break after the first 14 days. Participants were recruited via advertisements on social media platforms, several university mailing lists, and online forums visited by people interested in research participation (Mund & Drewke, 2020).
In this study, we use data from the first five longitudinal waves. These waves were conducted in January 2020, April 2020, July 2020, October 2020, and January 2021, respectively. We included data of couples when they participated for the first time. That is, for the purpose of this study, we did not build on the longitudinal nature of the study, but on its dyadic nature and the large sample size. Accordingly, when a couple participated more than once, we only used data from the first time both partners provided data. This first-time participation could have been anytime during the five waves. In this way, we included data from 1,216 individuals (608 mixed-sex couples). Of those 608 couples, 136 entered the study in January 2020, 48 new couples entered in April 2020, 141 new couples entered in July 2020, 219 new couples entered in October 2020, and 64 new couples entered in January 2021.
Despite couples entering the study at different measurement occasions, analyses of variance indicated no mean-level differences across waves for scores of the UCLA-LS8Items,
The same pattern of results was observed when comparing data collected before the COVID-19 pandemic (i.e., Wave 1, January 2020, N = 272) with data collected during the pandemic (i.e., from Wave 2 conducted in April 2020 onward; N = 944). By using independent-samples Welch tests, we found no differences in the levels of loneliness before and during the pandemic for the UCLA-LS20Items,
On average, participants were 26.69 years old (
Measures
Details on internal consistency, sample items, response formats, descriptive statistics, and zero-order correlations are provided in Supplemental Tables S4 to S6. Differences between samples of Study 1 and Study 2 regarding their average levels of loneliness are discussed in Section 3 of the Supplement, including Supplemental Tables S7 and S8.
Loneliness
For all loneliness measures, we obtained a self-rating and a partner-rating. That is, each participant rated and was rated by their partner on each loneliness measure.
Rasch-Type Loneliness Scale
In Study 2, we used the six-item version of the RTLS (de Jong Gierveld & van Tilburg, 2006), which also allows to differentiate between emotional and social loneliness. The correlation between scores of the two facets was less strong than in Study 1 for both the self-rating (
UCLA Loneliness Scale (UCLA-LS8Items)
We used the German version of the eight-item UCLA-LS (Hays & DiMatteo, 1987). Internal consistency was high for self-rating (
Three-Item Version of the UCLA Loneliness Scale (UCLA-LS3Items)
As in Study 1, we employed a three-item version of the UCLA-LS as proposed by Hawkley et al. (2015). Internal consistency was satisfactory in the self-rating (
Direct Single-Item Measure (SI Direct)
As a direct single-item measure of loneliness, we used the same measure as employed in Study 1. Self-ratings of this item were collected in January 2020, April 2020, July 2020, and January 2021, but not in October 2020. Partner-ratings were collected at all measurement occasions.
Correlates of Loneliness
As in Study 1, we used variables from several domains to examine the nomological net of scores of the loneliness measures. Specifically, we used correlates from the domains of (a) demography, (b) personality, and (c) satisfaction.
Demographic Variables
Participants were asked to indicate their age, gender, educational status, and their relationship duration.
Personality
The Big Five were assessed using a 15-item version of the Big Five Inventory Version 2 (Rammstedt et al., 2020; Soto & John, 2017). Self-esteem was assessed using the 10-item Rosenberg Self-Esteem Scale (von Collani & Herzberg, 2003). Depressiveness was measured using the five negatively worded items from the State-Trait Depression Scale (Spaderna et al., 2002). The explicit affiliation motive was assessed using two items from the Unified Motive Scales (Schönbrodt & Gerstenberg, 2012). Shyness and sociability were assessed using the Shyness and Sociability Scales for adults (Asendorpf & Wilpers, 1998). Sociability was only assessed at the first three measurement waves (January, April, and July 2020, respectively).
Satisfaction
Participants were asked to indicate their overall satisfaction with (a) life, (b) education, (c) leisure, (d) friends and social contacts, and (e) family using single items (Huinink et al., 2011; Siedler et al., 2008). Relationship satisfaction was assessed with the seven-item Relationship Assessment Scale (Hendrick, 1988; Sander & Böcker, 1993).
Results and Discussion
Convergent Validity
The intercorrelation between scores of the loneliness measures used in Study 2 is displayed in Table 5 separately for self-ratings (below the diagonal) and partner-ratings (above the diagonal). For self-ratings, intercorrelations were high (
Study 2: Convergent Validity Loneliness Scores in Self- and Informant-Rating.
Note. Self-rating (
The average difference between the self-ratings and partner-ratings in Study 2 was
Finally, we compared the average differences and profile correlations between Study 1 and Study 2 separately for the self-ratings and informant-ratings for the measures included in both studies (i.e.,
Taken together, the results regarding the convergent validity of scores of different loneliness measures obtained in Study 2 converged very well with the results observed in Study 1. Across the two studies, the correlation profiles were very similar and the differences in the average correlations were small to moderate. Unlike Study 1, the SI Direct turned out to be somewhat problematic in the partner-rating of Study 2 in the sense that it showed only modest correlations with scores of other partner-rated loneliness measures. One possible explanation for this finding is that the SI Direct forced individuals to explicitly rate their partners as “lonely,” which might threaten the self-concept of the rating partner. In Study 1, such effects might have been removed through the aggregation of the informant-ratings across multiple raters.
Self-Informant Agreement
The agreement between self-ratings and partner-ratings are displayed in Table 6. The diagonal shows the agreement on the same instrument, ranging between
Study 2: Self-Informant Agreement of the Different Loneliness Scores.
Note. N = 742–1,196 pairs of self-ratings and partner-ratings. All correlations are statistically significant at
In contrast to Study 1, paired-samples t-tests indicated consistent statistically significant differences between self-ratings and informant-ratings (all ps < .001). These differences were moderate in size, ranging between
Comparing self-informant agreement between Study 1 and Study 2, we found a weak profile correlation of
The Nomological Nets of Loneliness Measures
The correlations between the different loneliness item scores and external correlates are displayed in Table 7. As in Study 1, we focus on the average absolute correlations (
Study 2: Nomological Nets of Different Measures of Loneliness (Self-Reports).
Note. Different subscripts indicate a significantly different (
Due to the high correlation between scores of the total score on the RTLS and its subfacets emotional and social loneliness, we ran different batches of models. In one batch, we compared the correlations between scores of the full RTLS and all other scales, and in the second batch, we compared the correlations between scores of the emotional and social loneliness facets with all other scales. Hence, we could not test whether correlations differ between scores of the full RTLS and its subfacets. bGender was coded as 1 = female and 0 = male/other. cAverage correlations were calculated based on the absolute z-transformed correlations. After averaging, z-scores were retransformed to
Study 2: Profile Correlations Between Loneliness Scores and Correlates.
Note. The table shows the correlations of the profiles of different loneliness scales with regard to the correlates displayed in Table 7. Profile correlations are based on z-scores and were calculated using the double-entry ICC. SI = single item; RTLS = Rasch-Type Loneliness Scale; UCLA-LS = University of Los Angeles California Loneliness Scale; ICC = intraclass correlation.
Demography
The absolute average correlations varied only slightly between the loneliness measures with (.04 ≤
Personality
Compared to Study 1, the absolute average correlations in Study 2 were substantially lower, ranging from
As in Study 1, scores of the
Satisfaction
On average, there were no strong differences between scores of the loneliness measures with .27 ≤
Profile Similarity
The profile similarities for Study 2 are displayed in Table 8. The profiles of the different loneliness scores varied between .77 ≤ ICCDE ≤ .99. The highest similarity was found between the SI Direct and the
Comparing the profiles of scores of the measures that were used in both studies (i.e., RTLS and facets,
Summary and Conclusion
The results of Study 2 compared well with the results obtained in Study 1. We found a similar pattern of convergent validity among scores of different loneliness measures in the self-ratings and, with the exception of SI Direct, also in the partner-ratings. Moreover, we found a very high similarity between the profiles of different loneliness item scores and external correlates across the two studies. Although Study 2 cannot be considered a direct replication of Study 1, the similarity between the results indicates that the findings are robust. Thus, Study 2 further corroborates that scores of all included measures are valid and reliable measures of loneliness in adults.
Study 3: Reliability of Scores of Single-Item Measures
In Studies 1 and 2, we found that the nomological nets of scores of single-item measures of loneliness overlap less with phenomena related to extraversion and sociability. This finding might point either to purer nomological nets or to a lack of validity of the scores of single items—because of low reliability, their correlation with other constructs might be reduced.
Two previous studies have used STARTS models (Kenny & Zautra, 2001) on single-item scores of loneliness (Mund, Lüdtke, & Neyer, 2020; Zhong et al., 2016). In this way, Zhong et al. (2016) estimated a reliability of .595 for scores of an SI Direct, whereas the results reported by Mund, Lüdtke, and Neyer (2020) imply a reliability of scores of different single-item measures in the range between
Method
Sample
In 2019, we recruited a sample of 411 individuals via the service provider Prolific (www.prolific.com; for an overview, see Palan & Schitter, 2018). As the study only involved self-ratings, no ethical approval was necessary according to German regulations; however, all participants provided informed consent before entering the study. On average, participants were 30.06 years old (
Two weeks after the first assessment (T1), all participants were reinvited to participate a second time (T2). Of the initial 411 individuals, 278 (67%) participated again. Another 2 weeks later (i.e., 4 weeks after the initial assessment), all participants were invited to the third survey (T3) and 276 individuals (67% compared to T1) provided the necessary data. For estimating the reliability of scores of the single items, we conducted an analysis based on all available data and, separately, for the panel sample of 276 individuals who participated in all three assessment points. The data used for the key analyses are available from https://osf.io/7gsfw/.
Measures
Loneliness
As measures of loneliness, we included the three single items also included in Study 1. That is, we included the SI Direct, SI Indirect, and the SI
Correlates
The three single-item measures of loneliness were interspersed with other measures. We included the three-item measure of self-esteem developed for large panel studies (Huinink et al., 2011) that was also used in the present Study 1. Furthermore, participants rated the frequency of being tense, joyful, sad, and happy. We will consider scores of self-esteem in the following analyses, but the emotion ratings were merely used as fillers and will not be considered further.
Analytic Strategy
The procedure described by Heise (1969, Formula 9) requires three autocorrelations among the items measuring loneliness: the correlation between scores of T1 and T2 (
Results and Discussion
Before estimating the reliability of scores of the single-item measures, we investigated their correlation with scores of the brief self-esteem measure. The correlation between self-esteem scores and scores of SI Direct (rT1 = –.55, rT2 = –.51, rT3 = –.51), SI Indirect (
Table 9 displays the reliability of the scores of single-item measures of loneliness for the full and panel sample. The highest reliability was estimated for scores of the SI Indirect, followed by scores of the
Study 3: Reliability of Scores of Single-Item Measures.
Note.
Correlations in the Full Sample are displayed below the diagonal, and correlations in the Panel Sample are displayed above the diagonal.
General Discussion
In the present studies, we examined the psychometric features of scores of several measures of loneliness—the perception of one’s relationships as deficient regarding the quantitative and/or qualitative aspects (Ernst & Cacioppo, 1999; Perlman & Peplau, 1981). We included versions of the RTLS, UCLA-LS, and different single-item measures because those are among the most popular and widely used measures of loneliness in adulthood (Buecker et al., 2020; Maes et al., 2019; Mund, Freuding, et al., 2020; Pinquart & Sörensen, 2001). At the same time, these measures represent broader operationalizations of loneliness: The RTLS represents an indirect multidimensional approach, whereas the UCLA-LS represents an indirect unidimensional approach to loneliness. The included single-item measures are necessarily unidimensional, but represent both direct and indirect approaches. This study largely extends previous research by (a) focusing on several loneliness measures at once, (b) using self-ratings and informant-ratings to assess loneliness, (c) examining and comparing broad nomological nets, and (d) paying attention to the validity and reliability of different versions of single-item measures. The results presented in this article offer many vantage points for discussion, including (a) the similarities and differences between measures of loneliness, (b) the debate around the use of single items, and (c) recommendations for future research.
Similarities and Differences Between Loneliness Measures
In Study 1 and Study 2, we investigated the convergent validity of scores of different measures of loneliness, self-informant agreement, and the nomological nets of these item scores. The results indicate that scores of all measures included in this study are useful measures of loneliness. All item scores show high convergent validity in the self-reports and informant-reports. Furthermore, the nomological nets of scores of all measures were very similar to each other as indicated by high profile correlations. However, it should be noted that consistent across Studies 1 and 2, scores of the UCLA-LS, in its 20-item as well as in its 8-item version, were more strongly related to affiliative aspects like sociability and extraversion than scores of other measures, even after controlling for unreliability in other item scores. Similarly, scores of the
Across all measures and both Study 1 and Study 2, depressiveness and life satisfaction emerged as key correlates of loneliness. These findings are in line with prior research suggesting that loneliness might be a risk factor for depression (Cacioppo et al., 2010; Erzen & Çikrikci, 2018) or co-occur due to a common origin (Abdellaoui et al., 2018; Beutel et al., 2017; Cacioppo, Hughes, et al., 2006). The association with lower life satisfaction further suggests that loneliness might have very broad influences on how individuals see and approach the world. More specifically, although loneliness is conceptualized to emerge from perceived deficiencies in one’s social relationships, its consequences seem to reach far beyond social relationships and to cloud individual’s perspective on life more generally (Ernst & Cacioppo, 1999; Perlman & Peplau, 1981).
Consistent with prior research (Lee & Ko, 2018; Luhmann et al., 2016; Mearns et al., 2009), self-informant agreement within and across item scores was generally high and similar to self-informant agreement in broader personality characteristics (Connelly & Ones, 2010). This finding indicates that loneliness can be perceived by others. From a psychometric perspective, this supports the notion that scores of the measures selected in this article capture reliable interindividual differences that are, to a large extent, validated by informants (McCrae et al., 2004; Podsakoff et al., 2003). From an applied perspective, this finding might also indicate that close others can accurately evaluate a person’s loneliness. Thus, campaigns to tackle loneliness might also benefit from equipping close others with the knowledge of how to help and support lonely individuals in their environment in addition to encouraging lonely people to seek help proactively.
It should be noted that the informant-ratings in the present and previous studies (Lee & Ko, 2018; Luhmann et al., 2016; Mearns et al., 2009) were obtained from close others such as partners, friends, or parents. Loneliness might be less well observable for less close informants, such as teachers or casual acquaintances (Geukens et al., 2021). Unlike extraversion, for example, which can be judged accurately even after a few seconds of interaction (Back & Nestler, 2016; Connelly & Ones, 2010; Vazire, 2010), loneliness is a rather internal state that might not manifest readily in specific observable behaviors (see also Vazire, 2010). Instead, loneliness might express itself in the content of interactions, for example, by a specific emotional tone present in repeated conversations, or specific language patterns (Mehl et al., 2017; for findings on neuroticism, see Tackman et al., 2020) or interaction patterns (Tsai & Reis, 2009).
Debate Around Single-Item Measures
In Study 1 and Study 2, we demonstrated that scores of different versions of single-item measures correlate highly with each other and with scores of multi-item scales. Furthermore, we demonstrated that the nomological nets of scores of single items are consistent across different formulations and with the nomological nets of the established RTLS and UCLA-LS item scores. In Study 3, we also demonstrated that scores of single items are reliable. Thus, the results of the present studies indicate that scores of single-item measures are not deficient or invalid measures of loneliness per se. Consequently, we argue that single-item measures of loneliness have their place as robust and trustworthy measures in research on loneliness.
The practice of measuring loneliness via single items has been controversial (Marangoni & Ickes, 1989; D. Russell et al., 1980). However, single-item measures are highly prevalent in research on loneliness (Mund, Freuding, et al., 2020; Pinquart & Sörensen, 2001). This controversy seems to resemble similar discussions surrounding single-item measures of self-esteem (Robins et al., 2001) and life satisfaction (Cheung & Lucas, 2014; Lucas & Donnellan, 2012). Specifically results in those research fields have suggested that both self-esteem and life satisfaction are so highly schematized constructs that single-item scores are reliable and valid. The results of this study seem to suggest that loneliness is highly schematized as well and can be measured using single items in contexts where financial or time-related constraints prohibit the use of multi-item scales. However, it should also be noted that the use of single-item and other ultra-short measures of loneliness such as the
Recommendations for Future Research
Broadly speaking, when it comes to measuring the core of loneliness, we proclaim the well-known Dodo Verdict that “everybody has won, so all shall have prizes.” Put differently, we believe that the phenomenon of loneliness in adulthood can be measured robustly, validly, and reliably by scores of any of the measures investigated in this article.
A more nuanced view is necessary within specific research contexts. For example, in studies designed to investigate the behavioral correlates and/or the observability of loneliness, researchers need to take care that any observed effects are not due to other factors. For example, most studies that have examined self-informant agreement in loneliness have used scores of the UCLA-LS (Lee & Ko, 2018; Luhmann et al., 2016; Mearns et al., 2009). Given the results presented in the current studies, the high self-informant agreement found in these studies might be attributable, to some extent, to informants having actually observed the target’s extraversion or sociability, which are well observable even for raters who are unacquainted with the target (Back & Nestler, 2016; Connelly & Ones, 2010). As another example, the similarity in loneliness between romantic partners has been found to be moderate (
The general recommendation that can be taken from the present findings is that loneliness measures should be selected with a particular regard to the research question. Some of the existing measures (e.g., RTLS, UCLA-LS) might need to be adjusted statistically to rule out confounding with overlapping constructs (e.g., extraversion). Furthermore, some measures might not work well when informant-ratings are to be collected, as was the case for the SI Direct in the present Study 2; in such cases, indirect measures might be better suited. Ideally, researchers should include multiple measures whenever possible to cross-check the results. The nomological nets presented in this study might help to spot potential sources of disagreement between measures in existing and future research.
Limitations
The findings presented in this article should be evaluated against the backdrop of some limitations. First, recruitment of the participants was conducted via the internet and, thus, was prone to self-selection. Overall, the samples were composed of mostly young and highly educated adults. Hence, it is unclear whether the results can be generalized to samples from other age groups, other educational or cultural backgrounds, or with chronic health conditions.
Second, we investigated a selection of measurement instruments that are frequently used in contemporary research on loneliness in adulthood (Buecker et al., 2020; Maes et al., 2019; Mund, Freuding, et al., 2020; Pinquart & Sörensen, 2001). However, other loneliness measures are also available. Some of those measures have been developed for samples of children or adolescents (Maes et al., 2015, 2017; Marcoen et al., 1987), and some have been developed with a broad multidimensional and relationship-specific perspective (DiTommaso & Spinner, 1993; Hoza et al., 2000; Maes et al., 2015; Pollet et al., 2018). We hope that the nomological nets presented in this study might serve as a foundation for further investigating the construct validity of scores of those measures.
Third, although we investigated comprehensive nomological nets of scores of different loneliness measures, several important correlates were not considered. These include, first and foremost, health behaviors (Eccles et al., 2020; Shankar et al., 2011), but also other phenomena such as externalizing problems (e.g., hostility; Luhmann et al., 2015; Segel-Karpas & Ayalon, 2020) or social media usage (Nowland et al., 2018).
Conclusion
Across three studies, we investigated aspects of the validity and reliability of scores of different loneliness measures—including several versions of established scales such as the RTLA and the UCLA-LS, but also different versions of single-item measures. Overall, the results indicate that scores of all measures are highly correlated in self-ratings as well as informant-ratings. Furthermore, we found substantial self-informant agreement on scores of all measures. The nomological nets of the included item scores were shown to be consistent within and across studies. Finally, we have shown that scores of single-item measures of loneliness possess adequate reliability. Taken together, the findings of the present studies provide rich evidence for the validity of scores of popular loneliness measures, thereby providing a nuanced picture of the nomological net of each measure’s item scores. Thus, this study might guide further research in selecting an appropriate loneliness measure and at the same time provide a solid foundation for future research on the validity of scores of other loneliness measures.
Supplemental Material
sj-docx-1-asm-10.1177_10731911221077227 – Supplemental material for Would the Real Loneliness Please Stand Up? The Validity of Loneliness Scores and the Reliability of Single-Item Scores
Supplemental material, sj-docx-1-asm-10.1177_10731911221077227 for Would the Real Loneliness Please Stand Up? The Validity of Loneliness Scores and the Reliability of Single-Item Scores by Marcus Mund, Marlies Maes, Pia M. Drewke, Antonia Gutzeit, Isabel Jaki and Pamela Qualter in Assessment
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Preparation of this manuscript was supported by grant DRM/2017-06 from the Friedrich-Schiller-Universität Jena (Study 1) and German Research Foundation grant MU 4338/1-1 (Study 2) awarded to Marcus Mund.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
