Abstract
In this meta-analysis, we synthesized the available longitudinal evidence on prospective effects between self-esteem and eating pathology (i.e., restrained eating, bulimic behavior, binge eating, eating concern, negative body image, and drive for thinness). The analyses were based on 48 independent samples, including 19,187 participants. Mean age was 19.3 years (range = 7–48). As effect-size measure, we used standardized regression coefficients, controlled for prior levels of the predicted variables. Results suggested reciprocal prospective effects between low self-esteem and eating pathology. Self-esteem negatively predicted total eating pathology over time (β = −.08), and total eating pathology negatively predicted self-esteem over time (β = −.09). Overall, results for specific categories of eating pathology were similar. Moderator analyses indicated that the effects did not differ across age, gender, sample type (clinical vs. nonclinical), and time lag between assessments. In sum, the results support a reciprocal relations model of low self-esteem and eating disorders.
Approximately 8.4% of women and 2.2% of men suffer from eating disorders, such as anorexia nervosa, bulimia nervosa, and binge-eating disorder, and the point prevalence has been increasing over the past decades (Galmiche et al., 2019). Subclinical prevalence is even higher and has been estimated to be around 22% in adolescence (Hölling & Schlack, 2007). Besides genetic, biological, and sociological factors, self-esteem (i.e., individuals’ subjective evaluation of their worth as a person; Donnellan et al., 2011) has been hypothesized to be an important psychological predictor for the development of an eating disorder (e.g., Fairburn et al., 2003; Serpell & Troop, 2003). However, the potential role of eating disorders in the development of self-esteem has been largely ignored in the literature. In this meta-analysis, we therefore synthesized the available longitudinal evidence to gain better insights into the prospective effects between self-esteem and eating pathology.
As noted above, research suggests that low self-esteem is a risk factor for the development of eating pathology, an effect to which we refer as the vulnerability effect of low self-esteem (see Klein et al., 2011; Sowislo & Orth, 2013). Theory on eating disorders supports the assumption that low self-esteem is an influential factor in the onset and maintenance of eating disorders. For example, the transdiagnostic model of eating disorders proposes that low self-esteem motivates individuals to achieve their desired goals in the domain of weight and shape to improve their appearance-related self-evaluations (Fairburn et al., 2003). According to this model, this disproportional overvaluation of weight and shape leads to strict dieting and other weight-control behavior, which, in turn, results in either low weight or binge eating. Another example is the model of psychological factors of eating disorders (Serpell & Troop, 2003), which identifies low self-esteem as one of the central psychological factors that increase the likelihood of developing an eating disorder. Moreover, both models consider low self-esteem a transdiagnostic predictor for eating pathology, which is also supported empirically (e.g., E. J. Jones et al., 2020; Lampard et al., 2013). Overall, empirical evidence suggests that low self-esteem is a risk factor for eating disorders. The results from a recent meta-analysis, based on data from 13 studies, suggested that self-esteem negatively predicted later eating-disorder symptoms (β = −.09; Colmsee et al., 2021).
On the other hand, it is possible that eating pathology leads to enduring damages in the individual’s self-concept, resulting in reduced self-esteem, an effect to which we refer as the “scar” effect of eating pathology (see Klein et al., 2011; Sowislo & Orth, 2013). Two theoretical perspectives support the hypothesis that eating pathology negatively affects self-esteem. First, in his classic theory, James (1890) defined self-esteem as the degree to which individuals perceive that their goals are achieved (i.e., as the ratio of success to pretensions). For individuals with eating-disorder symptoms, a favorable ratio is often difficult to achieve. For example, individuals with anorexia may desire to be thin but perceive themselves as fat. In addition, individuals with binge-eating disorder may desire to have their eating behavior under control but repeatedly experience loss of control. A second theoretical perspective is sociometer theory, which assumes that self-esteem is part of an internal system that constantly monitors the individual’s degree of social acceptance (Leary, 2012). When social acceptance is threatened, self-esteem declines. The problem is that most individuals with an eating disorder perceive themselves as disgusting and undesirable (e.g., Ille et al., 2014), which may contribute to declines in self-esteem. Although these theoretical perspectives suggest that eating pathology negatively affects self-esteem, only a few studies are available that focused on this possibility. Nevertheless, the available evidence suggests that restrained eating, binge eating, and body dissatisfaction predict decreases in self-esteem (Johnson & Wardle, 2005; D. C. Jones & Newman, 2009; Sehm & Warschburger, 2018).
The Present Research
The present meta-analysis synthesizes the available evidence on prospective effects between self-esteem and eating pathology. Note that the vulnerability effect and scar effect are not mutually exclusive but may operate simultaneously. Given the theoretical perspectives and empirical evidence reviewed above, we expected a pattern of reciprocal effects (i.e., supporting a reciprocal-relations model of low self-esteem and eating disorders).
We examined a comprehensive set of key constructs related to eating pathology, which can be clustered into behavioral symptoms (including restrained eating, bulimic behavior, and binge eating) and cognitive-affective symptoms (including eating concern, negative body image, and drive for thinness). Restrained eating covers aspects of restricted food intake, such as limiting the amount of consumed food or excluding certain types of food from the diet. Bulimic behavior describes the uncontrollable consumption of large amounts of food and subsequent compensatory measures, such as vomiting, excessively exercising, or using laxatives. Binge eating covers frequent episodes of consuming large amounts of food, in situations where the individual loses control (in contrast to bulimic behavior, no compensatory measures are taken). Eating concern comprises constant thoughts about food, eating, and calories. Negative body image covers weight and shape concerns (e.g., preoccupation with thoughts about weight and shape, importance of shape and weight) and body dissatisfaction (i.e., dissatisfaction with specific body parts, e.g., stomach, thighs, and hips). Drive for thinness describes the strong desire to be thin and the constant fear of gaining weight. Finally, we aggregated the evidence across all available measures by computing an index of total eating pathology.
To strengthen the validity of conclusions, we focused exclusively on effect sizes that were based on longitudinal data and that were controlled for prior levels of the predicted variables (e.g., Cole & Maxwell, 2003; Gollob & Reichardt, 1987). Figure 1 provides a generic illustration of the effect sizes analyzed in this meta-analysis. First, we examined the cross-lagged coefficients between self-esteem and eating-disorder variables for which the autoregressive effects of the predicted variables were controlled (e.g., the effect of self-esteem at Time 1 on restrained eating at Time 2 controlling for the effect of restrained eating at Time 1). Second, we examined the stability (i.e., autoregressive) coefficients of each construct (e.g., the effect of restrained eating at Time 1 on restrained eating at Time 2). Third, for reasons of completeness, we also examined the concurrent correlation between the constructs at Time 1 (e.g., the correlation between self-esteem at Time 1 and restrained eating at Time 1).

Generic illustration of effect sizes analyzed in the present meta-analysis. We examined the cross-lagged effects between self-esteem and eating-disorder variables (e.g., the effect of self-esteem at Time 1 on restrained eating at Time 2) while controlling for the stability effects of the predicted variables (e.g., the effect of restrained eating at Time 1 on restrained eating at Time 2). In addition to cross-lagged and stability effects, we also examined the concurrent correlation between the constructs at Time 1 (e.g., correlation between self-esteem at Time 1 and restrained eating at Time 1). The correlation between the residuals of the constructs at Time 2 was not meta-analyzed (and is therefore grayed out in the figure). Residual variances (i.e., disturbances) are denoted as d1 and d2.
A general strength and central goal of meta-analyses is to assess the robustness of effects by conducting moderator analyses. To maximize the power of our moderator analyses, we focused exclusively on total eating pathology (for this variable, the number of effect sizes was much larger compared with the specific-eating-disorder variables). As potential moderators, we tested three important sample characteristics—mean age, proportion of gender in the sample, and sample type (i.e., clinical vs. nonclinical)—and one design characteristic, that is, length of time lag between assessments. These moderators were selected because they are useful to test whether the findings hold across variations in key characteristics of the studies (for similar procedures, see Khazanov & Ruscio, 2016; Sowislo & Orth, 2013). Even if we did not expect significant moderator effects, some research is available that suggests that the moderator variables might explain variability in effect sizes. For example, the effects of low self-esteem on eating pathology could be stronger in childhood and adolescence compared with adulthood because adults have typically developed more adaptive strategies to cope with low self-esteem other than pathological eating behaviors (e.g., Diehl et al., 2014). Moreover, body weight and shape may affect girls’ (vs. boys’) self-esteem more strongly because girls place greater importance on their physical appearance than boys (Quittkat et al., 2019). Consequently, the effects of eating pathology on self-esteem could be larger for girls than for boys. In addition, sample type may play an important role because the link between self-esteem and eating pathology could be stronger in clinical samples than in nonclinical samples (Braun et al., 2016). Finally, we tested for moderating effects of time lag because effects between self-esteem and eating disorders might decrease with longer time lags (Dormann & Griffin, 2015).
The Need for the Present Meta-Analysis
The goal of the present meta-analysis was to better understand the link between low self-esteem and eating disorders. It is important to understand whether low self-esteem is a predictor for developing an eating disorder, and at the same time, it is also important to understand how eating disorders shape the development of self-esteem. Although the meta-analysis by Colmsee et al. (2021) summarized evidence regarding some of the questions in this field, the present meta-analysis is needed for several reasons. First, Colmsee et al. focused exclusively on the possible effects of low self-esteem on eating pathology. However, as reviewed above, theoretical and empirical arguments suggest that the relation between the constructs may be reciprocal. Consequently, without considering the possible effects in the reverse direction (i.e., effects of eating pathology on self-esteem), any conclusions about the nature of the relation between the constructs are incomplete and possibly false. Second, Colmsee et al. tested whether type of disorder (i.e., anorexia, bulimia, and binge eating) moderated the effect size (which was not the case). However, there is a need for more detailed information with regard to specific categories of eating pathology, which allows for a more refined theoretical account of the relation between self-esteem and eating pathology. In addition to an index of total eating pathology, we therefore examined a broad set of specific categories, including behavioral symptoms (i.e., restrained eating, bulimic behavior, and binge eating) and cognitive-affective symptoms (i.e., eating concern, negative body image, and drive for thinness). Theory suggests that low self-esteem is related to both behavioral and cognitive-affective symptoms (Fairburn et al., 2003; Rieger et al., 2010; Serpell & Troop, 2003), but little empirical information is available on this issue. Thus, it is possible that there are differential effects between self-esteem and specific categories of eating pathology, which would have important implications for potential interventions. Third, meta-analytic evidence on the possible effects of eating pathology on self-esteem will contribute important knowledge to the field of self-esteem development. Although some information on the factors that influence self-esteem is available (for a review, see Orth & Robins, 2019), overall, there is a need to better understand the factors that shape the development of self-esteem, and relatively little information is available about the effects of clinical conditions and symptoms (Zeigler-Hill, 2011).
Method
Given that we used anonymized data, the present research was exempt from approval by the Ethics Committee of the authors’ institution (Faculty of Human Sciences, University of Bern), in accordance with national law. The present research was not registered. We report how we determined our sample size, all data exclusions (if any), all manipulations, and all measures in the study. Data, materials, and code are available at https://osf.io/t6nge. The present meta-analysis follows the PRISMA guideline for reporting systematic reviews (Page et al., 2021).
Selection of studies
Figure S1, in the Supplemental Material available online, shows the flow diagram of the search and selection procedures. We searched the databases PsycINFO and MEDLINE for English-language journal articles, books, book chapters, and dissertations. The search was conducted on May 12, 2020, and covered articles from all years included in the databases (i.e., beginning in 1806 for PsycINFO and 1946 for MEDLINE). For self-esteem, we used the following search terms: self-esteem, self-worth, self-liking, self-view*, self-concept, self-respect, self-regard, self-acceptance, and self-image*. For constructs related to eating disorders, we used the following search terms: eating*, anore*, bulimi*, and bing*. The asterisk allowed for the inclusion of alternate word endings of the search term. To ensure that the search will likely yield longitudinal studies, we added the search terms longitudinal*, prospective*, and cross-lag*. After removing duplicates, the search resulted in a total of 510 potentially relevant articles.
To make the meta-analysis as exhaustive as possible, we used three additional strategies. First, we sent a request for unpublished studies to authors with a research program in the area of eating disorders. We requested unpublished manuscripts, preprints, papers in press, theses, or any other form of gray literature or unpublished data. This strategy resulted in three additional articles. Second, we examined the reference sections of seven articles that focused on the topic of the present research (i.e., Bardone-Cone et al., 2020; Baumeister et al., 2003; Colmsee et al., 2021; Fairburn et al., 2003; Larsen et al., 2015; Linardon et al., 2019; Polivy & Herman, 2002). This strategy resulted in three additional studies. Third, we included one additional article that we knew of but was not found by the search procedures described above.
Studies were included in the meta-analysis if the following criteria were fulfilled: (a) The study was empirically quantitative, (b) the study used a longitudinal design (i.e., it included two or more assessments of the same sample), (c) the study included measures of both self-esteem and an eating-disorder variable, (d) enough information was given to compute the effect sizes, and (e) the sample did not undergo a psychological or psychopharmacological intervention (however, we used information from control groups that did not undergo any treatment). Nearly all studies were assessed in full text. Only studies that were difficult to obtain were first screened on the level of the abstract by S. Krauss. If the abstract did not meet any exclusion criteria, the study was acquired as full text for further assessment. All full texts were assessed by S. Krauss to decide on the eligibility of studies. In addition, a random sample of 60 studies was assessed independently by L. C. Dapp to obtain an estimate of interrater agreement. The interrater agreement on inclusion or exclusion in the meta-analysis was high (κ = 1.00). 1
From the 517 potentially relevant studies, 14 could directly be included in the meta-analysis because all required information was available in the article. If studies fulfilled all inclusion criteria except for providing enough information to compute effect sizes, we contacted the authors by email with a request for providing the required information (if the article had been published in 2000 or later). On the basis of the authors’ responses, we could include 30 additional articles. In sum, the search procedures yielded 44 sources (i.e., 41 journal articles and three dissertations), providing data on 48 independent samples for analysis.
Coding of studies
We coded the following data: sample size, mean age of participants at Time 1, percentage of female participants, sample type (i.e., nationally representative, college students, clinical, and community), country of data collection, race/ethnicity, information on socioeconomic status, year of Time 1 assessment, time lag between assessments, effect-size information, publication status of effect sizes (i.e., effect-size data published in article in form of correlation or regression coefficients vs. effect-size data not published in article but obtained from authors), and risk of bias. If an article did not report the mean age of participants, we used the most valid indicator of age that was available. For example, if an age range was given (e.g., 18−20 years), we used the midpoint of the interval as estimate of mean age (e.g., 19 years). If a study provided more than one effect size for the same category of eating pathology using the same sample (e.g., one effect size for weight/shape concern and one effect size for body dissatisfaction, which both belong to negative body image), we averaged the effect sizes using Fisher’s z transformations. Risk of bias was assessed for each included study using the Joanna Briggs Institute’s (JBI) Critical Appraisal Checklist for Analytical Cross-Sectional Studies (Moola et al., 2020). The checklist has been modified for longitudinal studies. The modified version included six dichotomous items (see Coding Manual), and a sum score of 6 indicates the lowest risk of bias (and the highest quality) of the included study.
As effect-size measure, we used standardized regression coefficients (denoted as β), where the effect of the predictor at Time 1 on the outcome at Time 2 is controlled for the Time 1 level of the outcome (see Fig. 1). In most cases, these effect sizes were not directly reported in the article; instead, we coded all relevant zero-order correlations (i.e., correlations between Time 1 self-esteem, Time 2 self-esteem, Time 1 eating-disorder variable, and Time 2 eating-disorder variable). Using these correlations, we computed the standardized regression coefficients with the following equation (Cohen et al., 2003, p. 68), which is applicable when a criterion variable (Y) is influenced by two predictors (X1, X2):
Here, βY1.2 is the standardized regression coefficient of X1 predicting Y controlling for the effect of X2 (e.g., the effect of self-esteem at Time 1 on restrained eating at Time 2 controlling for restrained eating at Time 1), rY1 and rY2 are the zero-order correlations between each predictor (X1, X2; e.g., self-esteem at Time 1, restrained eating at Time 1) and the criterion (Y; e.g., restrained eating at Time 2), and r12 is the correlation between the two predictors (X1 and X2; e.g., the cross-sectional correlation of self-esteem at Time 1 and restrained eating at Time 1).
Studies from the PsycINFO and MEDLINE search that provided all required effect-size information were coded independently by two raters (i.e., S. Krauss, L. C. Dapp; the other studies were coded by S. Krauss). Estimates of interrater agreement were based on 12 samples and were calculated with the psych package (Revelle, 2021) in R (R Core Team, 2021). The interrater agreement was high for categorical variables (κ = 1.00) and continuous variables (r > .99). Diverging assessments were discussed until consensus was reached.
Meta-analytic procedure
For all meta-analytic computations, we used the metafor package in R (Viechtbauer, 2010). For the analyses, all coefficients were converted to Fisher’s z values. The within-study variance of the transformed values is given by
where ni is the sample size in study i. In the effect-size analyses, we used random-effects models (for estimating weighted mean effect sizes) and mixed-effects metaregression models (for testing moderators), following recommendations by Borenstein et al. (2009). For both kind of models, study weights are given by
where ω i is the study weight for study i, vi is the within-study variance for study i, and τ2 is the estimate of between-studies heterogeneity. Between-studies heterogeneity (i.e., τ2) was estimated with the method of restricted maximum likelihood (Harville, 1977), as recommended by Langan et al. (2019). To account for the uncertainty in the estimate of τ2, we used the Knapp and Hartung (2003) method, as recommended by Viechtbauer et al. (2015).
In the preliminary analyses, we examined statistical outliers on effect-size estimates and conducted sensitivity analyses. For each of the effect-size variables, we searched for influential outliers using the “influence” function of the metafor package (Viechtbauer, 2010). When an effect size was influential and qualified as a potential outlier (following the cutoff values implemented in the metafor package), we repeated the meta-analytic computation of the weighted mean effect size without this study for the purpose of sensitivity analyses.
In the effect-size analyses, we computed weighted mean effect sizes and tested for heterogeneity of effect-size distributions. In these analyses, we first focused on total eating pathology and then on specific categories of eating pathology (i.e., behavioral symptoms and cognitive-affective symptoms). The index of total eating pathology was based on all available measures (i.e., measures of specific symptoms and global measures of eating pathology). If a sample provided effect sizes for more than one measure, we averaged the effect sizes before the meta-analytic computations (using Fisher’s z transformations) to ensure the statistical independence of effect sizes.
Moderator tests were conducted exclusively for total eating pathology because of the much larger number of samples for this variable compared with the more specific eating-disorder variables to maximize the power and robustness of the moderator analyses. The moderator variables mean age at Time 1, proportion of female participants, and time lag were continuous and were treated accordingly. The moderator-variable, sample type, was dichotomous, distinguishing between clinical (23%) and nonclinical samples (77%). For both directions of effects, the four moderators were tested simultaneously using mixed-effects metaregression models.
Finally, we determined whether there was evidence of publication bias in the meta-analytic data set using three methods. First, we examined funnel graphs, which display the relation between effect size and the standard error of the effect size. A symmetrical shape indicates a nonbiased meta-analytic data set. Second, we used Egger’s regression test (Egger et al., 1997) to examine whether the funnel graphs significantly deviate from a symmetrical shape. Third, we compared effect sizes that were published in the articles (i.e., as correlation or regression coefficients) with effect sizes that were not published in the articles (but obtained from the study authors) using mixed-effects metaregression models. If the size and significance of an effect size influences whether it is published, then this comparison should yield a significant difference between effect sizes (i.e., this would be evidence of publication bias).
Results
Description of studies
The meta-analytic data set included information from 44 sources. Year of publication ranged from 2001 to 2018; the median is 2011. The 44 sources provided information on 48 independent samples. Basic sample characteristics are shown in Table 1, including references for all samples in the meta-analytic data set. Table 1 includes 49 lines because information for one sample was taken from two sources (i.e., the two sources reported on different eating-disorder variables for the same sample). 2 Sample sizes ranged from 20 to 2,601 (M = 399.7, SD = 549.0). In sum, the samples included 19,187 participants. Mean age at Time 1 ranged from 6.5 years to 47.6 years (M = 19.3 years, SD = 9.4 years). One sample was assessed in childhood (mean age = 6.5 years), 29 samples were assessed in adolescence (mean age range = 10.8–17.7 years), 13 samples were assessed in emerging adulthood (mean age range = 18.1–26.3 years), and five samples were assessed in middle adulthood (mean age range = 37.8–47.6 years). The mean proportion of female participants was 79% (range = 0%–100%, SD = 27%). Twenty-eight samples were community samples, 14 samples consisted of participants with a clinical disorder, five samples consisted of college and university students, and one sample was nationally representative. Twenty-six samples were from the United States, five were from Switzerland, four were from Australia, three were from Spain, two were from Germany, two were from Sweden, and one sample each was from Brazil, Canada, Finland, Italy, Norway, and the UK. With regard to race/ethnicity, 26 samples were predominantly White/European (“predominantly” was defined as 80% and more), one was predominantly Hispanic, 14 were mixed, and for seven samples, information about race/ethnicity was missing. More detailed information on race/ethnicity and socioeconomic status is reported in Table S1 in the Supplemental Material. Time lag between the assessments ranged from 4 weeks to 5 years (M = 1.1 years, SD = 1.2 years). Self-esteem was most frequently measured by the Rosenberg Self-Esteem Scale (Rosenberg, 1965; 72%), followed by Harter’s Self-Perception Profile (e.g., Harter, 2012; 16%). Eating pathology was most frequently measured by the Eating Disorder Examination (e.g., Fairburn & Beglin, 1994; 35%), followed by the Eating Disorder Inventory (e.g., Garner et al., 1983; 33%) and the Eating Attitudes Test (e.g., Garner & Garfinkel, 1979; 7%). Risk of bias for the included samples, as assessed by the JBI checklist, ranged from 4 to 6 (M = 5.63, SD = 0.61), indicating a high quality of studies and a low risk of bias.
Descriptive Information on the Studies Included in the Meta-Analysis
Note: Mean age at Time 1 is given in years. The column “Female” shows the proportion of female participants in percentages. The column “Time lag” indicates the interval between Time 1 and Time 2 in years. “Effect-size data published” is a dichotomous variable (yes = effect-size data were published in article in form of correlation or regression coefficients, no = effect-size data had not been published in article but were obtained from the authors).
Preliminary analyses
When an effect size was influential and qualified as potential outlier, we conducted the meta-analytic computation of the weighted mean effect size without this study for the purpose of sensitivity analyses. The results suggested that excluding these studies did not change the pattern of findings and did not lead to any different conclusions (see Tables S2 and S3 in the Supplemental Material). We therefore used the complete data set in the remainder of the analyses, consistent with methodological literature advising against routine deletion of outliers (Viechtbauer & Cheung, 2010).
Effect-size analyses for total eating pathology
Table 2 reports the meta-analytic findings for total eating pathology. The two cross-lagged effects were statistically significant and of similar size. Specifically, total eating pathology negatively predicted later self-esteem (β = −.09), and self-esteem negatively predicted later eating pathology (β = −.08). No formal test of the difference between the effects is available because the samples on which these two effects were based overlapped partially. As an approximate means of comparing the effects, we used the 95% confidence intervals. The two confidence intervals overlapped strongly, suggesting that the effects did not differ significantly. Moreover, the results indicated that the cross-sectional correlation between self-esteem and total eating pathology at Time 1 was significant and of moderate size (r = −.35). The stability effects of self-esteem (β = .53) and total eating pathology (β = .55) were relatively large.
Summary of Effect Sizes for Relations Between Total Eating Pathology and Self-Esteem
Note: Computations were made with random-effects models. ED = eating-disorder variable; SE = self-esteem; k = number of samples; N = total number of participants in the k samples; CI = confidence interval; Q = statistic used in heterogeneity test; τ2 = estimated amount of total heterogeneity; I2 = ratio of total heterogeneity by total variability (given in percent); rED,SE = correlation between eating disorder variable at Time 1 and self-esteem at Time 1.
Standardized regression coefficient.
p < .05.
The analyses suggested that the cross-lagged effects were relatively heterogeneous across samples (Table 2), suggesting that moderating factors may account for systematic between-samples differences. Table S4 in the Supplemental Material provides information about the descriptive statistics and intercorrelations among moderators (i.e., mean age at Time 1, proportion of female participants, sample type, and time lag). The moderator analyses indicated that none of the moderators was significant (Table 3). Thus, the findings suggest that the cross-lagged effects between total eating pathology and self-esteem hold across samples varying with regard to age, gender, sample type, and time lag between assessments, which strengthens the generalizability of the findings.
Mixed-Effects Metaregression Models for Sample Characteristics Predicting Cross-Lagged Effects Between Total Eating Pathology and Self-Esteem
Note: Regression coefficients are unstandardized. For each direction of effects, the four moderators were tested simultaneously. The significance level was adjusted to p < .0125 (Bonferroni correction for four moderators). k = number of samples; ED = eating-disorder variable; SE = self-esteem.
1 = clinical, 0 = nonclinical.
Effect-size analyses for specific eating pathology
Table 4 reports the meta-analytic findings for the specific categories of eating pathology. For behavioral symptoms, significant cross-lagged effects emerged in both directions. Restrained eating (β = −.08), bulimic behavior (β = −.07), and binge eating (β = −.05) negatively predicted later self-esteem, and self-esteem negatively predicted restrained eating (β = −.07), bulimic behavior (β = −.09), and binge eating (β = −.11). For cognitive-affective symptoms, almost all cross-lagged effects were significant. Eating concern (β = −.13), negative body image (β = −.16), and drive for thinness (β = −.12) negatively predicted later self-esteem, and self-esteem negatively predicted negative body image (β = −.07) and drive for thinness (β = −.05). The only exception was that self-esteem did not significantly predict later eating concern (β = .00).
Summary of Effect Sizes for Relations Between Specific Eating Pathology Categories and Self-Esteem
Note: Computations were made with random-effects models. ED = eating-disorder variable; SE = self-esteem; k = number of samples; N = total number of participants in the k samples; CI = confidence interval; Q = statistic used in heterogeneity test; τ2 = estimated amount of total heterogeneity; I2 = ratio of total heterogeneity by total variability (given in percentage); rED,SE = correlation between eating-disorder variable at Time 1 and self-esteem at Time 1.
Standardized regression coefficient.
p < .05.
Assessment of publication bias
We assessed whether there was evidence of publication bias in the cross-lagged effects (i.e., the coefficients that were of key interest in this research). The three methods used for testing publication bias indicated little evidence of publication bias. First, the funnel graphs exhibited a relatively symmetrical shape typical of nonbiased meta-analytic data sets (see Fig. S2 in the Supplemental Material). Second, Egger’s regression test (Egger et al., 1997) was nonsignificant in all cases (Table 5), suggesting that the funnel graphs did not deviate significantly from a symmetrical shape. Third, mixed-effects metaregression models indicated that all effect sizes did not differ significantly as a function of whether effect-size data had been reported in the article, except for the cross-lagged effect of total eating pathology on self-esteem (Table 5). When restricting the data to unpublished effect sizes (k = 33), we found the weighted mean effect size of this effect was β = −.07 [95% CI = −.10, −.04], which was only slightly smaller than the effect size based on the full data set (β = −.09, k = 39; see Table 2). Given that this was the only eating-disorder variable for which the publication bias tests yielded a significant result and given that the other methods of assessing publication bias were inconspicuous for all eating-disorder variables, we concluded that the possible influence of publication bias was, at most, very small.
Tests of Publication Bias in Cross-Lagged Effects
Note: The differences between effect sizes from studies for which effect size data were published in the article versus unpublished were tested with mixed-effects metaregression models. Dash indicates that there were no published effect-size data for the eating-disorder variable. The significance level was adjusted to p < .004 for Egger’s regression test and p < .006 for the difference between published and unpublished effect-size data (Bonferroni correction for 14 and nine tests, respectively). kp = number of published effect sizes; ku = number of unpublished effect sizes; ED = eating-disorder variable; SE = self-esteem.
Discussion
The aim of the present meta-analysis was to better understand the link between low self-esteem and eating disorders by synthesizing the available longitudinal evidence on prospective effects between low self-esteem and eating pathology (i.e., restrained eating, bulimic behavior, binge eating, eating concern, negative body image, drive for thinness, and total eating pathology). The analyses were based on 48 samples, including data from more than 19,000 participants ranging from 7 to 48 years in age. Overall, the findings suggested a reciprocal pattern between low self-esteem and eating pathology. For behavioral symptoms, the point estimates were of similar size in both directions (i.e., behavioral symptoms predicted later self-esteem with about the same effect size as low self-esteem predicted later behavioral symptoms). In contrast, the point estimates of the effects of cognitive-affective symptoms on self-esteem were slightly larger than the reverse estimates, that is, effects of self-esteem on cognitive-affective symptoms. However, no formal test of the difference between the effects is available because the samples on which the effects were based overlapped partially. Moreover, note that by and large, the pattern of results was relatively similar across specific categories of eating pathology. Moderator analyses on the prospective effects between self-esteem and total eating pathology suggested that the effects did not differ across age, gender, sample type, and time lag. The pattern of effects is in line with both the vulnerability and the scar effect, supporting a reciprocal-relations model of low self-esteem and eating disorders. Thus, the findings suggest that low self-esteem makes people more vulnerable to developing an eating disorder and that suffering from an eating disorder scars the individual’s self-esteem. Consequently, the present findings imply a positive feedback loop for people with high self-esteem and healthy eating behavior and, simultaneously, a vicious circle for people with low self-esteem and problematic eating behavior.
At first sight, the meta-analytic estimates of the effects might be considered as relatively small (i.e., the significant effects ranged from β = −.05 to β = −.16, with an average value of β = .09), especially when using Cohen’s conventions for interpreting correlation coefficients (Cohen, 1992; e.g., with r = .10 indicating a small effect). However, conventions for correlations do not apply to cross-lagged effects, in particular because the stabilities of the outcomes are statistically controlled in the coefficients (Adachi & Willoughby, 2015). Often, constructs are relatively stable across time (which was also true in the present meta-analysis with regard to both self-esteem and eating pathology), which limits the theoretically possible range of effects of other constructs on the outcome. Recently, a meta-analytic project established empirical benchmarks for cross-lagged effects in several fields of psychology, including clinical and social-personality psychology (Orth et al., 2022). Given the distribution of effect sizes, results suggested that an effect of .03 should be interpreted as small, .07 should be interpreted as medium, and .12 should be interpreted as large. Consequently, the effects found in the present meta-analysis can be considered of medium size. In addition, from a substantive perspective, very large effects were not to be expected given that many factors, besides low self-esteem, may influence eating pathology (for a review, see Polivy & Herman, 2002; Stice, 2002; Striegel-Moore & Bulik, 2007) and given that many factors, besides eating pathology, may influence self-esteem (for a review, see Orth & Robins, 2019). Moreover, the effects between self-esteem and eating pathology may accumulate over time (e.g., over adolescence and young adulthood; Abelson, 1985).
The largest effect size emerged for the effect of negative body image on self-esteem (β = −.16). This effect is consistent with the bottom-up model of self-esteem, which proposes that people’s evaluations of specific aspects of their self (e.g., physical appearance) influence their global level of self-esteem (Rentzsch & Schröder-Abé, 2022). Given that measures of negative body image are more closely related to people’s self than other categories of eating pathology (e.g., restrained eating or binge eating), this may provide an explanation of the larger effect of negative body image. However, as noted above, the pattern of findings generally held across specific categories of eating pathology.
The magnitude of the effect sizes between self-esteem and eating pathology was relatively similar to the effects between self-esteem and depression and between self-esteem and anxiety (Sowislo & Orth, 2013). Future research should examine the relations between self-esteem and other psychological disorders (e.g., personality disorders) because it is possible that low self-esteem is transdiagnostically related to psychopathology more generally (Nolen-Hoeksema & Watkins, 2011; Zeigler-Hill, 2011). If low self-esteem proves to be a transdiagnostic risk factor for psychological disorders, this would have important implications for clinical assessment, diagnosis, and treatment (Rodriguez-Seijas et al., 2015). Furthermore, improving low self-esteem may be useful for reducing the risk for psychological disorders in the first place.
The analyses suggested that the findings are unlikely to be influenced by publication bias. Moreover, we believe that there was generally little reason to expect publication bias in the present meta-analytic data set. The reason is that most of the effect sizes had not been reported in the articles but were obtained from the study authors. In addition, some studies included in the meta-analysis focused on other research questions (i.e., other than prospective effects between self-esteem and eating-disorder variables), and the relevant statistics (i.e., correlations between self-esteem and eating pathology) were simply reported along with other statistics on a larger set of variables.
As noted above, Colmsee et al. (2021) recently meta-analyzed the prospective effect of self-esteem on eating pathology. The meta-analytic estimate determined in the present research (β = −.08) was very similar in magnitude to the effect found in the earlier meta-analysis (β = −.09). However, several reasons suggest that the present meta-analysis significantly improves the knowledge about the link between self-esteem and eating pathology. First, the present meta-analysis examined a much larger evidentiary base. More precisely, whereas Colmsee et al. used data from 13 studies, the present meta-analysis included 44 studies. These numbers suggest not only that the present research provides for much larger power and precision but also that the present data set reflects the available evidence in a more comprehensive way. Second, whereas the data set by Colmsee et al. was based exclusively on published data, the present meta-analytic data set included a relatively large number of unpublished effect sizes, allowing for thorough tests of publication bias. Third, whereas in Colmsee et al. only nine effect sizes could be controlled for prior levels of the outcomes, this was possible for all effect sizes in the present research, which increases the validity of conclusions about prospective effects. Fourth, the present research significantly extends prior research in this area by investigating aspects that were not covered by Colmsee et al. Specifically, the present research also tested the reverse direction of effects (i.e., effects of eating pathology on self-esteem) and estimated the effects in both directions also for specific categories of eating pathology (e.g., restrained eating, binge eating) in addition to the index of total eating pathology.
When interpreting the findings, the following limitations should be noted. First, the present research does not allow for strong causal conclusions because the studies included in the meta-analysis used nonexperimental designs. As is true for all observational studies, it is possible that the effects are confounded by unmeasured third variables (e.g., Little et al., 2007). Nevertheless, longitudinal data are useful to test whether prospective effects are consistent with a causal model of the relation between constructs. Second, the samples were predominantly from Western cultures, of White/European race/ethnicity, and of medium to high socioeconomic status. In future research, it would therefore be highly desirable to more often collect data on self-esteem and eating pathology in samples from non-Western cultural contexts and in samples that are more diverse regarding race/ethnicity and socioeconomic status to evaluate the degree to which the pattern of findings generalizes across cultures and different types of samples (Henrich et al., 2010). Third, in the meta-analysis, we could not test whether differences in socioeconomic status (as indicated, e.g., by income, level of education, or occupational prestige) moderate the effects. The reason is that even if information is available in some reports, this information would not be comparable across samples because of the many differences between measures, between countries, and between birth cohorts. Fourth, we contacted authors with a research program in the area of eating disorders that we were aware of. However, because we received only three articles that were eligible for inclusion, it is unlikely that the effect sizes from these articles have biased the results. Fifth, it was not possible to divide bulimic behavior into two separate variables (i.e., binge eating and compensatory behaviors) because the measures rarely differentiated between the two components. Although some measures included binge eating as a separate scale, they did not provide information about whether binge eating was accompanied by compensatory behaviors. Finally, for some of the categories of eating pathology, only a small number of samples was available. Thus, future research would benefit from conducting more longitudinal studies examining the relation between low self-esteem and eating disorders.
The present research also has important strengths. A crucial advantage of meta-analyses in general lies in the aggregation of data across a set of heterogeneous studies, which increases the robustness of the findings. In the present research, 48 samples provided data from more than 19,000 individuals. Another major strength is the longitudinal nature of the data. Specifically, the fact that the meta-analysis synthesized prospective effects and systematically controlled for prior levels of the constructs significantly strengthens the validity of the findings. Finally, the thorough and diverse strategies used in the search for relevant studies (i.e., use of bibliographic databases, examination of references in key papers, contacting of authors with relevant research programs, and requests for unpublished effect-size data) increased the likelihood that the present meta-analysis comprehensively covers the available data.
Given that the present research suggests that low self-esteem and eating pathology reciprocally affect each other, future research should examine the mechanisms that may mediate the effects between the constructs. For example, a possible pathway is that individuals’ level of self-esteem influences the perception of their own bodies, which, in turn, influences individuals’ eating behavior (e.g., Woodward et al., 2019). A possible pathway of the reverse direction of effects is that eating pathology leads to social withdrawal and isolation (e.g., Robinson, 2014; Rørtveit et al., 2009), which is detrimental for social relationships. A lowered level of social support, in turn, might then lead to decreases in self-esteem (Harris & Orth, 2020). Finally, an interesting avenue for future research consists in using experience-sampling methods that focus on within-persons effects across shorter time periods (Csikszentmihalyi & Larson, 2014). For example, this approach would facilitate important insights into how eating-related thoughts and behaviors in everyday life may lead to immediate changes in self-esteem.
Moreover, future research should continue to test for moderators of the effects between low self-esteem and eating pathology. The moderator variables tested in the present meta-analysis (i.e., age, gender, sample type, and time lag) were not significant. However, other factors, such as the person’s living situation, eating habits, and self-regulation abilities, may be important moderators. For example, living together with close others (e.g., one’s parents or a partner) and sharing meals together may serve as a protective factor and thereby reduce the effect of low self-esteem on eating pathology. The person’s level of self-regulation may also moderate the effects between low self-esteem and eating pathology. However, self-regulation may affect the relation between self-esteem and specific symptoms in different ways. For example, whereas better self-regulation abilities may buffer the effect of low self-esteem on binge eating, they might increase, rather than buffer, the effect of low self-esteem on restrained eating.
The findings of the present meta-analysis may also have practical implications. If future research supports the causality of the effects, then eating pathology could be improved by fostering self-esteem. In fact, research suggests that it is possible to increase self-esteem through interventions (Haney & Durlak, 1998; Niveau et al., 2021). We note that many factors influence complex phenomena such as eating pathology, and future research would benefit from combining information on a broad set of factors (e.g., by using procedures such as machine learning). Nevertheless, the present research suggests that self-esteem is one of the factors that contributes to the development of eating pathology. Another practical implication is related to the reverse direction of effects, that is, that it might be possible to improve self-esteem by treating eating pathology (e.g., Linardon et al., 2019). Given the findings from the present meta-analysis, the individual’s self-esteem could benefit particularly from modifying the cognitive-affective symptoms of eating disorders, such as eating concern and negative body image.
Supplemental Material
sj-docx-1-cpx-10.1177_21677026221144255 – Supplemental material for The Link Between Low Self-Esteem and Eating Disorders: A Meta-Analysis of Longitudinal Studies
Supplemental material, sj-docx-1-cpx-10.1177_21677026221144255 for The Link Between Low Self-Esteem and Eating Disorders: A Meta-Analysis of Longitudinal Studies by Samantha Krauss, Laura C. Dapp and Ulrich Orth in Clinical Psychological Science
Footnotes
Acknowledgements
Transparency
Action Editor: Kelsie T. Forbush
Editor: Jennifer L. Tackett
Author Contribution(s)
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
