Abstract
Bulimia Nervosa (BN) represents an important public health problem and is related to serious morbidity and even mortality. All types of anti-depressants seem to be beneficial to some degree in relieving bulimic symptoms in a large number of patients [1]. However, there is no clear evidence of a differential effect among the various antidepressants used. Moreover, it is not known if antidepressants have a specific anti-bulimic effect or if their success is a result of their antidepressant effects. This review attempts to systematically evaluate the use of anti-depressant medications compared with placebo for the treatment of BN, providing a statistical synthesis of the comparative data available to date.
Objectives
The aims were directed to the following questions: (i) are antidepressants clinically effective for the treatment of BN when compared with placebo? (ii) is there any differential effect for the various classes of antidepressants? and (iii) is the anti-bulimic effect of antidepressants independent of the effect on depressive symptoms?
Methods
Relevant trials for this review were identified by searching the following seven electronic databases, by means of the Cochrane Depression, Anxiety and Neurosis Group strategy for randomised controlled trials (RCTs): MEDLINE; EMBASE; LILACS; PsycLIT; the Cochrane Depression, Anxiety and Neurosis Group Database of Trials; the Cochrane Controlled Trials Register and the Science Citation Index (SCISEARCH). Reference lists of selected articles were inspected for further relevant studies. Pharmaceutical companies and first authors of all selected studies were contacted in order to obtain data on possible unpublished studies.
The review considered randomised, double-blind, placebo-controlled studies investigating any class of antidepressant medications in the treatment of patients with purging or non-purging type BN. Patients could have been diagnosed according to Russell, DSM-III, DSM-III-R, DSM-IV or ICD-10 criteria [2–6]. Parallel or crossover studies from which data for the first treatment period could be obtained and treated as a parallel trial were eligible. Data for at least one primary outcome of interest should have been reported or acquired.
As concealment of randomisation has been shown to affect trial outcomes [7],[8], two reviewers (JB and PH) independently graded the quality of allocation concealment according to the categories described in the Cochrane Collaboration Handbook (CCH) [7]. Trials with low or moderate risk of bias (categories A or B, respectively) were included. Trials with a high risk of bias due to inadequate allocation concealment were excluded. When no information on the randomisation process could be obtained from the authors, double-blind trials were included in category B. An inter-rater reliability study between reviewers was performed by means of the weighted kappa coefficient. The result indicated a very good agreement beyond chance (observed agreement of 91%%, K = 0.80, p < 0.0001). Studies comparing two active drugs or drugs with psychological approaches were not eligible for this review. Studies were also excluded if participants were patients with binge-eating/purging type anorexia nervosa or binge-eating disorder, as defined in DSM-IV [5], and if no information regarding the first treatment period of crossover studies could be obtained. In case of discrepancies, authors were contacted to have doubts resolved. Most excluded studies were reviews, subsets from other studies or did not provide any data for at least one primary outcome of interest.
The efficacy outcomes selected were remission, clinical improvement and changes in bulimic symptoms. Remission was defined as 100%% reduction in binge-eating episodes from baseline to endpoint, and clinical improvement as more than 50%% reduction. As we could not assume beforehand that all patients included in the studies would be purging-type bulimic individuals, binge-eating episodes were considered the main bulimic symptoms. A sensitivity analysis was performed to verify if purging and bingeing could be conflated.
Comorbid depression was evaluated through the mean difference in severity of depressive symptoms at endpoint.
Tolerability and acceptability of treatment were based on the number of patients dropping out during the study due to adverse events and for any cause.
Meta-analytic calculations were made by means of the statistical package ARCUS Quickstat for windows [9]. Remission, clinical improvement and drop-outs were analysed by calculating DerSimonian-Laird [10] estimate of relative risks (DL-RR) and approximate 95%% confidence interval. By convention, DL-RRs less than 1 indicated that an event was more likely to occur in the active treatment group. When overall results were significant, the number needed to harm (NNH) to produce or the number needed to treat (NNT) to prevent one outcome was calculated on the inverse of the absolute risk reduction [11].
Continuous outcomes (bulimic and depressive symptoms) were analysed by estimating effect size across all of the studies. The difference between the means of the antidepressant and placebo groups was divided by their pooled standard deviation (Hedges ‘g’ [12]). These g values were corrected for bias due to sample size, with the effect expressed as the unbiased estimator ‘d’. The pooled mean effect size was calculated using direct weights defined as the inverse of the variance of d for each study. According to Cohen [13], a large effect size is 0.8, a moderate effect size is 0.5 and a small effect size is 0.2.
Heterogeneity in the results of the trials was assessed by a test of heterogeneity using a Q (com-binability) statistic. A value of p < 0.05 indicated statistical heterogeneity.
The presence of selective publication was detected graphically in a funnel plot [14]. The number of studies (with null results) residing in the file drawers and needed to overturn the results of the combined significant test was assessed by means of the fail-safe N [12],[15].
Results
Characteristics of all eligible studies
Two studies were multi-centre, multinational [16],[17], both comparing fluoxetine to placebo. Ten trials were conducted in the USA [18–27], two in Canada [28],[29], one in the UK [30] and one in Norway [31]. Trials recruited outpatients seeking treatment ‘spontaneously’ from eating disorders clinics or programs, or included also volunteers for research recruited through advertisement.
The study population included in the trials presented reasonably comparable demographic and behavioural features. Only one study [23] included non-purging type bulimic subjects. Patients were mostly adult and young adult females, few adolescents being included. Underweight and overweight patients were constantly excluded.
All comparisons concerned one single antidepressant agent versus placebo. The only dose-response study conducted was the first multi-centre fluoxetine study [16]. Fluoxetine (60 mg/day) was clearly superior to both placebo and the usual 20 mg per day antidepressant dosages. The doses of other anti-depresssants have generally been similar to those employed in the treatment of depression.
Only two studies [17],[30] were graded as A according to the CCH criteria [7], thanks to information provided by the authors. All other 14 trials were graded as B. Blinding procedures were inadequately described in most trials. Some did not specify number or reasons for drop-outs, excluded patients after randomisation from analysis and did not report an intention-to-treat analysis. Some trials also omitted the standard deviations of continuous variables.
Efficacy
Composite RR for efficacy, homogeneity tests and NNT
Pooled DL-RR for the eight studies reporting remission of bulimic episodes favoured drugs. The subgroup analysis confirmed that results were significant when bingeing and purging were analysed separately (DL-RR = 0.88, 95%% CI = 0.83–0.94 for bingeing and DL-RR = 0.90, 95%% CI = 0.81–0.99 for purging). According to the NNT, it would be necessary to treat 9 bulimic patients with an antidepressant to obtain one complete remission in a short-term, when compared with placebo, considering the 92%% non-remission rate of bulimic episodes in the placebo controls as a measure of baseline risk.
Clinical improvement was reported in eight trials (Table 1). Effects were stronger than those observed for remission (NNT = 4).
No published negative study was found; however, the search detected at least one unpublished multi-centre study performed in France and the UK [32], showing a lack of greater efficacy of fluvoxamine compared with placebo. Authors were contacted but did not provide their unpublished results. The number of trials necessary to overturn the results of the combined significance test (fail-safe N) was 18 for remission and 122 for clinical improvement.
The analysis of the mean difference in bulimic and depressive symptoms was based on the number of patients reporting these symptoms at the end of the trials. Five trials reported mean and standard deviations for bulimic symptoms at endpoint. DerSimonian-Laird pooled estimate of standardised effect size for difference in bulimic symptoms was −0.59 in favour of the active antidepressant drugs (95%% CI = − 0.87, − 0.31, p < 0.0001). According to Cohen's nomenclature this effect size is moderate [13].
Comorbidity (depressive symptoms)
Seven trials were included in the analysis. Standardised effect size was −0.23. No statistically significant difference was observed among classes of drugs. As individual patient data were not available, it was not possible to determine if the effects of anti-depressants on bulimic symptoms were independent of their effect on depressive symptoms.
Tolerability and acceptability
Composite RR for tolerability and acceptability of treatment, homogeneity tests and NNH
Tricyclic antidepressants (TCAs) and selective serotonin reuptake inhibitors (SSRIs) showed statistically significant results, but in opposite directions (Table 3). The pooled analysis of TCA studies showed that drop-outs were more likely in the active drug group (29%% vs 14.4%%), while in the case of SSRIs they were more likely for placebo (37.2%% for drug and 40.1%% for placebo).
Sensitivity analysis
Four pre-specified sensitivity analyses, subsidiary to the main review questions and based on non-randomised comparisons, were performed. They intended to confirm the hypotheses that response differed according to: (i) quality of the trials; (ii) class of drugs; (iii) length of treatment (< 8 weeks vs > 8 weeks); and (iv) diagnostic criteria (DSM-III vs DSM-III-R). Assessments were made by looking at separate subgroup of studies. None of the differences were statistically significant.
Discussion and conclusions
Bulimia nervosa is considered a complex and multi-factorial clinical condition, difficult to treat, with a chronic course [33]. Almost nothing is known about its natural course in the absence of treatment [34]. Investigations of the use of antidepressants for its treatment have been limited by several conceptual and methodological problems [35].
Indeed, most of the papers reviewed did not provide relevant information, such as details about adequacy of randomisation, blinding procedures and reasons of drop-outs. Crossover studies did not provide information for the first period, so carry-over or withdrawn effects were not taken into account. These methodological inconsistencies may have altered the perceived differences in efficacy between antidepressants and placebo and should be taken into account in the appraisal of results of the present review.
The most frequently reported outcome for efficacy was the mean per cent reduction in bulimic episodes, but measures of variance were missing in some trials. This outcome was not considered helpful to guide clinical decision-making as it does not translate benefits of treatment. A 70%% reduction in bulimic episodes in the experimental group, compared with a 50%% reduction in the placebo control group does not inform which is the probability of being free of bulimic symptoms or presenting a relevant clinical improvement when treated with a specific drug. Remission and clinical improvement were considered more appropriate outcomes for efficacy. Nevertheless, this information was not available for a considerable number of trials.
Concerning generalisability of the main findings, BN patients included in the trials seem similar to those seen in clinical settings in terms of age, duration of illness and severity of symptoms. However, most trials used stringent definition for bulimia or BN and usually did not include patients with comorbid severe depression, serious personality disorders or other ‘atypical’ conditions more difficult to treat (e.g. multi-impulsive patients or substance abusers). Therefore, results of this review are directed to bulimic patients without severe comorbidity.
This meta-analysis provides statistical evidence that a single antidepressant agent is clinically effective for the treatment of BN when compared with placebo, but the effect is modest. Although there were differences in terms of definition of illness according to different diagnostic criteria, length of treatment, quality of trials, and classes of drugs, a homogeneous therapeutic effect was found. No statistically significant differential effect regarding efficacy and tolerability among TCAs, SSRIs, MAOIs or other classes of antidepressants could be demonstrated. Remission rates were low and a considerable fraction of patients did not show a reduction of at least 50%% in bulimic symptoms in the short-term. Clinical improvement was consistent for all classes of antidepressants, though most of the patients would still fulfil criteria for BN at the end of the trials. It is proposed that single pharmacotherapy is less accepted for patients with BN than alternative single psychological therapies [36–38]. Medication studies without concurrent psychological approaches included in this review confirmed the high drop-out rates observed in most trials evaluating single pharmacological treatments, in part because of side-effects and in part because of some patients' negative attitudes toward medication use. The better acceptability of SSRIs may be related to their short-term effects on appetite and weight.
Few studies followed patients after the short-term double-blind treatment periods. Two trials reported a significant relapse rate (30–45%%) on improved patients followed for 4–6 months [20],[39]. We could find only one randomised, double-blind, placebo-controlled trial designed to specifically evaluate prevention of relapse with fluvoxamine [40]. Relapse rates during a 15-week maintenance period after successful psychotherapy treatment were significantly higher for placebo, but drop-out rates were very high (19 out of 33 patients randomised to fluvoxamine). The paucity of knowledge concerning the natural history of BN [34] makes it difficult to assess the long-term effects of treatments. Given the chronic, frequently relapsing course of treated patients, short-term results are of limited clinical value and long-term studies are warranted.
Antidepressants are effective for the treatment of minor depression [41] and also show a positive effect in the treatment of BN. Depressive symptoms are common in bulimic patients [42]. It is possible that antidepressants have both a direct effect over bulimic symptoms and an effect over depressive symptoms, reducing indirectly the disturbed eating behaviours. However, as individual patient data for concurrent depression was not systematically provided, it was not possible to conclude how depression affected short-term efficacy of antidepressants on bulimic symptoms.
It is important to further investigate and report whether TCAs and SSRIs present the same efficacy and frequency of drop-outs due to adverse events, describing these events and the respective remission rates. Therefore, it may be appropriate to directly compare these two classes of antidepressants regarding their tolerability and cost-effectiveness.
Future research should systematically include bulimic patients with comorbid major depression, personality disorders, substance abuse and other relevant clinical conditions. The impact of depression and other comorbid disorders on prognosis should be evaluated, improving generalisability of results. The option of trying a second antidepressant agent to reduce side-effects or increase efficacy should also be evaluated.
Footnotes
Acknowledgements
The authors wish to acknowledge Irismar R. Oliveira, Sergio L. Bartczak and Mauricio S. de Lima for advice and Roberta P. Trefiglio for assistance.
