Abstract
For research purposes, it is generally accepted that experimental ostracism manipulations can lead to a reduction of participants’ well-being. To eventually restore participants’ well-being, researchers rely on post-experimental debriefings that discredit prior deception. However, evidence suggests that discredited beliefs can persevere. The present research investigates whether a potent debriefing procedure restores participants’ well-being after an experimentally induced ostracism experience. In two studies, participants were either excluded or included in a Cyberball game, indicated their well-being, and were debriefed. In two additional conditions, participants were debriefed before indicating their well-being. Ostracism compared to inclusion led to decreased positive and increased negative mood. The debriefing did not counteract this effect (Studies 1 & 2). Unwanted aftereffects of the manipulation persevered for more than one day after the experimental session (Study 2). These findings question the effectiveness of debriefings and raise issues about research ethics.
In their Ethical Principles of Psychologists and Code of Conduct, the American Psychological Association (2017) deals with the question of inflicting harm on participants and sets a clear standard: “Psychologists [need to] take reasonable steps to avoid harming their … research participants … and to minimize harm where it is foreseeable and unavoidable” (p. 6). Starting decades ago, researches have thought of two different strategies on how to take those ‘reasonable steps’: (1) the abandonment of any harmful manipulations in psychological studies (e.g., Baumrind, 1964), and (2) the employment of precautions such as recruiting only psychologically stable individuals as participants (e.g., Kelman, 1967; Walster et al., 1967).
In social psychological research, these approaches have not been widely adopted: Manipulations that can lower participants’ well-being are being used and careful a priori screening of participants is usually not mentioned in the respective publications.
Historically, there have been vivid discussions regarding whether and how research can still be ethical in this context (e.g., Adair et al., 1983; Elms, 1975; Tesch, 1977; West & Gunn, 1978). Following Milgram's (1964) justification of his obedience studies, the idea of justifying the use of potentially harmful manipulations (even those that are not in accordance with the APA guidelines) through post-experimental debriefing became common practice (e.g., Adair et al., 1985; Sharpe & Faye, 2009; Smith & Richardson, 1983; Sommers & Miller, 2013; Tesch, 1977; Walster et al., 1967).
However, formal evidence for the effectiveness of debriefings is scarce (e.g., Sharpe & Faye, 2009; Tesch, 1977). The few existing studies conclude that debriefings are not necessarily effective (McFarland et al., 2007; Miketta & Friese, 2019; Oczak & Niedźwieńska, 2007; Ross et al., 1975; Silverman et al., 1970; Walster et al., 1967). As a result, several researchers made efforts to develop more effective debriefing procedures. For example, McFarland and colleagues (2007) compared a Standard Outcome Debriefing, which informed participants about the false feedback they had received, with their newly created Revised Outcome Debriefing that additionally informed participants about the fact that the ostensible social perceptiveness test that they took was not real either (and therefore invalid). This Revised Outcome Debriefing proved to be more effective in eliminating the effects of the manipulation than the Standard Outcome Debriefing.
The research by McFarland and colleagues suggests that the Revised Outcome Debriefing is the most potent type of debriefing and able to undo the effects of falsely given information. Doubts remain, however, if this conclusion extends to highly threatening experimental manipulations. In the work by McFarland et al. (2007), participants received false feedback about an ostensible social perceptiveness task in which they had to distinguish fake suicide notes from real ones. In the negative feedback condition, participants then received the feedback that they had performed poorly at distinguishing the suicide notes. However, in contrast to some widely used experimental manipulations in social psychology that are known to cause detrimental effects on well-being (e.g., Hartgerink et al., 2015; Leary et al., 2009), there is no empirical evidence for detrimental effects caused by the suicide note manipulation used by McFarland and colleagues. Furthermore, the ability to reliably distinguish real from fake suicide notes is not commonly considered a core competence relevant for one's personal identity.
Being included in social groups, by contrast, is a fundamental need (Baumeister & Leary, 1995; Smith et al., 1999) that has even been argued to be crucial for survival (Caporael, 1997; van Beest & Williams, 2006). It should thus not come as a surprise that ostracism – being ignored and excluded by others – can have worrisome effects, even if the ostracism experience is part of a psychological research study that is subject to ethical regulations: after being ostracized in the laboratory, people show more negative mood (Gonsalkorale & Williams, 2007; Williams et al., 2000), lower self-esteem, sense of control, and belonging, as well as increased feelings of meaninglessness (Gonsalkorale & Williams, 2007; Jamieson et al., 2010; Williams et al., 2000), and more aggressive behavior (Warburton et al., 2006; Wesselmann et al., 2010) compared to people who have not been ostracized (for a meta-analysis, see Hartgerink et al., 2015).
The most common procedure to experimentally manipulate ostracism in the laboratory is the so-called Cyberball game (Gonsalkorale & Williams, 2007; Williams et al., 2000). In this game, participants play a virtual ball tossing game with two ostensible other participants over the internet. Unknown to participants, instead of actual others playing with them, the computer runs a pre-programmed script. This script ensures, in the inclusion condition, that participants receive the ball approximately a fair one third of the time while ostracized participants receive the ball once or twice at the beginning of the game and then never receive it again while the other two players keep throwing the ball to each other. This procedure has been employed in more than 200 published studies (Hartgerink et al., 2015).
For most Cyberball studies it remains unknown if and how participants were debriefed: a systematic review of 39 peer-reviewed Cyberball studies published in 2021 revealed that for 33 studies a debriefing was either not mentioned or its description did not contain any details on what the procedure entailed (e.g., “participants were debriefed”; see Supplement for review details). For a further five studies, the description contained only minimal, vague information about the debriefing (e.g., “participants were given information about the study purpose”). None of the 39 studies included any information on the effectiveness of the employed debriefing.
Furthermore, the question whether a debriefing procedure is effective in eliminating the effects of an ostracism experience remains unanswered in general. Some research even suggests that the Cyberball manipulation may have similarly detrimental effects when participants know in advance that they will be interacting with a pre-programmed computer instead of real persons (Zadro et al., 2004). Since a post-experimental debriefing usually provides participants with information about the false nature of the manipulation, the information that participants received in the study by Zadro and colleagues can be thought of as a pre-experimental debriefing. This “debriefing” had no softening effect on the negative impact caused by the manipulation, rendering the effectiveness of the post-experimental debriefing all the more important.
The aim of the present research was to investigate whether a Revised Outcome Debriefing would be effective in eliminating the effects on subjective well-being caused by a Cyberball ostracism manipulation. In two studies, we sought to conceptually replicate the effects of the Cyberball manipulation on indicators of subjective well-being such as mood and self-esteem. In Study 2, we additionally investigated potential aftereffects of the ostracism manipulation several hours after participants had left the laboratory. Lastly, we conducted internal meta-analyses of both studies. These analyses allowed for a concurrent examination of the overall evidence with higher statistical power. Both studies received approval from the local ethics committee.
Study 1
In Study 1, participants played Cyberball and were either included or ostracized by two ostensible other players. We expected participants in the ostracism condition to show impaired subjective well-being as indicated by more negative mood, less positive mood, and lowered state self-esteem. Most importantly, we investigated whether the Revised Outcome Debriefing was able to remedy the effects of the ostracism manipulation.
Material and Methods
Participants and Design
We aimed to recruit as many participants as possible until the end of the academic term. Eighty-five university students of various disciplines (67% females; mean age M = 22.91, SD = 2.96) participated in exchange for sweets and a voucher for a café on campus. They were randomly assigned to one of four conditions in a 2 (ostracism condition: inclusion vs. ostracism) × 2 (debriefing status [at the time of dependent variables (DV) measurement]: debriefed vs. not debriefed yet) design. Participants were run in individual sessions that lasted about 20 min. No participants were excluded from the study.
Procedure
After giving written informed consent, participants were asked to play a virtual ball tossing game on the computer. Afterwards, participants in the not debriefed yet conditions completed the dependent variables (positive mood, negative mood, state self-esteem) as well as demographics (age, gender, major). They then received the debriefing. Participants in the debriefed conditions received the debriefing before completing the dependent measures. After the debriefing and completion of the dependent measures, participants were thanked, compensated, and dismissed.
Experimental Manipulations
The game consisted of 50 trials and lasted for approximately four minutes. In the ostracism condition, participants received the ball once from each of the other two ostensible players at the beginning of the game. Then they never received the ball again. In the inclusion condition, participants received the ball after every other throw from each of the other two ostensible players which adds up to receiving the ball a third of the time in total.
Debriefing
We provided participants with a written Revised Outcome Debriefing, the presumably most effective kind of debriefing (McFarland et al., 2007). The national recommendations for researchers and ethics committees favoured the use of a written and standardized debriefing (see Deutsche Gesellschaft für Psychologie [German Psychological Society; DGPs], 2018). Therefore, we provided participants with a standardized written debriefing. The debriefing, which had been developed in agreement with the local ethics committee, was presented on the computer screen. Participants read that in contrast to what they had learned earlier (1) there were no other participants, (2) the two universities did not exist, (3) the ball tossing game consisted of the computer running a pre-programmed script and that (4) they had been randomly assigned to a condition where they either received the ball only twice at the beginning of the game and then never received it again (ostracism condition) or where they received the ball regularly (inclusion condition). To ensure that participants understood the debriefing, they were asked to explicitly confirm the following sentence in order to continue with the study: “I fully understand that the ‘other participants’ were the computer running a script and that I have not actually been excluded/playing with someone”. All participants confirmed this statement. In addition to receiving the Revised Outcome Debriefing, participants were provided with the contact details of a trained psychologist in case they would want to talk about potentially experienced distress related to their participation in the study.
Dependent Variables
Mood and state self-esteem served as dependent variables (Hartgerink et al., 2015). 1 Given evidence that positive and negative mood are two different dimensions (e.g., Crawford & Henry, 2004; Watson et al., 1988), we included measures of both dimensions.
Results
Analytic Strategy
For each dependent variable, we ran a 2 (ostracism condition: inclusion vs. ostracism) × 2 (debriefing status: debriefed vs. not debriefed yet) ANOVA. We first examined the predicted simple main effect of the ostracism condition in the not debriefed yet condition (mirroring previous research) to establish the focal effect that the debriefing procedure would aim to remedy. In the case of evidence for this simple main effect, we then analyzed the simple main effect of the ostracism condition in the debriefed conditions. Next, we investigated whether the general main effect of the ostracism condition was significant across both debriefing status conditions. Then we examined a potential interaction between the ostracism condition and debriefing status. Finally, we looked for a potential (unexpected) main effect of the debriefing status.
If there was no evidence for the simple main effect of the ostracism condition in the not debriefed yet conditions, this meant that ostracism did not significantly influence the dependent variable. In this case, our main research question concerning the effectiveness of the debriefing could not be examined: it is impossible to investigate whether a non-existent effect of the ostracism manipulation persevered after the debriefing. Therefore, we do not report the full set of analyses in these cases.
Positive Mood
In the not debriefed yet conditions, participants reported less positive mood after ostracism (M = 1.91, SD = 0.45) than after inclusion (M = 2.37, SD = 0.73), t(35.29) = -2.46, p = .019, d = −0.75, as expected. In the debriefed conditions, the simple contrast of the ostracism condition revealed that the effect persevered after the debriefing (MOstr = 2.08, SD = 0.47; MIncl = 2.46, SD = 0.62), t(35.07) = -2.25, p = .031, d = −0.70. The main effect of the ostracism condition was significant (MOstr = 2.00, SD = 0.46; MIncl = 2.41, SD = 0.68), F(1, 81) = 11.04, p = .001, ηp² = .12. There was no interaction between the ostracism condition and debriefing status F(1, 81) = 0.09, p = .766, ηp² < .01. Finally, there was no main effect of debriefing status (MNoDeb = 2.15, SD = 0.65; MDeb = 2.25, SD = 0.58), F(1, 81) = 1.05, p = .308, ηp² = .01 (Figure 1A). Taken together, ostracism led to decreased positive mood, and the debriefing did not remedy this effect.

1A – 1C. Positive and negative mood, as well as state self-esteem as a function of ostracism condition and debriefing status at the time of DV measurement. Participants who had been ostracized showed lower levels of positive mood and higher levels of negative mood than participants who had been included, no matter whether they had been debriefed at the time of the collection of the respective dependent variable or not. There were no significant effects on state self-esteem. Error bars indicate +/- 1 SEM. *p < .05.
Negative Mood
In the not debriefed yet conditions, participants reported more negative mood after ostracism (M = 1.60, SD = 0.49) than after inclusion (M = 1.27, SD = 0.26), t(28.31) = 2.71, p = .011, d = 0.85. In the debriefed conditions, the simple contrast of the ostracism condition revealed that the effect persevered after the debriefing (MOstr = 1.43, SD = 0.45; MIncl = 1.20, SD = 0.19), t(30.99) = 2.25, p = .032, d = 0.65. The main effect of the ostracism condition was significant (MOstr = 1.51, SD = 0.47; MIncl = 1.24, SD = 0.23), F(1, 81) = 12.29, p = .001, ηp² = .13. There was no interaction between the ostracism condition and debriefing status F(1, 81) = 0.43, p = .516, ηp² < .01. Finally, there was no main effect of debriefing status (MNoDeb = 1.43, SD = 0.42; MDeb = 1.32, SD = 0.37), F(1, 81) = 2.24, p = .138, ηp² = .03 (Figure 1B). Taken together, ostracism led to increased negative mood, and the debriefing did not remedy this effect.
State Self-Esteem
Contrary to expectations, in the not debriefed yet conditions, participants showed similar levels of state self-esteem after ostracism (M = 3.51, SD = 0.67) as after inclusion (M = 3.49, SD = 0.63), t(81) = 0.14, p = .889, d = 0.03 (Figure 1C). Therefore, we stopped the analysis at this point.
Discussion
Participants who had been ostracized in the Cyberball game showed decreased positive mood and increased negative mood compared to participants who had been included in the game. A Revised Outcome Debriefing failed to eliminate these effects.
Contrary to previous research (e.g., Gonsalkorale & Williams, 2007; Jamieson et al., 2010; Williams et al., 2000), the Cyberball manipulation did not reliably affect state self-esteem. This finding aligns with the observations that negative effects of ostracism on self-esteem can be delayed, dependent on rumination, or subject to defensive reactions that protect the self (Blackhart et al., 2009; Kunda, 1990; Tesser, 2000).
Study 2
In Study 2 we sought to replicate the effects of ostracism on mood obtained in Study 1. Different than in Study 1, we used an implicit self-esteem task directly tailored to the ostracism manipulation that may be less susceptible to intentional response editing compared to the self-report measure employed in Study 1. In addition, Study 2 examined potential aftereffects of the ostracism experience not only directly after the debriefing, but also several hours after participants had left the laboratory.
Material and Methods
Participants and Design
We aimed to recruit as many participants as possible until the end of the academic term. Ninety-two university students of various disciplines (40% females; mean age M = 22.52, SD = 2.38) participated in exchange for sweets and a voucher for a café on campus. They were randomly assigned to one of four conditions in a 2 (ostracism condition: inclusion vs. ostracism) × 2 (debriefing status: debriefed vs. not debriefed yet [at the time of the DV measurement]) design. Participants were run in individual sessions that lasted about 20 min. No participants were excluded from the study.
Procedure
The procedure during the laboratory session was the same as in Study 1. In addition, participants were asked to fill out an online follow-up questionnaire later on the same day. Then they were thanked, compensated, and dismissed. Later on the same day, participants received a personalized link to the online questionnaire via email. The link contained coded information about the participant number and experimental condition, both of which were automatically saved to the data file. This allowed us to match the data from the lab session and the follow-up for each participant. We did not retrieve any information about participants’ identity. Therefore, anonymity was guaranteed.
Experimental Manipulations
Ostracism and debriefing status were manipulated in the same way as in Study 1.
Debriefing
The debriefing and referral to a trained psychologist were the same as in Study 1.
Dependent Variables
Results
Analytic Strategy
The analytic strategy was the same as in Study 1, except for the follow-up questionnaire: at the time of the measurement of potential aftereffects, all participants had already been debriefed. We therefore collapsed across the debriefing status conditions for this dependent variable.
Positive Mood
As expected, in the not debriefed yet conditions, participants reported less positive mood after ostracism (M = 2.67, SD = 0.95) than after inclusion (M = 3.17, SD = 0.85), t(44.83) = -1.90, p = .064, d = −0.55, although this effect was not quite significant in a two-tailed analysis. The simple contrast of the ostracism condition in the debriefed conditions revealed that the effect persevered after the debriefing (MOstr = 2.83, SD = 0.79; MIncl = 3.26, SD = 0.60), t(39.15) = -2.08, p = .044, d = −0.62. The main effect of the ostracism condition was significant (MOstr = 2.74, SD = 0.87; MIncl = 3.21, SD = 0.73), F(1, 88) = 7.66, p = .007, ηp² = .08. There was no interaction between the ostracism condition and debriefing status F(1, 88) = 0.04, p = .848, ηp² < .01. Finally, there was no main effect of debriefing status (MNoDeb = 2.91, SD = 0.93; MDeb = 3.05, SD = 0.72), F(1, 88) = 0.56, p = .456, ηp² = .01 (Figure 2A). Taken together, ostracism led to decreased positive mood, and the debriefing did not remedy this effect.

2A – 2C. Positive and negative mood, as well as implicit self-esteem as a function of ostracism condition and debriefing status at the time of DV measurement. Participants who had been ostracized showed lower levels of positive mood and higher levels of negative mood than participants who had been included, no matter whether they had been debriefed at the time of the collection of the respective dependent variable or not. There were no significant effects on implicit self-esteem (as indicated by the IAT D1 score). Error bars indicate +/- 1 SEM. † p < .10, *p < .05.
Negative Mood
As expected, in the not debriefed yet conditions, participants reported more negative mood after ostracism (M = 1.77, SD = 0.77) than after inclusion (M = 1.34, SD = 0.40), t(35.33) = 2.37, p = .024, d = 0.69. The simple contrast of the ostracism condition in the debriefed conditions revealed that the effect persevered after the debriefing (MOstr = 1.66, SD = 0.70; MIncl = 1.27, SD = 0.41), t(33.77) = 2.24, p = .032, d = 0.68. The main effect of the ostracism condition was significant (MOstr = 1.71, SD = 0.73; MIncl = 1.31, SD = 0.40), F(1, 86) = 10.31, p = .002, ηp² = .11. There was no interaction between the ostracism condition and debriefing status F(1, 86) = 0.50, p = .482, ηp² = .01. Finally, there was no main effect of debriefing status (MNoDeb = 1.56, SD = 0.65; MDeb = 1.47, SD = 0.60), F(1, 86) = 0.02, p = .890, ηp² < .01 (Figure 2B). Taken together, ostracism led to increased negative mood, and the debriefing did not remedy this effect.
Implicit Self-Esteem
In the not debriefed yet conditions, participants showed similar levels of implicit self-esteem after ostracism (M = 0.72, SD = 0.38) as after inclusion (M = 0.69, SD = 0.45), t(88) = 0.28, p = .783, d = 0.07 (Figure 2C). Therefore, we stopped the analysis at this point.
Aftereffects
Fifty-two participants (57% retention rate; no difference between conditions) completed the online follow-up questionnaire on average 41.93 h after they had left the laboratory (Md = 10.96 h, SD = 59.83). All participants had been debriefed at this point. Therefore, only the effect of the ostracism condition could be meaningfully analyzed. After ostracism compared to inclusion participants reported more preoccupation with the study and feeling affected by it in a more negative way (MOstr = 2.03, SD = 0.57; MIncl = 1.61, SD = 0.48), F(1,51) = 8.04, p = .007, ηp² = .14. This effect was not significantly influenced by the time that had passed since the laboratory session, F(1,51) = 2.05, p = .159, ηp² = .04. Separate follow-up analyses of the subscales “preoccupation with the study” and “emotional consequences” led to similar results: after ostracism compared to inclusion participants tended to think about their general likeability as well as about the specific Cyberball study more often (MOstr = 2.03, SD = 0.68; MIncl = 1.70, SD = 0.56), F(1,51) = 3.55, p = .065, ηp² = .06, and felt emotionally affected by it in a more negative way (MOstr = 2.04, SD = 0.62; MIncl = 1.51, SD = 0.57), F(1,51) = 10.09, p = .003, ηp² = .16.
Discussion
Participants who had been ostracized in the Cyberball game experienced less positive and more negative mood compared to participants who had been included. A Revised Outcome Debriefing failed to eliminate these effects. Moreover, participants in the ostracism conditions reported a more negative impact of the experiment several hours after leaving the laboratory even though all participants had already been debriefed at that point.
The Cyberball manipulation did not affect participants’ implicit self-esteem. This might be due to methodological problems: Implicit self-esteem has been criticized for its psychometric properties and weak relationship with well-being (e.g., Buhrmester et al., 2011; Falk et al., 2015; Gawronski et al., 2017).
Internal Meta-Analytical Summary
To summarize the effects of ostracism and of the debriefing, we conducted internal meta-analyses that allow for more robust conclusions than individual studies (Braver et al., 2014; Goh et al., 2016; Maner, 2014).
Method
First, we estimated the basic ostracism effect on positive and negative mood between the ostracism not debriefed yet condition versus the inclusion not debriefed yet condition (following Goh et al., 2016). Second, we examined whether the debriefing improved well-being compared to the ostracism condition that had not included a debriefed yet. Third, we examined whether after the debriefing participants’ well-being was similar to the well-being of participants who did not experience ostracism in the first place (i.e., the inclusion debriefed condition).
Results
Basic Ostracism Effect
In the not debriefed yet conditions, participants reported less positive (d = −0.65, p = .003) and more negative (d = −0.77, p < .001) mood after ostracism than after inclusion (see Table 1).
Results of the Internal Meta-Analyses.
Note. CI = confidence interval; number of studies = 2.
Ostracism/not Debriefed Versus Ostracism/Debriefed
In the ostracism conditions, a comparison between participants who had experienced ostracism and had versus had not been debriefed revealed nonsignificant effects for positive (d = −0.27, p = .204) and negative (d = 0.25, p = .238) mood. The small effect sizes indicated that the debriefing was insufficient to remedy the effects of the ostracism experience.
Inclusion/Debriefed Versus Ostracism/Debriefed
After the debriefings, participants’ well-being was still lower after an ostracism experience compared with participants who had not been ostracized in the Cyberball game as indicated by both positive (d = −0.65, p = .003) and negative (d = 0.67, p = .002) mood. Thus, the debriefings did not restore well-being compared with participants who did not experience ostracism.
General Discussion
In two studies, participants were ostracized in a virtual ball-tossing game and subsequently indicated their positive and negative mood as well as their self-esteem either before or after receiving a Revised Outcome Debriefing (McFarland et al., 2007). Ostracism compared to inclusion led to less positive and more negative mood. The debriefing failed to undo these effects. Additionally, negative thoughts and feelings regarding the manipulation persevered the debriefing and persisted on average for more than one day (Study 2, M
On closer consideration, perhaps the present results should not come as a surprise. Distress caused by ostracism can be resilient to moderation by situational factors (Williams, 2007): ostracism can still impair well-being if participants are ostracized by a highly despised outgroup (Gonsalkorale & Williams, 2007), if ostracism comes with an incentive (e.g., because inclusion costs; van Beest & Williams, 2006), and even if participants know that they are playing with a pre-programmed computer script (Zadro et al., 2004).
Ethical Considerations
To interpret these findings in a broader context, it is helpful to distinguish between deception, harm, and deception about harm. Deception can occur without the employment of harm (e.g., when participants are being deceived about supposedly independent experiments that are actually part of the same study). Harm can occur without deception (e.g., in trauma research where participants might be fully informed about the induction of an analogue trauma). Lastly, harm and deception can coincide when the informed consent omits the information that potentially harmful experiences are part of a specific study. While a debriefing can, at least theoretically, ameliorate harm, deception cannot be undone through the use of a debriefing procedure.
The three concepts, deception, harm, and deception about harm, differ in how they are being treated by the APA's (2017) Code of Conduct: Deception should not be used unless it “is justified by the study's significant prospective scientific, educational, or applied value and … effective nondeceptive alternative procedures are not feasible” (p.11) and harm should be avoided altogether or, at least, minimized “where it is foreseeable and unavoidable” (p. 6). Additionally, researchers who notice that a study has harmed participants are required to “take reasonable steps to minimize the harm” (p. 12). In contrast to this approach that allows for rare exceptions when it comes to the employment of deception or harmful manipulations, deception about harm is strictly forbidden: “Psychologists do not deceive prospective participants about research that is reasonably expected to cause physical pain or severe emotional distress” (p. 11).
The APA's (2017) Code of Conduct unambiguously prohibits deception about harm. Unfortunately, other aspects are less clear and lack important information: (1) the threshold of ‘significant prospective value’ that justifies deception is not exemplified, (2) it remains unclear what ‘unavoidable’ harm means, and, (3) how ‘harm’ and ‘severe emotional distress’ are defined. Do those terms apply to participants leaving the laboratory in a bad mood, or do they rather refer to drastic symptoms like intrusions and nightmares? (4) Furthermore, it is unclear what “[w]hen psychologists become aware that research procedures have harmed a participant” (APA, 2017, p. 12) means. Does ‘become aware’ imply a passive approach, where researchers can use manipulations until participants contact them with complaints? Or is it an active process that mandates researchers to assess participants’ well-being? (5) Finally, it remains unclear what the ‘reasonable steps’ entail that researchers are obligated to take in order to avoid harming participants in the first place and to then minimize potential harm after the end of an experimental session. For the latter purpose, debriefings have become common practice but as the present research demonstrates, they may not always be effective in fulfilling this task. In fact, disclosure of deception is the only straightforward part of a debriefing procedure since researchers are, by definition, aware of whether they have deceived and thereby wronged participants. Whether participants have also been harmed may remain less clear because ‘harm’ is not properly defined and because it is uncommon to measure participants’ well-being at the end of an experiment.
There is a need for more precise guidelines that elaborate on what the research community tolerates and what it does not 3 . The development of such guidelines should involve experts on the topics of well-being and ethics, individuals outside the psychological field, as well as participants’ opinions on what they find acceptable or unacceptable. This will be no easy task as ethical views even within disciplines can vary widely. For instance, reactions to the present research have ranged from “this is shocking” to “frankly, I don’t see the problem”. Such diverging ethical opinions about psychological research are by no means a new phenomenon (e.g., Baron, 1981; Baumrind, 1971; Brock & Becker, 1966; Gergen, 1973; Johnson, 1974; Lichtenstein, 1970; Perry & Abramson, 1980).
Some of the criticisms of ostracism studies and deception more generally also apply to the present work. To investigate the effectiveness of debriefings in ostracism studies as they are often conducted requires conducting these studies in this particular way. We nonetheless designed the present research as ethically sound as possible and in adherence to the Code of Conduct as well as our country's ethics code. Specifically, we used comprehensive informed consent (including information about potential distress, i.e., we did not employ deception about harm), provided contact information of a trained psychologist in case of any distress, carefully considered whether our research could be conducted without deception (which unfortunately was not possible), and employed an effective debriefing procedure according to prior research (McFarland et al., 2007). However, we do not wish to claim that our research is entirely unproblematic solely because we adhered to different ethics codes. We do not want to disavow our responsibility, but we believe in the importance of investigating the effectiveness of a procedure that is thought of as a tool to ameliorate harm and that is therefore crucial when it comes to adhering to ethical guidelines. We also believe in the relevance of highlighting potential ethical problems. Unfortunately, in an empirical field, this is only possible through empirical evidence for those ethical problems.
Best Practice
The trust of researchers in the effectiveness of debriefings to reestablish pre-study conditions of participants (e.g., well-being) might not be justified. Therefore, a well-rounded risk assessment of whether study participation might impair participants’ well-being and how it can be restored is needed. Screening procedures could help exclude particularly vulnerable participants (see Lajoie et al. 2020). Potential participants have to be fully educated in advance about possible risks of participating in a research study and the potential perseverance of certain experiences. For studies where deception is indispensable, this can mean to omit information about exactly what kind of unpleasant experiences the study will entail but to still inform participants about the fact that they might occur. Furthermore, researchers should only rely on debriefing procedures that have previously proven effective, report their details in the publication, and assess well-being after a study to double-check on the debriefing's effectiveness.
Independent of these recommendations, researchers should question whether they strictly need to use a potentially stressful manipulation or if they can achieve their research goals in alternative ways. If researchers opt for a potentially unpleasant manipulation, they should include a statement in the publication explaining the decision to do so. We suggest a similar approach when it comes to deception. The fact that in some cases there are nondeceptive alternatives is often overlooked (Hilbig et al., 2022). We encourage careful consideration of whether deception is strictly necessary for a specific study. Ultimately, it is a trade-off between being able to answer certain research questions and using manipulations involving deception and/or discomfort for participants; we propose that this trade-off be transparently addressed.
Research Agenda
Considering the perseverance of distress caused by experimental ostracism, the question arises how participants’ well-being can be restored. So far, there exist no official guidelines on how to debrief participants in general (Brody et al., 2000) and how to successfully eliminate the effects of ostracism manipulations in particular. Obviously, the fact that in the present research, the Revised Outcome Debriefing was not sufficient to eliminate the adverse effects caused by experimental ostracism, does not imply that other procedures are not effective either. However, recent research suggests that even extensive personal debriefing procedures may not be sufficient to fully reestablish well-being after ego-threat (Miketta & Friese, 2019). Thus, the identification of an effective, suitable debriefing procedure (not only for ostracism manipulations) remains an important goal for future research.
We advocate for post-experimental debriefings and interventions to be thought of as a second manipulation instead of a mere formality. Taking this perspective, to undo the effects of the original manipulation this second manipulation has to at least match the original manipulation in strength and effectiveness. Recent work has shown that interactive consent forms increased participants’ understanding of and engagement with those forms (Geier et al., 2021). Perhaps this approach can be incorporated into the design of debriefing procedures. One possible option may be developing ameliorating interventions that take place after the correction of potential deception. We propose an affirmative task that is specifically tailored to the respective dimension that has been threatened during an experiment. In the context of ostracism, this may involve asking participants to write a brief text about instances in their lives where they have experienced being liked and well-integrated within groups. A positive aspect of this approach is that it would not involve any deceptive elements and would instead solely rely on participants’ real-life experiences.
Educational Implications
Although the empirical evidence for the effectiveness of debriefings to restore participants' well-being is scarce, researchers still rely on it. To address this contradiction in education, students should be informed about potential consequences of study participation. This may include reading accounts of former participants and getting the opportunity to experience unpleasant manipulations themselves (after being fully informed about the possible consequences, of course). Furthermore, lectures should establish the linkage between perseverance of experimental effects and related psychological phenomena, like persistence of fake news, retracted scientific findings or rumors.
Conclusion
Ostracism caused distress, indicated by less positive and more negative mood. A debriefing procedure failed to restore participants’ mood. Undesired study-related affect and mental preoccupation with the study persisted for at least several hours. These findings raise questions about research ethics and encourage future research on finding an effective remedy for adverse effects of experiencing ostracism in research studies.
Supplemental Material
sj-docx-1-jre-10.1177_15562646241227065 - Supplemental material for When a Negative Experience Sticks With You: Does the Revised Outcome Debriefing Counteract the Consequences of Experimental Ostracism in Psychological Research?
Supplemental material, sj-docx-1-jre-10.1177_15562646241227065 for When a Negative Experience Sticks With You: Does the Revised Outcome Debriefing Counteract the Consequences of Experimental Ostracism in Psychological Research? by Stefanie Miketta and Malte Friese in Journal of Empirical Research on Human Research Ethics
Footnotes
Acknowledgements
We wish to thank Tian Neuke for his help with formatting and proofreading the manuscript as well as our students for their assistance in collecting the data.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
Supplemental Material
Supplemental material for this article is available online.
Notes
Author Biographies
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
