Abstract
Illusory causation is a phenomenon in which people mistakenly perceive a causal relationship between a cue and outcome even though the contingency between them is actually zero. Illusory causation studies typically use a unidirectional causal rating scale, where one endpoint refers to no relationship and the other to a strongly positive causal relationship. This procedure may bias mean causal ratings in a positive direction, either by censoring negative ratings or by discouraging participants from giving the normative rating of zero, which is at the bottom extreme of the scale. To test this possibility, we ran two experiments that directly compared the magnitude of causal illusions when assessed with a unidirectional (zero–positive) versus a bidirectional (negative–zero–positive) rating scale. Experiment 1 used high cue and outcome densities (both 75%), whereas Experiment 2 used neutral cue and outcome densities (both 50%). Across both experiments, we observed a larger illusory causation effect in the unidirectional group compared with the bidirectional group, despite both groups experiencing the same training trials. The causal illusions in Experiment 2 were observed despite participants accurately learning the conditional probabilities of the outcome occurring in both the presence and absence of the cue, suggesting that the illusion is driven by the inability to accurately integrate conditional probabilities to infer causal relationships. Our results indicate that although illusory causation is a genuine phenomenon that is observable with either a unidirectional or a bidirectional rating scale, its magnitude may be overestimated when unidirectional rating scales are used.
Introduction
The ability to learn causal relationships between events is crucial to how we interact with the world. An essential component of assessing causal relationships is to determine the contingency between events: how regularly and reliably one event follows another. For example, in determining whether a medicine causes recovery from a headache, one would compare the likelihood of recovery when they took the medicine to the likelihood when they did not take the medicine. If recovery occurred more often when medicine was taken compared to when it was not, one would infer a
Generally, people are sensitive to distinguishing between different positive and negative contingencies (Shanks & Dickinson, 1987; Wasserman, 1990). However, people seem to have difficulty when learning about null contingencies and can develop the false belief that random events are causally related (Alloy & Abramson, 1979). This phenomenon, named illusory causation, has been replicated in a variety of contingency learning tasks (see Matute et al., 2015 for review). A common feature of experiments that generate the illusory causation effect is their use of high cue and outcome densities in their designs. That is, there is a higher ratio of trials where the cue is present compared with trials where the cue is absent (Blanco et al., 2013; Hannah & Beneteau, 2009; Perales et al., 2005) as well as a higher occurrence of trials where the outcome is present (Allan & Jenkins, 1983; Jenkins & Ward, 1965; Vallee-Tourangeau et al., 2005), both of which have been shown to increase the magnitude of causal illusions. Illusory causation is particularly interesting as it provides an in-lab analogue of how people may form fallacious beliefs in real-world contexts with high cue and outcome density, such as the efficacy of pseudomedicines (Torres et al., 2020), stereotype formation (Hamilton & Gifford, 1976), and superstitious beliefs (Blanco et al., 2015). For example, homoeopathic pills can be taken frequently as they are readily available, allowing for a high cue density environment. Furthermore, medical conditions such as headaches typically fluctuate in magnitude and are short-lasting, thus providing a high outcome density. As such, the efficacy of pseudomedicines such as homoeopathic treatments can be conflated with these natural recoveries, leading to a high density of misattributed recovery outcomes.
Experiments investigating illusory causation are commonly conceptualised in terms of a 2 × 2 contingency table, which classifies trials by the presence and absence of a binary cue (potential cause) and a binary outcome (see Table 1). Framing trials in such a way allows experimenters to calculate and manipulate Δ
Table 1. Typical 2 × 2 contingency table used to design causal learning experiments.
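As a minimal sketch of the contingency calculation, assuming the conventional cell labels for a 2 × 2 table (a = cue and outcome, b = cue and no outcome, c = no cue and outcome, d = no cue and no outcome) and the standard definition ΔP = P(O|C) − P(O|¬C), the Python function below computes ΔP together with cue and outcome density. The function name `contingency_summary` and the example cell counts are illustrative and are not taken from the experimental code; the counts are simply one 16-trial block that yields 75% cue density, 75% outcome density, and a null contingency.

```python
from fractions import Fraction


def contingency_summary(a, b, c, d):
    """Summarise a 2 x 2 contingency table.

    Assumed cell labels (illustrative, not taken from the source table):
      a: cue present, outcome present
      b: cue present, outcome absent
      c: cue absent,  outcome present
      d: cue absent,  outcome absent
    Requires at least one cue-present and one cue-absent trial.
    """
    total = a + b + c + d
    p_outcome_given_cue = Fraction(a, a + b)
    p_outcome_given_no_cue = Fraction(c, c + d)
    return {
        "delta_p": p_outcome_given_cue - p_outcome_given_no_cue,
        "cue_density": Fraction(a + b, total),
        "outcome_density": Fraction(a + c, total),
    }


# Illustrative 16-trial block with high densities and a null contingency.
high_density = contingency_summary(a=9, b=3, c=3, d=1)
print(high_density["delta_p"])          # 0
print(high_density["cue_density"])      # 3/4
print(high_density["outcome_density"])  # 3/4
```

Exact fractions are used so that a null contingency evaluates to exactly zero rather than to a small floating-point residue.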
A common feature of illusory causation experiments is the use of unidirectional scales to measure participants’ causal judgements. A typical causal rating question with a unidirectional scale involves asking participants to indicate how effective the cue is at producing an outcome by choosing numerical values on a labelled slider scale ranging from 0 (not effective at all) to 100 (totally effective; e.g., Barberia et al., 2019; Blanco et al., 2011; Matute et al., 2011). A bidirectional scale, on the other hand, extends the unidirectional scale to include both causal and preventive (e.g., −100; totally preventive) judgements. Blanco and Matute (2020) argue against the use of bidirectional scales in favour of unidirectional scales in illusory causation experiments. Many illusory causation experiments present a medical scenario instructing participants to assume the role of a patient or doctor in a clinical trial and to observe and determine the effectiveness of an experimental drug (Barberia et al., 2019; Chow et al., 2019; Matute et al., 2011; Yarritu et al., 2014). Blanco and Matute (2020) posit that participants may be confused by the idea that medicine could worsen symptoms. However, it is not difficult to imagine a situation where an experimental drug may not only be ineffective but also potentially prevent a patient’s recovery. In fact, Lee and Lovibond (2021) have demonstrated that participants can learn and report preventive relationships in a food allergist paradigm despite the counterintuitive concept of foods preventing allergic reactions.
There are several issues that may arise when using unidirectional rating scales to measure causal judgements and illusory causation. First, unidirectional scales do not allow participants to report preventive relationships. By effectively censoring one section of the distribution of participants’ possible causal beliefs, unidirectional scales are potentially biasing the mean rating in a positive direction. It has also been demonstrated that participants are often reluctant to make responses at the extremes of rating scales (see Baumgartner & Steenkamp, 2001 for review). This is especially problematic for studies of illusory causation in which the normative rating is zero (i.e., no relationship), which lies at the bottom extreme of a unidirectional scale. In a study directly comparing rating scales, Neunaber and Wasserman (1986) found that presenting participants with a bidirectional scale in an instrumental contingency learning task led to more accurate and sensitive estimates of contingencies (including null contingencies) compared to unidirectional scales, and participants were less biased by outcome density effects. The bidirectional scale points ranged from preventive to causal, whereas the unidirectional scale ranged from “no control” to “complete control” over the outcome. Interestingly, participants under the unidirectional condition made more accurate estimates of contingencies when presented with additional instructions that the relationship between their actions and the target outcome could be either causal or preventive. This implies that participants may not spontaneously consider the full range of potential relationships when making causal judgements on a unidirectional scale. Applying these findings to illusory causation paradigms, unidirectional scales may therefore bias participants who would have correctly rated the contingency as null towards reporting positive responses. 
Furthermore, participants who believe there is a small positive relationship may be encouraged to report even more positive responses, which could overestimate the actual size of the illusory causation effect.
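The censoring argument above can be illustrated with a small sketch. It assumes, purely for illustration, that a participant whose belief is negative can do no better than report zero on a unidirectional scale; the belief values and the helper `censor_at_zero` are hypothetical and do not come from the experiments.

```python
from statistics import mean


def censor_at_zero(belief):
    """Map a belief onto a zero-to-positive scale by clipping negatives to 0.

    This clipping rule is an assumed, simplified model of how a
    unidirectional scale censors preventive beliefs.
    """
    return max(0, belief)


# Hypothetical beliefs on a -100..100 scale, symmetric around the normative 0.
beliefs = [-40, -20, 0, 20, 40]

bidirectional_mean = mean(beliefs)
unidirectional_mean = mean(censor_at_zero(b) for b in beliefs)

print(bidirectional_mean)   # 0
print(unidirectional_mean)  # 12
```

Even though the underlying beliefs average exactly zero, the censored ratings average above zero, which is the positive bias in the mean that the text describes. Reluctance to use the scale extreme would add to this bias but is not modelled here.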
Although previous studies have already investigated potential problems regarding the use of unidirectional scales in causal learning tasks, some issues remain unresolved. Neunaber and Wasserman (1986) directly compared bidirectional and unidirectional scales, but they presented participants with multiple contingency problems in a within-subjects design. Participants tend to perform more accurately in contingency learning tasks when presented with multiple contingency problems including null contingencies (Allan & Jenkins, 1980; Neunaber & Wasserman, 1986; Shanks & Dickinson, 1987). This improved performance is likely due to participants benefitting from the opportunity to compare different contingencies and calibrate their causal judgements in these tasks. Studies of illusory causation, by contrast, tend to use between-subjects designs (Chow et al., 2019; Matute et al., 2011; Vadillo et al., 2013), as they are primarily interested in investigating conditions that produce and eliminate causal illusions. These studies often use a standard group trained with a single null contingency as a control to compare with a group that receives a manipulation designed to reduce causal illusions.
Interestingly, Blanco et al. (2013) also found that participants judged null contingency problems more accurately when tested on a bidirectional rating scale. They surprisingly observed a positive bias for a non-contingent cue and outcome pair in a low (20%) cue and outcome density contingency learning task when recording causal judgements with a unidirectional rating scale. This effect did not replicate in a subsequent experiment, where participants tested on a bidirectional scale gave more normative responses (i.e., causal ratings close to 0) for a low cue and outcome density null contingency problem. Similarly, Blanco and Matute (2020) compared a procedure modelling pseudotherapy for a spontaneously recovering disease with a standard illusory causation paradigm typically used as a control group. They observed causal illusions in their control group only with a unidirectional scale, whereas the effect disappeared in a follow-up experiment using a bidirectional scale. The results of these two studies imply that participants may have genuinely believed a non-contingent cue and outcome were unrelated but were biased towards making positive causal judgements on a unidirectional rating scale. However, Blanco et al. (2013) and Blanco and Matute (2020) did not aim to investigate the influence of rating scales, and additional methodological changes between experiments (e.g., task cover story) complicate cross-experiment comparisons and any inferences about potential biases of unidirectional scales.
Given that illusory causation paradigms are used to explain how people may misinterpret causal relationships, it is crucial that experiments use an unbiased and sensitive measurement to investigate this phenomenon. As the overall aim of such studies is to eventually identify conditions that can eliminate causal illusions in real-world contexts, we need to ensure first that we are accurately measuring the strength of causal illusions in lab paradigms. Overestimating participants’ causal judgements in illusory causation studies can lead to misinterpretations from combining participants who are unsure or believe there is a null contingency with other participants who genuinely hold an erroneous belief that a causal relationship exists.
Experiment 1
Experiment 1 aimed to determine whether the effect of rating scales extends to illusory causation paradigms, in which a contingency bias is induced by high cue and outcome density. We tested this by directly comparing independent groups of participants given either a bidirectional or unidirectional response scale in an illusory causation paradigm with a null contingency. Although previous studies have observed differences in causal judgements for contingency problems when measuring with bidirectional and unidirectional rating scales, this experiment directly compared how presenting these rating scales can affect causal judgements in the same causal scenario with a single null contingency problem. Presenting a single contingency problem prevents participants from comparing different contingencies within the same task to calibrate their judgements and allows us to isolate the potential impact of the rating scales. We hypothesised that participants would report stronger causal relationships between the cue and outcome when presented with a unidirectional scale compared to a bidirectional scale at test, despite experiencing the same null contingency during training.
Method
Participants
This study was conducted online with participants recruited from the online platform Prolific. Participants were required to be fluent in English and have a minimum Prolific approval rate of 90%. A total of 119 participants were recruited (48 female, 69 male, 2 non-binary, M age = 27.87,
Materials
The experiment was coded in JavaScript using the jsPsych library (De Leeuw, 2015) and was hosted on a JATOS server for online distribution and data storage (Lange et al., 2015). Participants were required to complete the experiment on their personal desktop computers. The stimuli used were 200 × 200 px images representing a light in an on position (blue circle) and an off position (grey circle), as well as a 250 × 250 px electric shock symbol. The stimuli, experimental code and Supplementary Materials can be found at https://osf.io/3ukbp/.
Procedure
This study was approved by the University of New South Wales Human Research Ethics Committee (HREAP #3316). Before starting the task, participants were presented with an online information statement containing broad-level details of the overall study. Participants declared consent to the experiment by selecting a checkbox to acknowledge they had read and understood the information statement and would like to participate in the study. In the experimental task, participants were presented with a hypothetical scenario in which a fictional Mr. X is investigating a strange machine that delivers electric shocks. They were told that the machine also has a light that periodically turns on and off, and were shown what the light looks like when on (blue) and off (grey). Participants were instructed to observe instances when the light was either on or off and a shock either occurred or not, to help Mr. X determine whether there is any relationship between the light and shock (see Supplementary Materials). This cover story was chosen over a medical scenario typically used in illusory causation studies (e.g., Matute et al., 2011), where the cue is the administration of medicine and the outcome is recovery from a disease. The task instructions used in these medical scenarios often focus on whether the medicine cures participants of a target disease, which may bias participants towards expecting a positive relationship. The light-shock scenario was intended to be more neutral by not hinting at the nature of the relationship (if any) between the light and shock and so could be less likely to bias participants into believing a positive causal relationship existed. The instructions were followed by an instruction check, in which participants were asked to identify the correct task instructions, as well as identify the light in the off position. If participants answered any questions incorrectly, they were timed out for 5 s before being allowed to attempt the questions again. Participants were only able to proceed to the training phase once they answered both questions correctly.
In the training phase, participants observed a pseudorandomized sequence of 32 trials separated into 2 blocks. Each block of 16 trials was fully randomised within each block and consisted of 9
At the end of training, participants were asked to make a causal judgement rating, in which the on light was displayed along with a sliding response scale. The unidirectional group was asked “To what extent does this light cause shock?,” and the response scale was labelled “Has no effect on shock” at the left extreme and “Definitely causes shock” at the right extreme (see Figure 1, top panel). The bidirectional group was asked “To what extent does this light cause or prevent shock?,” and the response scale was labelled “Definitely prevents shock” at the left extreme, “Has no effect on shock” in the centre, and “Definitely causes shock” at the right extreme (see Figure 1, bottom panel). Participants made a response by dragging the slider to a point relative to the labels on the scale to indicate their causal beliefs. For data analysis, responses were converted to numerical ratings ranging from −100 to 100 for the bidirectional scale and 0 to 100 for the unidirectional scale. Scaling participants’ ratings in this way allowed us to observe and compare responses in both groups despite the additional response range on the bidirectional scale. Following the causal judgement question, both groups responded to two frequency questions that asked them to predict how many shocks Mr X would experience if the light was

Figure 1. Causal rating questions in Experiment 1.
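The conversion of slider responses to numerical ratings described above (0 to 100 for the unidirectional scale, −100 to 100 for the bidirectional scale) could look roughly like the sketch below. It assumes that the raw response is stored as a proportion of the slider track from the left label (0.0) to the right label (1.0); that encoding and the function name `slider_to_rating` are hypothetical and may differ from what the experimental code actually records.

```python
def slider_to_rating(position, scale):
    """Convert a slider position in [0, 1] to a causal rating.

    `position` is assumed to be the proportion of the slider track from the
    left label (0.0) to the right label (1.0); this encoding is hypothetical.
    """
    if not 0.0 <= position <= 1.0:
        raise ValueError("position must lie between 0 and 1")
    if scale == "unidirectional":
        # Left = "Has no effect on shock" (0), right = "Definitely causes shock" (100).
        return 100.0 * position
    if scale == "bidirectional":
        # Left = "Definitely prevents shock" (-100), centre = no effect (0),
        # right = "Definitely causes shock" (100).
        return 200.0 * position - 100.0
    raise ValueError(f"unknown scale: {scale!r}")


# "Has no effect on shock" sits at the left end of one scale and the centre of the other.
print(slider_to_rating(0.0, "unidirectional"))  # 0.0
print(slider_to_rating(0.5, "bidirectional"))   # 0.0
print(slider_to_rating(0.75, "bidirectional"))  # 50.0
```

Under this mapping the normative response of zero corresponds to different physical slider positions in the two groups, while positive ratings share the same 0 to 100 range.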
Results and discussion
All data cleaning was conducted using R (R Core Team, 2021). Subsequent analyses were conducted in R using the ez package (Lawrence, 2016) as well as in jamovi (The jamovi project, 2022). Of the 119 participants recruited, 62 participants were randomly allocated to the bidirectional group and 57 to the unidirectional group. As shown in Figure 2, both groups provided positive causal ratings overall, and a one-sample

Figure 2. Mean and individual causal ratings for the target cue in Experiment 1.
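A one-sample comparison of causal ratings against the normative value of 0 can be sketched as follows. This is not the authors' analysis code (the analyses were run in R and jamovi); it is a standard-library Python illustration of the one-sample t statistic, t = (mean − μ0) / (s / √n), applied to hypothetical ratings. The function name `one_sample_t` is illustrative.

```python
from math import sqrt
from statistics import mean, stdev


def one_sample_t(ratings, mu0=0.0):
    """Return the one-sample t statistic and its degrees of freedom.

    Tests whether the mean of `ratings` differs from `mu0` (0 is the
    normative causal rating under a null contingency). Assumes the ratings
    are not all identical, otherwise the standard error is zero.
    """
    n = len(ratings)
    if n < 2:
        raise ValueError("at least two ratings are required")
    standard_error = stdev(ratings) / sqrt(n)
    return (mean(ratings) - mu0) / standard_error, n - 1


# Hypothetical causal ratings, not data from the experiments.
ratings = [10, 20, 30, 40, 50]
t_value, df = one_sample_t(ratings)
print(round(t_value, 3), df)  # 4.243 4
```

Obtaining a p value additionally requires the t distribution, which the standard library does not provide; a statistics package such as SciPy (`scipy.stats.ttest_1samp`) would normally be used for that step.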
A one-sample t-test comparing all participants’ average responses to both frequency questions (

Figure 3. Mean frequency ratings in Experiment 1.
Participants’ frequency ratings can also be used to infer their perceptions of the trial frequencies and, therefore, compute each individual’s implied Δ
Figure 4 displays the relationship between participants’ causal ratings and their implied Δ

Scatterplot of implied Δ
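One way to derive an implied ΔP from the two frequency ratings is sketched below. The exact formula is an assumption: it treats each frequency rating as an estimate, out of 100, of how often the outcome occurs with the cue present or absent, and takes the difference between the two estimated proportions. The function name `implied_delta_p` and the example ratings are hypothetical.

```python
def implied_delta_p(freq_cue_present, freq_cue_absent, max_rating=100):
    """Implied contingency from two frequency ratings.

    Assumes each rating is an estimate (out of `max_rating`) of how often
    the outcome occurs when the cue is present or absent, so that
    implied delta-P = P(O | cue) - P(O | no cue). This definition is an
    assumption made for illustration.
    """
    for value in (freq_cue_present, freq_cue_absent):
        if not 0 <= value <= max_rating:
            raise ValueError("ratings must lie between 0 and max_rating")
    return (freq_cue_present - freq_cue_absent) / max_rating


# Accurate estimates under 75% outcome density imply a null contingency.
print(implied_delta_p(75, 75))  # 0.0
# Underestimating outcome frequency in the absence of the cue implies a positive contingency.
print(implied_delta_p(75, 50))  # 0.25
```

The second call shows how underestimating the outcome frequency on cue-absent trials, as reported for Experiment 1, would translate into a positive implied contingency even when the programmed contingency is zero.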
Discussion
In Experiment 1, we successfully replicated the illusory causation phenomenon in a typical causal learning paradigm with a high cue and outcome density. The strength of the illusion in the unidirectional group was comparable to other illusory causation studies that employed similar cue and outcome densities and a unidirectional rating scale (
Another interesting finding from Experiment 1 is that both groups underestimated outcome frequency in the absence of the cue. This result is consistent with findings reported by Barberia et al. (2019), who also found that participants more accurately estimated outcome frequency in the presence of the cue and underestimated outcome frequency in the absence of the cue. Nonetheless, we did find that participants’ implied Δ
These findings have implications for theories of illusory causation. In particular, low ratings for the frequency of the outcome in the absence of the cue are consistent with causal judgement models which posit that people weigh
Experiment 2
In Experiment 2, we aimed to extend our investigation of the effect of rating scales on causal judgements by comparing the two scales in the same task as Experiment 1 but with 50% cue and outcome densities. These densities can be seen as neutral, in the sense that there are equal numbers of
Method
The Method for Experiment 2 was the same as Experiment 1, unless otherwise specified below.
Participants
A total of 240 participants were recruited on the Prolific platform (104 female, 131 male, 4 non-binary,
Procedure
Experiment 2 followed the same procedure as Experiment 1, except for the change in cell frequencies for the 2 training blocks. Participants now observed a 50% cue and 50% outcome density trial structure. That is, participants saw two training blocks, each consisting of 4
Results and discussion
Of the 240 participants recruited, 118 were randomly allocated to the bidirectional group, and 121 were randomly allocated to the unidirectional group. One participant’s data file was lost due to technical issues during the experiment and was excluded from analysis. As shown in Figure 5, both groups again provided positive causal ratings overall. However, both group means were lower compared to Experiment 1, as expected given the lower cue and outcome densities of Experiment 2. Nonetheless, a one-sample t-test comparing causal ratings to 0 reached significance for both the bidirectional group,

Figure 5. Mean and individual causal ratings for the target cue in Experiment 2.
Figure 6 shows participants’ frequency predictions in Experiment 2. Critically, the underestimation of outcome frequency for cue-absent trials observed in Experiment 1 was not replicated in Experiment 2. In fact, both groups made ratings that were relatively close to the normative response of 50 (i.e., 50% outcome density) for both cue-present and cue-absent trials. A one-sample t-test comparing all participants’ average frequency ratings (

Figure 6. Mean frequency ratings for cue-present and cue-absent trials in Experiment 2.
Likewise, participants’ implied Δ

Scatterplot of implied Δ
General discussion
Over two experiments, we compared the effect of unidirectional and bidirectional rating scales in assessing participants’ causal judgements in a zero contingency learning task. We observed causal illusions with both scales, regardless of whether cue and outcome density were high (both 75%, Experiment 1) or neutral (both 50%, Experiment 2). However, mean causal ratings were significantly higher (less normative) in the group that gave ratings on a unidirectional scale compared to the group that gave ratings on a bidirectional scale.
There are at least three factors that may account for these group differences in causal ratings. First, the nature of the unidirectional rating scale censored negative responses, restricting variance in participants’ causal judgements to be distributed among the positive section of the scale range (see Figures 4 and 7). This pattern was most apparent in Experiment 2, where a substantial number of negative ratings were observed in the bidirectional group, but not of course in the unidirectional group, even though the groups had been exposed to identical training and presumably had similar causal beliefs. Second, the normative response of zero was located in the middle of the scale for the bidirectional group compared to an extreme of the scale in the unidirectional group. Thus, participants in the unidirectional group may have been less willing to report a zero contingency compared to the bidirectional group because it was located at a scale extreme (Baumgartner & Steenkamp, 2001). In fact, for a normative mean causal rating to be observed in the unidirectional group,
An additional goal in Experiment 2 was to determine whether this illusion could be attributed to an underweighting of
The present results have several implications for our interpretation of the existing illusory causation literature and how we should investigate the phenomenon moving forward. Given that illusory causation paradigms are presented as an analogue of how people come to form fallacious beliefs, it is critical that we are accurately capturing participants’ causal judgements within our experiments before applying findings to real-world contexts. The majority of published experiments employing illusory causation paradigms have used unidirectional rating scales (e.g., Barberia et al., 2019; Blanco et al., 2011; Matute et al., 2011), which may have complicated identification of the conditions that generate causal illusions. Our results also have implications for studies that aim to eliminate or reduce causal illusions. For example, Barberia et al. (2019) found that extended training reduced the strength of causal illusions but did not completely eliminate them. It is possible that the efficacy of such manipulations could be understated when measured with unidirectional rating scales, similar to our findings in Experiment 2 where a reduction in cue and outcome density appeared to produce a greater decrease in causal ratings for the bidirectional group compared to the unidirectional group.
In conclusion, our results reveal that although illusory causation is a genuine phenomenon, a proportion of the effect may be due to the use of a unidirectional rating scale. Participants’ true belief in a causal relationship between illusory cues and outcomes in these paradigms may therefore be weaker than previous experiments imply. Our results suggest that the bias associated with unidirectional rating scales is a general one that is independent of cue and outcome density effects. As such, we recommend using bidirectional scales in future causal illusion or contingency learning research to capture a more accurate and unbiased representation of participants’ causal judgements.
Supplemental Material
sj-docx-1-qjp-10.1177_17470218231175003 – Supplemental material for Unidirectional rating scales overestimate the illusory causation phenomenon
Supplemental material, sj-docx-1-qjp-10.1177_17470218231175003 for Unidirectional rating scales overestimate the illusory causation phenomenon by David W Ng, Jessica C Lee and Peter F Lovibond in Quarterly Journal of Experimental Psychology
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was funded by an Australian Research Council Discovery Project Grant #DP190103738 awarded to Peter Lovibond. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
