Abstract
Background
Diagnostic reasoning requires clinicians to think through complex uncertainties. We tested the possibility of a bias toward an available single diagnosis in uncertain cases.
Design
We developed 5 different surveys providing a succinct description of a hypothetical individual patient scenaric. Each scenario was formulated in 2 versions randomized to participants, with the versions differing only in whether an alternative diagnosis was present or absent. The 5 scenarios were designed as separate tests of robustness using diverse cases, including a cautious scenario, a risky scenario, a sophisticated scenario, a validation scenario, and a comparative scenario (each survey containing only 1 version of 1 scenario). Participants included community members (
Results
The first scenario described a cautious patient and found a 47% reduction in the estimated odds of COVID when a flu diagnosis was present compared with absent (odds ratio = 0.53, 95% confidence interval 0.30 to 0.94,
Limitations
Brief hypothetical scenarios may overestimate the extent of bias in more complicated medical situations.
Conclusions
These results demonstrate that an available simple diagnosis can lead individuals toward premature closure and a failure to fully consider additional severe diseases.
Highlights
Occum’s razor has been debated for centuries yet rarely subjected to experimental testing for evidence-based medicine.
This article offers direct evidence that people favor an available simple diagnosis, thereby neglecting to consider additional serious diseases.
The bias can lead individuals to mistakenly lower their judged likelihood of COVID or another disease when an alternate diagnosis is present.
This misconception over the laws of probability appears in judgments by community members and by health care workers.
The pitfall in reasoning extends to high-risk cases and is not easily attributed to information, incentives, or random chance.
Keywords
Introduction
The principle of diagnostic parsimony has endured for centuries and is popularized by the term
The severe acute respiratory syndrome coronavirus 2 pandemic starting 2019 (termed COVID) provides a new opportunity to examine medical judgment under uncertainty because symptoms are ambiguous, test results can take time, a definitive diagnosis is eventually established, and outcomes can be serious. Moreover, the presence of one virus does not exclude the possibility that a second infection is present simultaneously. 5 Epidemiology suggests, furthermore, that co-infection from different pathogens is frequent because of common risk factors such as crowded living spaces, lapses in hand hygiene, social interactions, insufficient distancing, and other health determinants. 6
Undue deference to Occam’s razor can thus lead to oversimplification, premature diagnostic closure, and potentially faulty patient care. 7 Direct tests of Occam’s razor in judgment under uncertainty, however, rarely appear in evidence-based medicine (beyond editorial commentary and case reports).8,9 Here, we tested whether judgments about COVID infection show a bias toward diagnostic parsimony. Specifically, we hypothesized that an available alternative explanation might lead individuals to underestimate the likelihood of a COVID diagnosis. A randomized test of this fallacy may help motivate greater awareness and more effective medical care.
Methods
Overview
We developed survey materials using methods adapted from behavioral decision science. This involved creating different scenarios that each provided a succinct description of an individual patient who had symptoms suggestive of COVID. When appropriate, each scenario also provided the population baseline norm (average risk) of COVID relevant to the setting. The purpose of each scenario was to elicit judgments about the relative likelihood of COVID in the hypothetical individual patient. The question wording was, “What are the chances that [the patient] has COVID?” The 5 response options were “definitely below the norm,” “somewhat below the norm,” “about the same as the norm,” “somewhat above the norm,” and “definitely above the norm.”
Each scenario was formulated in 2 different versions, defined the
Five other survey features were designed to minimize bias. First, each scenario was conceived based on direct clinical experience to make materials original, succinct, clear, meaningful, and relevant. Second, each survey was conducted during active waves of the COVID pandemic to check robustness, provide realistic population norms, and respect community priorities. Third, the scenarios described an individual who was young and had no comorbidities to reduce the need for medical expertise and minimize complexity. Fourth, each respondent was assigned only 1 version of 1 scenario to avoid carryover artifacts and survey fatigue. Fifth, no identifying demographic information on respondents was collected to maintain confidentiality and reduce respondent workload.
Participants
We surveyed different participant groups primarily from North America. The largest group was community participants recruited by the Qualtrics Survey platform (https://www.qualtrics.com/lp/survey-platform/). Another group was health care workers surveyed in person during a mandatory 15-min observation interval immediately after receiving a COVID vaccination. The overall intent was to elicit judgments from potential patients as well as from health care workers who were experienced, motivated, informed, and engaged. Surveys for community participants were conducted in September 2020 (second wave of the pandemic) and April 2021 (after the third wave had subsided); surveys for health care workers occurred in February 2021 (initial vaccine rollout).
All participants completed the survey at their own pace (with a planned time to completion of less than 2 min) and returned responses anonymously (with no formal debriefing). Each participant was blinded to the hypothesis, unaware of alternative versions, and given only a single scenario. Community participants were compensated at the standard internet platform rate ($15 per 1h, equal to $1 per 4 min), whereas health care workers received no incentives. The introductory script for healthcare workers was, “I am hoping you might complete a short survey. It is voluntary, intended for research, and needs no identifying data. Your involvement, results, or lack of participation will not affect your status here at Sunnybrook.
The study was approved by the Research Ethics Board of Princeton University and by the Sunnybrook Research Institute (file No. 3798). The approval included a waiver for signed consent, with consent expressed by checking the appropriate box on the survey platform or by willingness to complete the short paper survey. The community participants received the survey online, and the health care group received the survey on paper in person. Patients and the public were not involved in study design or reporting. At the time of the surveys, PCR testing was the primary method for diagnosing COVID in the community, with a typical 1- to 2 day delay for reporting. Community respondent demographics are summarized in the Appendix.
Statistical Analysis
The primary analysis compared responses in the 2 versions of each scenario, expressed as the proportion of respondents who judged the patient’s chance of COVID infection as below the norm. Statistical analyses used the chi-square test and the odds ratio as a consistent measure of effect size (analyses using the Mann-Whitney test to account for the full nonparametric distribution yielded similar results and are not reported). Sample size calculations were designed to provide an 80% power (beta = 0.20) for detecting an effect size of at least 15% (baseline 25% reduced to 10%). The analytic plan was specified in advance, and each scenario was considered an independent test. All
Results
Cautious Scenario
The cautious scenario involved a man who followed public health COVID guidelines and maintained cautious behavior. The Mr. FR is a 25-year-old man who develops a runny nose, slight cough, and general fatigue. He has no thermometer and does not know his temperature. He has been very careful isolating, not socializing, and always with a mask. Still, he is worried about COVID and goes to be tested (results in 1–2 days). At his location, the norm is 3% of COVID tests are positive. FR is also tested for influenza; those results come back fast, and those results are
A total of 199 individuals responded, of whom 101 and 98 received the
Risky Scenario
The risky scenario involved a man who failed to follow standard guidelines and exhibited risky behaviors. The Mr TR is a 25-year-old man who develops a runny nose, slight cough, and general fatigue. He has no thermometer and does not know his temperature. He has been socializing a lot recently, often indoor, not sufficiently distancing, and without a mask. He is worried about COVID and goes to be tested (results in 1–2 days). At his location, the norm is 3% of COVID tests are positive. TR is also tested for influenza, those results come back fast, and those results are
A total of 207 individuals responded, of whom 104 and 103 received the
Sophisticated Scenario
The sophisticated scenario was presented to health care workers and involved a woman who faced occupational risks. The SR is a 27-year-old ICU nurse who has not received a COVID vaccination yet. She now has a fever, fatigue, and sore throat. SR is tested for COVID and will receive results in 1–2 days. At her location, an average of 10% of COVID tests are positive. SR is also tested for mononucleosis, those results are fast, and those results are
A total of 200 individuals responded, of whom 100 each received the
Validation Scenario
A validation scenario provided nearly no descriptive features and directly tested the impact of the availability of alternative explanations on judgments. The Mr FR is a 25-year-old man who develops a runny nose, slight cough, and general fatigue. He has no thermometer and does not know his temperature. He is worried about COVID and goes to be tested (results in 1–2 days). FR is also tested for influenza, those results come back fast, and those results are
A total of 202 community participants responded, of whom 102 received the
Comparative Scenario
A comparative scenario avoided any mention of population norms and simply compared between 2 contrasting individuals following an uncertain exposure. The MB and TA are close friends, 25 years old, who have just returned from a crowded outdoor concert. They are not vaccinated, they hear of others at the concert who tested positive for COVID, and they feel worried. MB has a mild headache, slight cough, and general fatigue. He goes to be tested (results pending), and in the meantime he is also tested for strep throat. Those results come back fast, and those results are
A total of 496 community participants responded, of whom 249 received the
Discussion
Overview
Through scenarios involving a spectrum of risk, we found that respondents judged the probability of COVID infection to be lower in the presence of another diagnosis compared with when an alternative diagnosis was absent. These judgments are problematic because a simple diagnosis (e.g., influenza) provides no protection against COVID infection. Co-occurrence of influenza and COVID is less likely when behaviors are cautious (scenario 1) than risky (scenario 2), but the chance of COVID does not decrease in the presence of another disease unless the two are negatively correlated. 10 Instead, our respondents consistently lowered their judged likelihood of COVID infection when an alternate diagnosis was present (Table 1). This bias might reduce a patient’s willingness to seek care and a clinician’s propensity to investigate a medical diagnosis.
Selective Justifications
Diagnostic parsimony is sometimes appropriate. Some anatomical disorders are mutually exclusive; for example, a diagnosis of micrognathia tends to exclude a diagnosis of macrognathia. Similarly, many metabolic disturbances cannot coexist; for example, a diagnosis of Addison’s disease excludes a diagnosis of Cushing’s syndrome. Other problems are confined to different populations; for example, a diagnosis of uterine cancer is unlikely to occur with a diagnosis of prostate cancer. And some issues are segregated across time; for example, a diagnosis of ectopic pregnancy is unlikely to occur with a diagnosis of osteoporosis. In many other cases, Occam’s razor will appear correct simply due to chance, because of the low base-rate frequency of most individual diseases. 11
Reliance on Occam’s razor is sometimes expedient. Heuristic thinking early in an epidemic may be justified by the lack of hard data for directly estimating disease probabilities. Reliance on simplicity is also encouraged by binary diagnostic algorithms and by billing systems that ask for a single diagnosis for a clinical encounter. Furthermore, exhaustive diagnostic testing can potentially yield adverse patient outcomes from iatrogenic harms, false positives, and wasted time. 12 A plurality of diagnoses may also be hard to defend when a patient has minimal symptoms, since double-trouble should otherwise lead to worse severity. Medical care might also prove overwhelming if clinicians considered all the complexity occurring each day. 13
Fundamental Misconception
Many diseases, however, cluster together (e.g., hypertension and diabetes). A diagnosis of one of these diseases is no time to appeal to parsimony and neglect combined comorbidities. In the initial years of the pandemic (when COVID tests were in short supply), guidelines mistakenly suggested the presence of the flu reduced the likelihood of a COVID diagnosis and prioritized testing only when influenza was absent. 14 Subsequent microbiology evidence contradicted this assumption and found similar rates of SARS-CoV-2 infection in patients with and without other pathogens. 15 Of course, the converse is also relevant: the presence of a COVID diagnosis does not guarantee the absence of another serious disease (e.g., tuberculosis). 15
Our study highlights a misconception about the laws of probability. The logic of Venn diagrams implies that an intersection set tends to be smaller than a single set; however, the arithmetic of conditional probability is more complex (Figure 1). As an analogy, consider that a patient’s probability of having a fever exceeds the probability of fever and fatigue; however, a patient’s probability of fever with fatigue can exceed the probability of fever without fatigue. Similarly, a patient’s risk of co-infection is necessarily less than the risk of having 1 infection; however, the risk of co-infection may easily exceed the risk of having 1 infection exclusively. Fallible intuitions often reflect a failure to think of the inverse conditional probability.16–18

Venn diagram logic. Venn diagram of potential infections in patients with respiratory symptoms. The black rectangle depicts the full span of diagnostic possibilities. The left circle depicts patients with COVID infection, the right circle depicts patients with other pathogens, the middle sector depicts patients with coinfections, and the surrounding space depicts patients with no infections. In this example, the area of the gray middle sector exceeds the area of the black crescent, thereby suggesting the likelihood of co-infection exceeds the likelihood of having COVID exclusively.
Microbiology surveillance studies frequently document co-infections in COVID patients. Some case series show that more than 50% of patients with COVID have co-infections.19–22 Case-control analyses show variable trends and almost none with a significant inverse association between other pathogens and COVID risk.23–28 One study of symptomatic patients (
Limitations and Extensions
Our study has limitations that merit mention. We tested different groups at different times, yet more replication is justified. Future research could also test whether the effect varies in different individuals. We examined diagnostic judgments in simple scenarios, whereas daily care is more complicated and challenging. We examined brief scenarios that allowed time for uninterrupted thought, whereas clinical decisions often involve distractions, uncertainties, and conflicting priorities. We examined one viral infection, whereas a bias toward simple diagnoses might extend more broadly. To be sure, our design renders the results difficult to attribute to a lack of education, sloppy responses, strategic ploys, fallible test results, atypical respondent samples, incentives, or chance.
Other pitfalls in diagnostic reasoning can reinforce an unduly parsimonious explanation. People tend to discount alternative explanations once a compelling account is available. 30 In addition, a focusing illusion can arise wherein a particular explanation or a popular misconception appears overly compelling due to repeated consideration. 31 A related pitfall occurs when information originally pursued and then confirmed receives more attention than information available from the start. 32 Teamwork can also lead to groupthink, including a tendency to focus on points shared by all rather than unique insights known to a few. 33 These and other factors may contribute to a failure to search for secondary diagnoses in patients who have notable acute or chronic diseases.34,35
Summary
Our results suggest that an available simple diagnosis can lead to premature closure and a failure to fully consider additional serious diseases. Like other heuristics, this shortcut in reasoning is often effective but sometimes mistaken. The appeal to Occam’s razor is justified mostly when the alternative diagnosis is inversely correlated or relatively rare. 36 In the case of COVID infection, overreliance on Occam’s razor can contribute to misplaced complacency, discourage the search for alternatives, and reduce attention to preventive health strategies. 37 The tension between diagnostic parsimony and precision is not easy to resolve, will continue beyond the COVID pandemic, and is worth keeping in mind when diagnosing patients during the next disease outbreak.
Supplemental Material
sj-docx-1-mdm-10.1177_0272989X221121343 – Supplemental material for The Fallacy of a Single Diagnosis
Supplemental material, sj-docx-1-mdm-10.1177_0272989X221121343 for The Fallacy of a Single Diagnosis by Donald A. Redelmeier and Eldar Shafir in Medical Decision Making
Footnotes
Acknowledgements
We thank Allan Detsky, Hank Farber, Fizza Manzoor, Sonieya Nagarajah, Kelvin Ng, Sheharyar Raza, Raffi Rush, Steven Shumak, and Billy Silverstein for helpful suggestions on specific points. The views expressed are those of the authors and do not necessarily reflect the Ontario Ministry of Health.
Correction (June 2023):
The funding organizations had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript. All authors have no financial or personal relationships or affiliations that could influence the decisions and work on this article. This project was supported by the Alfred P. Sloan Foundation (No. 2014-6-16), a Canada Research Chair in Medical Decision Sciences (No. 950-231316), the Canadian Institutes of Health Research (No. 436011), and the National Science Foundation (No. SES-1426642).
Contributorship
The lead author wrote the first draft. All authors contributed to study design, manuscript preparation, data analysis, results interpretation, critical revisions, and final decision to submit. The lead author had full access to all the data in the study, takes responsibility for the integrity of the data, and is accountable for the accuracy of the analysis.
Transparency Statement
The lead author affirms that the article is an honest accurate and transparent account of the study being reported, that no important aspects of the study have been omitted, and that any discrepancies from the study as originally planned have been explained.
Accountability
The lead author (Donald A. Redelmeier) had full access to all the data in the study, takes responsibility for the integrity of the data, and is accountable for the accuracy of the analysis.
Patient and Public Involvement
Patients and the public were not involved in the design, conduct, or reporting of this study.
Dissemination Statement
We plan to disseminate the results to patients and public organizations.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
