Abstract
Objective:
Neurocognitive underpinnings are implicated in the aetiology and maintenance of body dysmorphic disorder (BDD); however, inconsistent findings across a range of neurocognitive domains suggest that a comprehensive synthesis of the literature using a hierarchical framework of neurocognition is needed.
Methods:
A final search across OVID Medline, PsycNET, Scopus and Web of Science databases was conducted on 20 June 2024 to identify research that examined performance on behavioural tasks of objective neurocognition in BDD. Risk of bias was assessed using the Newcastle-Ottawa Scale. Fifty-four studies aligned with the following inclusion criteria: (1) full-text; (2) peer-reviewed; (3) published in English; (4) employed a neurocognitive task with an objective outcome and (5) involved a case-controlled paradigm consisting of BDD and healthy control samples. Findings were synthesised according to neurocognitive sub-domains viewed as a hierarchy from basic to higher-level domains.
Results:
Neurocognitive differences in BDD relative to controls were identified at almost all levels of the hierarchy, most consistently in the upper domains of executive function and social cognition. Vulnerabilities were also demonstrated in the sub-domains of visual perception of faces, Gestalt processing, selective attention to faces and verbal memory. Methodological limitations or the influence of neurocognitive sub-groups may contribute to inconsistencies across the literature.
Conclusions:
Although neurocognitive differences appear central to BDD, a picture of neurocognitive heterogeneity emerged with the salience of stimuli important and a likely bias to local-over-global processing demonstrated across the domains.
Keywords
Body dysmorphic disorder (BDD) is a mental disorder characterised by excessive preoccupation with perceived flaws in appearance accompanied by repetitive behaviours and/or mental acts causing clinically significant distress and/or dysfunction (American Psychiatric Association, 2013). While its aetiology remains unclear, BDD presents clinically as an enduring perception of the self as deformed or grotesque, which negatively impacts all facets of life (Rossell, 2023). The prominence of misperception and fixed beliefs implicates neurocognitive aspects in the development and maintenance of symptoms.
Research has indicated significantly reduced performance in BDD compared to healthy controls (HCs) across multiple neurocognitive domains. These include visual organisation (Kerwin et al., 2014), attention (Toh et al., 2015a), memory (Deckersbach et al., 2000) and planning/problem solving (Dunai et al., 2010). The largest empirical study of neurocognition in BDD (n = 65) reported significant impairments in inhibition/flexibility, speed of processing, working memory, visual and verbal learning and reasoning/problem-solving (Malcolm et al., 2021). Another comprehensive study found a global cognition deficit in BDD using the Repeatable Battery for the Assessment of Neuropsychological Status (RBANS) with a very large effect (Toh et al., 2015a; BDD < HC d = 1.36). However, findings of neurocognitive differences in BDD have not been consistently ascertained across the literature (e.g. Greenberg et al., 2018; Rossell et al., 2014).
Johnson et al. (2018) performed the first systematic review and meta-analysis of neurocognitive functioning in BDD, indicating significantly reduced performance relative to HCs across 23 studies, with small to medium pooled effect sizes across the domains of interpretative biases (g = 0.30), memory (g = 0.56) and selective attention (g = 0.60), although no significant group differences were documented for local visual processing (g = 0.35). However, the review did not explore a broad range of neurocognitive functions beyond the four selected domains of interest.
This systematic review sought to address this gap, as well as incorporating additional studies published since the work by Johnson et al. (2018). Specifically, we aimed to examine a comprehensive range of neurocognitive investigations performed in BDD as viewed through the perspective of a hierarchically organised framework of neurocognition (Harvey, 2019). As shown in Figure 1, this framework posits that cognition starts with basic sensory and perceptual lower-order processes and builds in complexity towards higher-order operations, such as those involving language and executive functions (Harvey, 2019). Social cognition can also be considered as an advanced neurocognitive process reliant on executive functions, and thus has been included as an additional domain at the top of the hierarchy. The hierarchy model presents neurocognitive functions as inter-connected domains that increase in complexity and can influence each other through a feedback loop. Thus, basic processes both influence and are controlled by higher-order domains. Examination of patterns of neurocognitive differences in BDD according to this hierarchy can help clarify inconsistent findings to date by contextualising potential impairments in one function against those in connected domains. Furthermore, given the importance of social cognition to the cognitive-behavioural theoretical basis, as well as the lived experience of BDD, domains involving emotion processing and theory of mind, which used validated tasks with correct/incorrect outcomes, were included.

Hierarchy of neurocognitive domains.
Method
Search and selection
The systematic review protocol was developed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses protocol (PRISMA; Moher et al., 2009) and was registered with the international prospective register of systematic reviews, PROSPERO (ID: CRD42020191074). The search strategy was amended since registration to increase the number of databases included, using PsycNET, Web of Science, Scopus and OVID Medline. The final search was conducted on June 20, 2024. Search terms were chosen to be inclusive of body dysmorphic disorder (e.g. ‘BDD’, ‘Dysmorphophobia’, ‘Body dysmorphia’) and cognition (e.g. ‘neurocognition’), and the search syntax was tailored for each database. Search strategy, including full syntax and search strings for each database, are reported in Supplemental Materials. Inclusion criteria were (1) full-text; (2) peer-reviewed; (3) published in the English language; (4) employed a neurocognitive task with an objective outcome (e.g. accuracy or response time) and (5) involved a case-controlled paradigm consisting of BDD and HC samples. A two-staged screening process was used to determine eligibility of records for inclusion. At stage one, KhàC and CC independently screened the titles, abstracts, and keywords of all records. Where eligibility was unclear, articles were retained for the second stage of the screening process. At stage two, KhàC, CC and AM independently screened the full text of retained articles for eligibility. Discrepancies that arose at stage two were resolved by author consensus. Inter-rater reliability was 100%.
Effect sizes
To support interpretation of findings, Cohen’s d values for group differences between BDD and HC participants were calculated, where possible. Where data on group means and standard deviations were not available, Cohen’s d values were calculated from either a t-statistic, F-value, or Pearson’s r value using an effect size conversion programme (Sullivan and Feinn, 2012).
Risk of bias
Risk of bias was assessed by two independent reviewers (KHàC and SLR) using the Newcastle-Ottawa Scale to examine case selection (method of case/HC definition, origin of samples; maximum 4 points), comparability of cases and controls (study design comparability, e.g. differences in age, sex, education; maximum 2 points) and exposure (exposure method, comparability of exposure, non-response; maximum 3 points; Wells et al., 2011). Studies were rated 0–9, with 0–2 denoting poor quality, 3–5 fair quality and 6–9 high quality (see Supplemental Material, Table 4).
Hierarchical framework
The reporting of findings according to cognitive domains was organised according to the bottom-up hierarchical framework reflecting increasing levels of complexity involved in each cognitive operation (Harvey, 2019). While there is overlap between domains, and generally tasks tap into multiple domains, tasks were allocated according to their predominant domain. To reveal subtle differences in contributions to cognitive processing, tasks that examined multiple variables were further split, and the variables were allocated to the appropriate sub-domain. For example, some studies using the Rey-Osterrieth Complex Figure Task (RCFT) reported both organisation, which was allocated to the perception domain, and delayed recall, which was allocated to the memory domain (e.g. Deckersbach et al., 2000). To help synthesise the extensive results, summaries for each cognitive domain are provided.
Results
Search
The search provided 8,441 records after the removal of duplicates, from which 54 articles were identified for inclusion (see Figure 2).

PRISMA flow diagram showing record screening and selection.
Risk of bias
Table 1 demonstrates that overall risk of bias within studies was low. The minimum quality of studies retained was fair quality (one study), with the vast majority being of high quality (53 studies; for a detailed analysis, see Supplemental Material, Table 4).
Quality analysis: Newcastle-Ottawa Scale (NOS) for case-controlled studies.
The designator ⋆ denotes that study has met criteria outlined in accordance with Newcastle-Ottawa Quality Assessment Scale for case-control studies.
Study characteristics
Table 2 shows the characteristics of the retained studies.
Characteristics of included studies (N = 54).
Age range is not included as it was not widely reported across the studies.
Cognitive findings
Table 3 presents a summary of the cognitive findings across the studies.
Summary of included studies, tasks employed and key behavioural findings by cognitive domain.
The designator + denotes no effect size calculated. AGN, Affective Go/No-Go Task; BRT, Beads Reasoning Task; BAAS, Beliefs about Appearance Scale; BDD, Body Dysmorphic Disorder; CGT, Cambridge Gambling Test; CANTAB, Cambridge Neuropsychological Test Automated Battery; CVLT, California Verbal Learning Test; CT, Composite Task; COWA, Controlled Oral Word Association; EFT, Ekman and Friesen Task; FEEST, Facial Expression of Emotions Stimulus and Test; HC, Healthy Controls; HSF, High spatial frequency; IR, Identity Recognition; IED, Intra-Extra Dimensional Set-shifting Task; LSF, Low Spatial Frequency; MSC, Mayer-Salovey-Caruso; MATRICS, MATRICS Consensus Cognitive Battery; MEI, Morphed Emotion Identification; MASC, Movie for the Assessment of Social Cognition; NT, Navon Task; NP, Negative Priming; NToL, New Tower of London; NSF, Normal spatial frequency; PFA, Pictures of Facial Affect; QSFID, Queen Square Face Discrimination Test; RMET, Reading the Mind in the Eyes Task; RT, response time; RAVLT, Rey Auditory Verbal Learning Task; RCFT, Rey Complex Figure Task; RBANS, Repeatable Battery for the Assessment of Neuropsychological Status; RHI, Rubber Hand Illusion; SIA, Social Inference Accuracy; SWM, Spatial Working Memory; SOC, Stockings Of Cambridge; WAIS-III, The Wechsler Adult Intelligence Scale-III; VL, verbal learning; VPA, Verbal Paired Association; VOSP, visual object and space perception; WMS-III, Wechsler Memory Scale-III; BDD-YBOCS, Yale-Brown Obsessive-Compulsive Scale modified for Body Dysmorphic Disorder.
Sensation
A single study found no significant differences in proprioception using the rubber hand illusion across both synchronous and asynchronous stimulation conditions (Kaplan et al., 2014).
Perception
Motor function
A single study found no significant difference in motor function in BDD using the Purdue Pegboard (Hanes, 1998).
Visual perception
To distinguish the perception of general stimuli from those which may be of increased pertinence to individuals with BDD, this domain was categorised into sub-categories of non-face discrimination, face discrimination, face modification detection and Gestalt perception. We further categorised the Gestalt studies into organisation, configural processing, distortion and inversion effects.
Non-face discrimination
Visual discrimination of non-face stimuli was examined in eight studies that predominantly measured object-matching (Buhlmann et al., 2014; Feusner et al., 2007, 2011; Lambrou et al., 2011; Li et al., 2015; Moody et al., 2015; Reese et al., 2010; Rossell et al., 2014). Across these, 15 variables showed no significant differences in BDD, with small to moderate effect sizes. Only a single variable showed a significantly slower response time with a large effect size; however, this study had a small sample (BDD; N = 14; Feusner et al., 2011). A further study showed a medium to large effect size for reduced accuracy in BDD, although this finding did not reach statistical significance (Rossell et al., 2014).
Face discrimination
Eleven studies examined face discrimination: the ability to discriminate between photos of different faces and objects. Four variables showed significant differences in BDD. Feusner et al. (2010a) identified reduced accuracy and slower response times discriminating emotional faces as well as slower response times for neutral faces. This pattern of findings was confirmed by Toh et al. (2017c) using neutral Pictures of Facial Affect (PFA) and own-face stimuli with moderate to large effect sizes. Furthermore, Toh et al. (2017c) demonstrated an aversion to focus on both own and other face-processing, regardless of preoccupation foci.
Sixteen variables, encompassing accuracy or response times, displayed no significant differences in face discrimination in BDD, with effect sizes ranging from small to large. These studies employed diverse measures including the Benton Facial Recognition Test (BFRT; Buhlmann et al., 2004), Queen Square Face Discrimination Test (QSFD; Fang et al., 2020), Composite Face Matching Task (Monzani et al., 2013), Famous Faces Task (FFT; Jefferies et al., 2012), Penn Faces stimuli (Feusner et al., 2007, 2010a, 2010b; Li et al., 2015) and own faces (Ritter et al., 2020). Possible ceiling effects may account for some of these findings.
Modification detection
Eight studies investigated facial modification detection (Buhlmann et al., 2014; Hübner et al., 2016; Lambrou et al., 2011; Onken et al., 2024; Pikoos et al., 2024; Reese et al., 2010; Stangier et al., 2008; Yaryura-Tobias et al., 2002). Four variables showed atypical modification processing. Lambrou et al. (2011) demonstrated significantly enhanced accuracy in modification detection to own faces with a large effect size, and Stangier et al. (2008) showed significantly greater other-face accuracy under timed conditions. Buhlmann et al. (2014) found significantly increased reporting of non-existent changes in other faces. Yaryura-Tobias et al. (2002) demonstrated significantly less-accurate facial modification detection, with half of their BDD sample (of whom 80% had predominantly facial foci of concern) incorrectly detecting modifications to own faces when none existed. No incorrect detections were reported among healthy controls. These findings have been argued to support the notion of atypical aesthetic sensitivity for faces in BDD (Buhlmann et al., 2014; Lambrou et al., 2011; Stangier et al., 2008; Yaryura-Tobias et al., 2002).
Eleven other variables, however, showed no significant differences in accuracy or response times for other face change detection (Buhlmann et al., 2014; Hübner et al., 2016; Lambrou et al., 2011; Pikoos et al., 2024) or other face symmetry preference (Onken et al., 2024; Reese et al., 2010). Variations in study design may account for the inconsistent findings. For example, computer-generated stimuli were employed in the study by Buhlmann et al. (2014), whereas others presented computer-modified naturalistic images (Hübner et al., 2016; Lambrou et al., 2011; Onken et al., 2024; Pikoos et al., 2024; Reese et al., 2010; Stangier et al., 2008; Yaryura-Tobias et al., 2002). Stangier et al. (2008) examined only women, presenting colour photos of a single woman. Whereas Yaryura-Tobias et al. (2002) employed mixed-gender stimuli but examined a predominantly male BDD sample, and Reese et al. (2010) employed black-and-white photos and allowed extended viewing time.
Gestalt non-face visual organisation
Gestalt non-face visual organisation was examined in six studies (Deckersbach et al., 2000; Giannopoulos et al., 2022; Greenberg et al., 2018; Silverstein et al., 2015; Yaryura-Tobias et al., 2002; Yousefi et al., 2020). Three variables revealed significantly diminished performance on the RCFT in terms of copy organisation and delayed recall accuracy, with large effect sizes (Deckersbach et al., 2000; Greenberg et al., 2018). Compared to controls, individuals with BDD demonstrated impaired search and recall strategies when asked to replicate the complex figure, focusing on smaller details at the expense of larger, important design features.
However, analysis of 15 other variables using a variety of tasks, including illusion tasks (Giannopoulos et al., 2022), a contour integration task (Silverstein et al., 2015), the RCFT (organisation; Yousefi et al., 2020) and the perceptual organisation components of the Wechsler Adult Intelligence Scale-III (WAIS-III: Perceptual; Yaryura-Tobias et al., 2002), detected no significant differences in perceptual organisation, although moderate effect sizes for immediate recall on the RCFT (organisation; Deckersbach et al., 2000; Greenberg et al., 2018) and Ebbinghaus illusion task (Silverstein et al., 2015) were found. Findings of preserved immediate recall but impaired copy organisation have been proposed to stem from atypical organisational strategies characteristic in BDD, relying predominantly on detail-oriented processing while de-prioritising global processing (Deckersbach et al., 2000).
Gestalt non-face configural
Gestalt (non-face) configural processing was investigated in three studies utilising the Navon Task (NT; Kerwin et al., 2014; Monzani et al., 2013; Pikoos et al., 2024) and Embedded Figures Task (EFT; Kerwin et al., 2014). There were significant differences with large effect sizes across eight variables (indicating reduced accuracy or response times) on the NT (Kerwin et al., 2014). In contrast, Pikoos et al. (2024) and Monzani et al. (2013) found no significant differences in NT accuracy, albeit utilising a different methodology with additional instructions to focus attention on global or local stimuli.
Gestalt distortion
Gestalt distortion processing was investigated in six studies employing high and low spatial frequency-altered photos of objects (Feusner et al., 2011; Li et al., 2015) and faces (Feusner et al., 2007; Li et al., 2015; Moody et al., 2015; Pikoos et al., 2024; Wang et al., 2024). BDD participants showed significantly slower response times on two variables for both low and high spatial frequency objects with moderate to large effect sizes (Feusner et al., 2011).
However, 17 variables showed no significant behavioural differences in accuracy or response time for objects, own/other faces (Feusner et al., 2007, 2011; Li et al., 2015; Moody et al., 2015; Pikoos et al., 2024; Wang et al., 2024).
Gestalt inversion
Gestalt inversion processing was examined in six studies utilising inverted face tasks (Feusner et al., 2010b; Monzani et al., 2013; Pikoos et al., 2024; Ritter et al., 2020), the FFT (Jefferies et al., 2012) and the Mooney Faces Task (Toh et al., 2017b). Six variables indicated significantly reduced face or object-inversion effects based on response times or accuracy. These include response time for long-duration stimuli measured by Feusner et al. (2010b) and significantly increased inverted face recognition accuracy with a large effect size reported by Jefferies et al. (2012). Toh et al. (2017b) identified significantly reduced face and object inversion effects, alongside more accurate and faster object identification, with medium to large effect sizes. Contrary findings were demonstrated by Pikoos et al. (2024), who found significantly larger face-inversion effects across both ‘same’ and ‘different’ trials using response time.
However, 12 further behavioural variables exhibited no significant differences. These include no significant accuracy differences for both short- and long-duration stimuli, as reported by Feusner et al. (2010b), as well as no difference in response time for short-duration stimuli. Similarly, Monzani et al. (2013) found no significant differences in both accuracy and response times, and Ritter et al. (2020) found no significant differences in short-duration stimuli accuracy or response time across both own and other faces, findings which were mirrored by accompanying neurobiological evidence.
Summary
The current literature findings examining visual perception employing behavioural tasks in BDD are mixed. Only limited evidence exists for impairments in discriminating non-face stimuli (Feusner et al., 2011), while there is a larger body of evidence supporting atypical face discrimination (Feusner et al., 2010a; Toh et al., 2017c) and considerably more indicating differences in facial modification detection (Buhlmann et al., 2014; Lambrou et al., 2011; Stangier et al., 2008; Yaryura-Tobias et al., 2002). In the gestalt perception literature, substantial evidence indicated atypical, non-face, configural (Kerwin et al., 2014), inverted face/object perception (Feusner et al., 2010b; Jefferies et al., 2012; Pikoos et al., 2024; Toh et al., 2017b) and a potential predominance of detail-oriented, local-over-global visual processing.
Attention/processing speed
Selective (non-face) attention
Selective (non-face) attention was examined in four studies (Malcolm et al., 2021; Toh et al., 2015a, 2017a; Yousefi et al., 2020). Utilising the RBANS, Toh et al. (2015a) demonstrated significantly reduced attention using the Digit Span Forwards and the Attention Index with large effect sizes. This was confirmed by Yousefi et al. (2020) employing the Digit Span Backward of the Wechsler Memory Scale-III (WMS-III). Toh et al. (2017a) also showed disorganised eye-tracking on the Emotional Stroop Task (a word-reading task involving BDD-relevant stimuli), suggesting attention deficits in BDD.
Two further behavioural variables showed no significant differences in non-face attention on the Digit Span Forwards and WMS-III (Yousefi et al., 2020) and MATRICS (Malcolm et al., 2021).
Selective face/body attention
Selective attention to faces and bodies using eye-tracking technology was examined in six studies. Reduced attention was found across four variables (both dwell time and fixations) when viewing own (Toh et al., 2017c; Wong et al., 2022) and other faces (Toh et al., 2017c). Grocholewski et al. (2012) showed significantly increased fixations to own or other faces (especially to areas corresponding to a perceived defect), although there were no significant differences in dwell time. Waldorf et al. (2019) showed significantly increased attention to attractive own-body features and reduced attention to unattractive own-body features. Kollei et al. (2017) showed this same pattern in faces with large effect sizes for dwell time and fixations, except for fixations of own-face attractive features, which were not significantly different from controls. Greenberg et al. (2014) showed significantly increased fixations for own least-attractive facial features and significantly increased dwell time for least-attractive features in both own and other faces, with small effect sizes.
Processing speed
Processing speed was examined in three studies (Jefferies-Sewell et al., 2017; Malcolm et al., 2021; Toh et al., 2015a). The Speed of Processing Index of the MATRICS battery demonstrated significantly slower processing in a large sample with a large effect size (Malcolm et al., 2021), the coding sub-test of the RBANS found no significant differences but a large effect size (Toh et al., 2015a) and the Stop Signal Task (SST) demonstrated no significant differences (Jefferies-Sewell et al., 2017). Of the limited literature examining processing speed in BDD, the finding of Malcolm et al. (2021) using a large sample and demonstrating significantly slower processing with a large effect is the most pertinent of all in this sub-domain.
Summary
Selective attention to faces and bodies emerged as an important area of cognitive difference in BDD, with all studies demonstrating significant variations, some exhibiting large effect sizes.
Language
A single study examined language (Rossell et al., 2014). Two out of three tests of semantic fluency showed significantly reduced verbal learning capabilities with large effect sizes, and three tests of phonological fluency showed no significant differences with only small effect sizes.
Learning and memory
Non-verbal memory
Nine studies examined non-verbal memory and learning (Deckersbach et al., 2000; Dunai et al., 2010; Fang et al., 2020; Greenberg et al., 2018; Hanes, 1998; Malcolm et al., 2021; Toh et al., 2015a; Wang et al., 2024; Yousefi et al., 2020). Four variables showed significant differences in BDD. Two showed significantly poorer immediate recall on the RCFT – in which subjects were asked to copy and recall a complex figure drawing – with large effect sizes (Deckersbach et al., 2000; Yousefi et al., 2020). Yousefi et al. (2020) demonstrated significantly poorer RCFT copy accuracy, in which the BDD group recalled more detailed aspects of the figure at the expense of overall accuracy. In the largest sample, Malcolm et al. (2021) showed significantly poorer visual learning with a large effect size using the MATRICS.
Seven further variables showed no significant differences. Small to moderate effect sizes were found using the Cambridge Neuropsychological Test Automated Battery (CANTAB) pattern recognition (Dunai et al., 2010), Cambridge Face Memory Test (CFMT; Fang et al., 2020) and RCFT (Deckersbach et al., 2000; Greenberg et al., 2018; Hanes, 1998; Toh et al., 2015a). While Wang et al. (2024) found no significant difference in distorted face learning, they demonstrated a large effect size, and the accompanying neurobiological data showed differences in BDD. In some cases, ceiling effects and task demands were suggested as explanations for these non-significant findings (e.g. Dunai et al., 2010; Toh et al., 2015b). The intact RCFT copy accuracy but impaired organisation finding of Deckersbach et al. (2000) implicates organisational strategy as a possible mechanism for memory impairment in BDD.
Verbal learning and memory
Six studies examined verbal learning and memory (Deckersbach et al., 2000; Hanes, 1998; Malcolm et al., 2021; Reese et al., 2011b; Toh et al., 2015a; Yousefi et al., 2020). Six variables examining immediate recall showed large effect sizes for both short and long delays (Deckersbach et al., 2000; Yousefi et al., 2020). Two variables demonstrated significantly poorer verbal learning with moderate effect sizes using the MATRICS (Malcolm et al., 2021) and Verbal Paired Association (Yousefi et al., 2020). The Immediate Memory Index of the RBANS showed a significant deficit in story memory with a large effect size. Impaired story recall with a large effect size was also found (Toh et al., 2015a).
However, findings of significant differences in verbal learning and memory in BDD were not found consistently. The Language and Delayed Memory Indexes of the RBANS showed no significant differences, although all tests showed large effect sizes, and ceiling effects were observed in some sub-tests (Toh et al., 2015a). Three further tests found no significant differences in word recall (Hanes, 1998; Reese et al., 2011b). The preserved memory for lists, but impaired story memory, reported by Toh et al. (2015a) implies a specific impairment in gist understanding. This suggests that salient memory tasks, rather than simple recall, may be more difficult in BDD.
Summary
While there are inconsistencies in findings across the literature, taken together, it appears that individuals with BDD experience significant deficits across multiple domains of memory. Deficits in both non-verbal and verbal memory and learning are demonstrated in large samples and with large effect sizes. While some variables showed no significant deficits, patterns of impairments, notably that of immediate memory deficits but intact language learning, have been found repeatedly (Deckersbach et al., 2000; Hanes, 1998; Toh et al., 2015a). Furthermore, individuals with BDD demonstrated significant difficulties with immediate verbal recall (Deckersbach et al., 2000; Yousefi et al., 2020), as well as memory for gist meanings of complex linguistic material (Toh et al., 2015a).
Executive function
Working memory
Working memory was examined in four studies (Chen et al., 2023; Dunai et al., 2010; Malcolm et al., 2021; Yousefi et al., 2020), revealing consistent findings across three. Four variables demonstrated significantly poorer working memory in BDD with large effect sizes for visual working memory (Dunai et al., 2010), verbal working memory (Malcolm et al., 2021) and visual and verbal working memory (Yousefi et al., 2020). The significant and large finding of Malcolm et al. (2021) obtained using a large sample is of particular note. Furthermore, Yousefi et al. (2020) found that individuals with BDD performed comparably to controls on the forward component of the Digit Span sub-test but exhibited significantly poorer performance on the backward component.
However, significant working memory differences in BDD were not observed in two studies. Despite a large effect size observed on the Spatial Span in the study by Dunai et al. (2010), no significant differences were noted in accuracy and strategy. In addition, Chen et al. (2023) did not find significant group differences for accuracy in their study using the CANTAB Spatial Working Memory.
Cognitive inhibition
Across the three studies examining cognitive inhibition (Chen et al., 2023; Hanes, 1998; Jefferies-Sewell et al., 2017), five variables using the Stroop Test (Hanes, 1998), Cambridge Gambling Test (CGT; proportion of bets, risk taking and delay aversion) and SST (response time; Jefferies-Sewell et al., 2017) showed significantly poorer performance in BDD with large effect sizes. While two further variables using the CGT (quality of decision-making and deliberation time) did not reach statistical significance, they demonstrated moderate to large effect sizes (Jefferies-Sewell et al., 2017). Two further variables measured through the SST (response time: Chen et al., 2023) and CGT (quality of decision-making; Jefferies-Sewell et al., 2017) found no significant differences demonstrating small effect sizes.
Cognitive flexibility
Cognitive flexibility was investigated in five studies with differences demonstrated across six variables (Greenberg et al., 2018; Jefferies-Sewell et al., 2017; Kerwin et al., 2014; Malcolm et al., 2021). This included significant differences on the CANTAB: Intra-Extra Dimensional Set-shifting Task (IED) with moderate to large effect sizes (Greenberg et al., 2018; Jefferies-Sewell et al., 2017), while Kerwin et al. (2014) reported significantly slower response times in the switch condition of the NT, and Malcolm et al. (2021) identified significantly poorer flexibility using the cognitive flexibility scale of the MATRICS.
Emotional inhibition
Findings in emotional inhibition yielded mixed results across eight studies. Toh et al. (2017a) found significantly slower response times for both BDD-negative and BDD-positive words on the Emotional Stroop task. Furthermore, eye-tracking data in the same study revealed disorganised viewing strategies in BDD, involving both avoidance and extended focus. Similarly, Buhlmann et al. (2002) found significant interference from BDD-positive words in the BDD group. Two further variable outcomes showed significantly reduced accuracy and response times for BDD-positive words on the Affective Go/No-Go Task (AGN; Jefferies-Sewell et al., 2017). In addition, utilising the Emotional Stroop, Buhlmann et al. (2011b) reported significantly greater beliefs in the importance of attractiveness among individuals with BDD with a moderate effect size.
In contrast, no significant differences in response times were identified for BDD-negative words (Buhlmann et al., 2002; Jefferies-Sewell et al., 2017), accuracy and response latency on a modified Stroop task using body-related words (Rossell et al., 2014), emotion management (Malcolm et al., 2021), AGN (Hartmann et al., 2015) and negative priming tasks comparing appearance-related threat and non-threat words (Wilhelm et al., 2003).
Planning and problem-solving
Planning and problem-solving was examined in five studies. Five variables consistently demonstrated significantly diminished planning and problem-solving ability in BDD using the Stockings of Cambridge (Dunai et al., 2010), the New Towers of London (Hanes, 1998) and the MATRICS (Malcolm et al., 2021), all with large effect sizes. These deficits manifested as both significantly slower and less-accurate problem-solving and poorer planning skills.
However, two variables, evaluated through the Beads Reasoning Task (Reese et al., 2011a) and the Wisconsin Card Sorting Test (Yaryura-Tobias et al., 2002), did not show significant differences in planning and problem-solving.
Summary
Impairments in executive function in BDD were most pronounced in the sub-domains of working memory, cognitive flexibility and cognitive inhibition. There was some, but not consistent, evidence of planning and problem-solving deficits. Findings in the emotional inhibition sub-domain were least consistent.
Social cognition
Emotion perception
Emotion perception was investigated across 10 studies. Five variables indicated significantly reduced overall emotion perception in BDD, most with large effect sizes (Buhlmann et al., 2004, 2006; Grace et al., 2019; Toh et al., 2015b, 2017c). Two variables did not demonstrate general reduced accuracy but revealed significant differences in specific emotions (Buhlmann et al., 2011a; Jefferies et al., 2012). Jefferies et al. (2012) observed a large effect size for impairment in recognising fear, while Buhlmann et al. (2011a) reported a moderate effect size for the recognition of neutral faces. Jefferies et al. (2012) also noted a large effect for impairment in overall perception using the Facial Expression of Emotions Stimulus and Test, although this did not reach statistical significance. Similarly, Fang et al. (2020) found a moderate effect but no significant difference in overall emotion perception using the, QSFD and Rossell et al. (2014) found a trend towards an anger recognition bias. While Buhlmann et al. (2006) noted a significant, large effect for overall emotion recognition in a self-referent condition, the findings for the other-referent condition did not reach statistical significance, although they did note a medium effect. Buhlmann et al. (2011a) reported no significant differences using a seven-emotion recognition task. Similarly, Pikoos et al. (2024) found no significant differences in emotion recognition accuracy or response time. Task variation across the emotion perception literature is significant including the type and number of emotions presented, the presentation manner and timing and the colour and type of the images. These variations in task characteristics have been suggested as possible explanations for the inconsistent findings (Pikoos et al., 2024).
More consistent findings have revealed significantly reduced accuracy and slower response times for negative emotions such as anger, sadness, fear and disgust and for neutral expressions (Buhlmann et al., 2004, 2006, 2011a; Toh et al., 2015b).
Theory of mind
Theory of mind has been examined in three studies. Two variables showed differences, with Buhlmann et al. (2015) showing significantly reduced social inference (thoughts and intentions) accuracy with moderate to large effect sizes on the Movie for the Assessment of Social Cognition (MASC). Fang et al. (2020) also revealed a significant difference, demonstrating a large effect size, for reduced complex emotion recognition on the Reading the Mind in the Eyes Task (RMET). In contrast, Buhlmann et al. (2013) found no significant differences and only small effect sizes across three variables employing the RMET, and Buhlmann et al. (2015) showed no significant difference on the emotion recognition component of the MASC.
Summary
Extensive research attention has been dedicated to exploring emotion perception in BDD, and despite some inconsistencies, indications point towards likely deficits in emotion recognition for facial stimuli, and preliminarily for own faces in individuals with BDD. The limited scope of studies and variations in outcomes underscores the need for further research examining theory of mind.
Figure 3 summarises the results (as displayed comprehensively in Table 3) at a high level, showing the cognitive variables and domains examined across the studies.

Figure demonstrating the significant differences in BDD according to the cognitive hierarchy.
Discussion
This systematic review aimed to synthesise findings relevant to the behavioural measurement of cognition in BDD, using a cognitive hierarchy framework. A definitive cognitive profile for individuals with BDD was not uncovered; however, the findings evidenced differences between BDD and HC participants across all levels of the cognitive hierarchy which may be interrelated.
Atypical visual perception (Feusner et al., 2010b), especially of emotionally salient stimuli (Lambrou et al., 2011; Toh et al., 2017c), represents a likely cognitive marker in BDD. Moreover, evidence for local-over-global processing in BDD was found using varied tasks across all sub-domains of visual perception, as well as tasks involving selective attention to faces, non-verbal memory, emotion recognition and theory of mind. Thus, the influence of emotional salience and locally biased visual perception appears to be present across all levels of the cognition hierarchy. When neurological evidence was considered in addition to the behavioural findings, this processing tendency in BDD was corroborated (e.g. Feusner et al., 2007, 2010a, 2011; Li et al., 2015). The cognitive-behavioural model of BDD proposes that a cognitive style which preferences detail over holistic processing of visual information underlies BDD’s development and maintenance (Fang and Wilhelm, 2015). The findings of this review support this model; the most consistent behavioural findings are in relation to BDD global/local processing differences across the domains (i.e. in modification detection, Gestalt visual processing, selective attention to faces/bodies, learning and memory, working memory and emotion perception). However, the review findings further suggest that visual disturbances may be experienced heterogeneously across individuals and tasks. Cognitive sub-groups, such as individuals who are selectively versus broadly impaired, may underlie behavioural differences reported in BDD (Malcolm et al., 2021). It may be that a single cognitive profile is unlikely to be representative of the heterogeneous presentation of BDD. The data do, however, indicate that establishing a comprehensive understanding of local/global processing in BDD is important.
An emerging finding is that salience appears influential in BDD. For example, findings in visual and Gestalt perception revealed that impairments are more consistently demonstrated for salient stimuli, such as faces, than for general stimuli. This finding is also demonstrated in the higher-order sub-domains of selective attention and emotion perception, as well as at the executive function level in emotional inhibition, for the processing of appearance-related words. Thus, employing a hierarchical model of cognition reveals that BDD is not simply associated with ‘bottom-up’ visual problems alone. Instead, cognitive differences were most consistently demonstrated in the higher level, ‘top-down’ domains of executive function and social cognition. While less consistent, they were also noted at the lower levels, suggesting the likelihood of an interconnected ‘network of influence’ in which co-occurring cognitive differences interact across the hierarchy. Multi-level modelling with comprehensive cognitive datasets will need to be conducted to further elucidate such a supposition.
This review has highlighted that cognitive difficulties in BDD are present and increase in consistency with increasing cognitive complexity. At present, cognitive deficits are not specifically treated in BDD treatment models, and these data suggest that they should be.
Limitations
Comparison of findings within sub-domains was difficult due to the great variation in methodology used across the studies, and some inconsistencies in the findings can be accounted for by these variations. Differences in tasks and task characteristics, including in stimuli, presentation time and sequencing, have been noted to account for some of these discrepancies. There is also some overlap across cognitive domains, as tasks generally tap into multiple cognitive processes, which may blur domain-specific differences. However, there is also evidence of individual differences in how neurocognition in BDD is experienced (e.g. sub-groups), although this heterogeneity is unsurprising given the diversity and complexity in the disorder.
The aim of this study was to conduct a systematic review. It was however noted that the small number of studies per sub-domain and variations in task methodologies prevented a meta-analytic synthesis. Furthermore, we limited the review to examining ‘cold’ neurocognition, and not broader cognition, including psychological, emotional and evaluative elements. The influence of psychological ‘hot’ cognition, such as underlying emotions, schemas, beauty ideals and beliefs regarding the importance of attractiveness, also factor in individuals’ perception of their ugliness; however, this is beyond the scope of this review. Yet, our findings regarding the importance of salience in potentially contributing to neurocognitive differences in BDD suggest ‘hot cognition’ is likely to be part of the total cognitive picture of BDD. The cross-sectional designs of the included studies also limited the ability to examine the relationship between neurocognitive profiles and the disorder, including whether these profiles represent a risk factor for its development and maintenance or are a consequence of the disorder itself. Longitudinal investigation is needed to examine whether neurocognitive differences in BDD can be impacted by treatment.
Compared to other conditions, the field of research focused on cognition in BDD is still nascent, and the current literature likely under-represents the diversity of neurocognitive experience among people with the disorder. Thus, it is unclear if the findings are generally representative of the condition, or whether they may relate more to certain sub-groups. Furthermore, while the studies were of acceptable quality with low risk of bias, most studies employed Western and predominantly female samples. Future research should investigate potential neurocognitive differences in samples of diverse cultural backgrounds and genders.
Future directions
Further investigation of neurocognitive differences in BDD employing a standardised, valid assessment battery longitudinally across larger samples is warranted. The review highlighted neurocognitive abilities that are yet to be examined in BDD. Sub-domains such as interoception, for example, have been proposed to play an important role in unconscious perception errors, which may underlie BDD (Jenkinson and Rossell, 2024). Future reviews could examine patterns of neurocognitive differences between BDD and related disorders such as obsessive-compulsive disorder and anorexia nervosa to better characterise patterns specific to BDD.
Conclusion
This review contributes to understanding the complex experiences of BDD from a ‘cold’ cognition perspective utilising a neurocognitive hierarchy framework. The literature proffers evidence of behavioural differences in neurocognitive tasks across all levels of the neurocognitive hierarchy, particularly for emotionally salient stimuli and local/global processing biases. Inconsistencies in the findings may be due to methodological variations, or heterogeneity in how individuals experience the disorder. Research employing a comprehensive, standardised test battery of neurocognitive tasks across the hierarchy utilising large samples is needed to further define the neurocognitive profile of BDD.
Supplemental Material
sj-docx-1-anp-10.1177_00048674241309747 – Supplemental material for A systematic review of neurocognition and social cognition in body dysmorphic disorder
Supplemental material, sj-docx-1-anp-10.1177_00048674241309747 for A systematic review of neurocognition and social cognition in body dysmorphic disorder by Katrina Holmes à Court, Amy Malcolm, Wei Lin Toh and Susan L Rossell in Australian & New Zealand Journal of Psychiatry
Supplemental Material
sj-docx-2-anp-10.1177_00048674241309747 – Supplemental material for A systematic review of neurocognition and social cognition in body dysmorphic disorder
Supplemental material, sj-docx-2-anp-10.1177_00048674241309747 for A systematic review of neurocognition and social cognition in body dysmorphic disorder by Katrina Holmes à Court, Amy Malcolm, Wei Lin Toh and Susan L Rossell in Australian & New Zealand Journal of Psychiatry
Footnotes
Acknowledgements
The authors thank Chris Cooper (CC) for his early work.
Author contributions
KHàC and SLR were responsible for conceptualisation, literature search and review, methodology, visualisation and interpretation and writing – original draft, review and editing. AM and WLT were responsible for methodology, project administration, interpretation and writing – review and editing. SLR additionally supervised/mentored.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by a Senior National Health and Medical Research Council Research Fellowship (GNT1154651 to SR). All funding sources had no role in the study design, collection, analysis or interpretation of the data, writing of the manuscript, or the decision to submit the paper for publication.
Disclaimer
The views expressed herein are the authors’ opinions and do not reflect any official positions of the institutional affiliations or funding bodies. The authors do not have any conflicts to declare in relation to this manuscript.
Supplemental Material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
