Abstract
The Reading the Mind in the Eyes Test (RMET) is a purported theory of mind measure and one that reliably differentiates autistic and non-autistic individuals. However, concerns have been raised about the validity of the measure, with some researchers suggesting that the multiple-choice format of the RMET makes it susceptible to the undue influence of compensatory strategies and verbal ability. We compared the performance of autistic (
Lay abstract
Recognizing and understanding the perspectives of others—also called theory of mind—is important for effective communication. Studies have found that some autistic individuals have greater difficulty with theory of mind compared to non-autistic individuals. One purported theory of mind measure is the Reading the Mind in the Eyes Test (RMET). This test presents participants with photographs of pairs of eyes and asks them to identify the emotion displayed by each pair of eyes from four choices. Some researchers have argued that the multiple-choice format of the RMET may not be an accurate measure of theory of mind, as participants could simply be guessing or using a process of elimination to select the correct answer. Participants may also be disadvantaged if they are not familiar with the specific emotion words used in the multiple-choice answers. We examined whether a free-report (open-ended) format RMET would be a more valid measure of theory of mind than the multiple-choice RMET. Autistic and non-autistic adults performed better on the multiple-choice RMET than the free-report RMET. However, both versions successfully differentiated autistic and non-autistic adults, irrespective of their level of verbal ability. Performance on both versions was also correlated with another well-validated adult measure of theory of mind. Thus, the RMET’s multiple-choice format does not, of itself, appear to underpin its ability to differentiate autistic and non-autistic adults.
Recognizing and understanding the perspectives of others—or theory of mind (Golan et al., 2006)—is important for effective communication. One widely used measure of theory of mind is the Reading the Mind in the Eyes Test (RMET; Baron-Cohen, Wheelwright, Hill, et al., 2001), which presents participants with photographs of pairs of eyes and asks them to identify the emotion displayed by each pair of eyes from four response options. Concerns have been raised about the validity of the measure (Gernsbacher & Yergeau, 2019), with some researchers suggesting that it does not actually measure theory of mind, but rather, emotion recognition (Oakley et al., 2016), intelligence (Rosso & Riolfo, 2020), and vocabulary (Olderbak et al., 2015).
Another concern raised about the RMET is that the multiple-choice response format provides respondents with contextual information that influences their test performance (Betz et al., 2019; Cassels & Birch, 2014). Cassels and Birch (2014) explored these concerns by comparing non-autistic children’s performance on the multiple-choice RMET with a free-report version, arguing that free-report performance would be (a) less vulnerable to the influence of deductive reasoning or process-of-elimination strategies and (b) less dependent on receptive vocabulary. They found that children (aged 4–12 years) scored lower on the free-report than the multiple-choice RMET. Moreover, unlike the free-report RMET, the multiple-choice RMET was strongly associated with verbal ability. They proposed that the free-report RMET may therefore be advantageous when trying to identify emotion recognition deficits and when working with populations with limited verbal ability.
Betz et al. (2019) also found that non-autistic adults (aged 18–63 years) scored higher on the multiple-choice than the free-report RMET. They argued that the former’s response options provide contextual cues that influence participants’ interpretations of the stimuli. They also speculated that the RMET performance differential typically observed between autistic and non-autistic individuals may reflect difficulties in concept learning (i.e. the ability to categorize objects based on common attributes) rather than perspective-taking difficulties. For example, it is possible that non-autistic individuals are more likely to use deductive reasoning to select the correct answer (e.g. “It looks like an unpleasant emotion, so the answer can’t be ‘happy’”). As there is some suggestion that autistic individuals may have difficulty with category learning and generalization of concepts (e.g. Klinger & Dawson, 2001), autistic individuals may rely less on such compensatory strategies to complete the multiple-choice RMET compared to non-autistic individuals, thus resulting in lower scores. Such findings raise concerns about the construct validity of the multiple-choice RMET as a theory of mind measure.
Betz et al. (2019) argued that inferences drawn from prior research using the RMET should be re-evaluated. One such inference is that autistic adults perform more poorly on the RMET than non-autistic adults due to difficulties with theory of mind that are considered to characterize autistic individuals (Baron-Cohen, Wheelwright, Skinner, et al., 2001). Yet, recent research suggests that difficulties with theory of mind are not universal among autistic adults (Brewer et al., 2017; Gernsbacher & Yergeau, 2019). Given the aforementioned limitations in the construct validity of the multiple-choice RMET, it is possible that these group differences reflect differences in verbal ability or concept learning, rather than theory of mind. It is thus important for accurate measures of theory of mind to be developed, as such tools would enable clinicians to better understand the specific needs of their clients and the potential factors that may be contributing to their difficulties with social communication and interaction.
We (1) replicated Betz et al.’s (2019) examination of the effect of response format on RMET performance, but with both autistic and non-autistic adult samples, (2) compared the discriminant validity of the multiple-choice and free-report RMET for autistic and non-autistic adults, and (3) examined the convergent validity of both RMET formats using an independent theory of mind measure, the Adult Theory of Mind–Quick test (A-ToM-Q; Brewer et al., 2022).
Method
Participants
As both Cassels and Birch (2014) and Betz et al. (2019) reported large effect sizes of response format on RMET performance, we targeted a sample size of 128 participants to detect a medium effect size (
Materials
Ten-item Reading the Mind in the Eyes Test (RMET)
The 10-item RMET (Olderbak et al., 2015) presents respondents with 10 images of a pair of human eyes and asks them to judge the emotion captured in each image. The 10-item version of the RMET was used as it demonstrates better unidimensionality and internal consistency than the original 36-item version (Olderbak et al., 2015). The multiple-choice RMET had four response options per item, accompanied by a glossary defining those options. In the free-report format, participants typed their answer in a text box and were not provided with a glossary. Free-report responses were scored by three independent raters against the Merriam-Webster online thesaurus and dictionary as meeting either a stringent, lax, or boundary definition of the target emotion, or as not meeting the definition. For example, on Item 3 (Skeptical), “confused” was considered a boundary definition, “leery” a lax definition, and “suspicious” a stringent definition. (The complete scoring sheet can be accessed at https://osf.io/93sjm/.) On all but one response, at least two of the three raters provided the same score. Disagreements were discussed until consensus was reached. Responses meeting a stringent or lax definition were scored correct; all other responses were scored incorrect. RMET scores range from 0 to 10; higher scores indicate higher levels of theory of mind.
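The free-report scoring rule described above can be illustrated with a minimal sketch. It assumes that each of the 10 responses has already been assigned a single consensus rating; the function names and the label "none" (standing in for a response that does not meet the definition) are hypothetical and are not taken from the study's scoring sheet.

```python
# Minimal sketch of the free-report scoring rule (illustrative names only).
# Ratings follow the scheme in the text: stringent, lax, boundary, or not
# meeting the definition (labelled "none" here, which is an assumption).
CORRECT_RATINGS = {"stringent", "lax"}
VALID_RATINGS = CORRECT_RATINGS | {"boundary", "none"}


def score_free_report(consensus_ratings):
    """Return the RMET total (0 to 10) from ten consensus ratings."""
    if len(consensus_ratings) != 10:
        raise ValueError("the 10-item RMET needs exactly 10 ratings")
    unknown = set(consensus_ratings) - VALID_RATINGS
    if unknown:
        raise ValueError(f"unrecognised rating(s): {sorted(unknown)}")
    # Only stringent and lax matches count as correct; boundary does not.
    return sum(rating in CORRECT_RATINGS for rating in consensus_ratings)


def needs_discussion(rater_scores):
    """True when the three raters did not all assign the same rating."""
    return len(set(rater_scores)) > 1


# Example with made-up ratings for one participant.
ratings = ["stringent", "lax", "boundary", "none", "lax",
           "stringent", "none", "boundary", "lax", "stringent"]
total = score_free_report(ratings)
```

In this made-up example, six of the ten ratings are stringent or lax, so the participant's total is 6; the two boundary responses are scored incorrect, as in the rule above.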
Autism Spectrum Quotient (AQ)
The AQ (Baron-Cohen, Wheelwright, Skinner, et al., 2001) is a 50-item self-report measure of autistic traits. Scores range from 0 to 50; higher scores indicate a higher degree of autistic traits. A cut-off score of 26 has been found to have good sensitivity and specificity in discriminating autistic and non-autistic individuals (Kurita et al., 2005; Woodbury-Smith et al., 2005).
Adult Theory of Mind–Quick (A-ToM-Q)
The social subscale of the Adult Theory of Mind–Quick (A-ToM-Q) test (Brewer et al., 2022) requires respondents to view six videos of interpersonal interactions, each followed by a multiple-choice question (four alternatives) probing their interpretation of subtle social nuances (e.g.
Self-Administered Vocabulary IQ Test (SA-VIQT)
The SA-VIQT is an online verbal IQ test from the Open-Source Psychometrics Project. On each of 45 items, participants are presented with five words and select the two that mean the same. Each correct response earns one point and each incorrect response loses one point; “Don’t know” responses neither earn nor lose points. The SA-VIQT provides an overall verbal IQ (VIQ) score ranging from 40 to 160. It is moderately correlated with the Wechsler Abbreviated Scale of Intelligence (WASI-II; Wechsler, 2011) Verbal Comprehension Index (VCI;
Design
RMET performance was examined using a 2 (Group: autistic, non-autistic) × 2 (Response Format: multiple-choice, free-report) between-subjects design.
Procedure
This project was approved by the Flinders University Human Research Ethics Committee; participants read a study information sheet and gave informed consent. The study was administered using Qualtrics. Participants provided demographic information and indicated if they had received a formal diagnosis of autism. Two attention checks were used to identify the use of robots or automated systems. Participants completed the AQ and A-ToM-Q social subscale, were randomly allocated to either the free-report or multiple-choice RMET, and then completed the SA-VIQT. Participants received an honorarium as compensation for their time.
Community involvement statement
Two of the authors are practicing clinical psychologists who consult with autistic adults and children.
Results
As shown in Table 1, the autistic group scored higher on the AQ and lower on the A-ToM-Q than the non-autistic group. There was no significant group difference in VIQ, but the non-autistic group was significantly older than the autistic group. The correlations between all variables are provided in Supplementary Materials (p. 2).
Table 1. Descriptive statistics for age, AQ, VIQ, and A-ToM-Q for the two groups.
A 2 (Group: autistic, non-autistic) × 2 (Response Format: multiple-choice, free-report) between-subjects analysis of variance (ANOVA) revealed a main effect of response format on RMET scores, with higher scores on the multiple-choice than the free-report version,
Mean (standard deviation) and median Reading the Mind in the Eyes Test (RMET) scores by response format and group.
CI: confidence interval.
Multiple-choice RMET scores were missing for one participant from each group.
For the overall sample, multiple-choice performance was correlated with verbal IQ,
Free-report RMET performance was also significantly correlated with verbal IQ for the overall sample,
Given that verbal IQ and age were significantly correlated with RMET performance, analyses were repeated with verbal IQ and age as covariates. The main effects of response format,
There was a strong correlation between the multiple-choice RMET and the A-ToM-Q in the overall sample,
Discussion
Consistent with Cassels and Birch (2014) and Betz et al. (2019), participants performed better on the multiple-choice than the free-report RMET, suggesting that the multiple-choice format enables the use of additional strategies. Regardless of RMET response format, the RMET decisively discriminated autistic and non-autistic adults. Although the difference between groups was larger for the multiple-choice format than the free-report format, this difference was no longer statistically significant with VIQ controlled. Moreover, although VIQ was correlated with both multiple-choice and free-report performance, controlling for VIQ did not undermine the ability of either version to discriminate the two groups.
In addition, examination of the concurrent validity of both RMET formats revealed that multiple-choice performance correlated strongly with the A-ToM-Q. Although free-report performance was not as strongly correlated, this likely reflects free-report performance being close to the floor. These correlations with A-ToM-Q performance remained consistent after controlling for VIQ. In sum, our findings provide evidence for the concurrent validity of both versions and suggest that the validity of the RMET is not dependent on verbal ability. Given the demanding coding requirements for scoring free-report RMET responses, the multiple-choice RMET is the more accessible, efficient, and economical option.
Limitations
First, we did not obtain evidence that participants had received a formal diagnosis of autism, relying instead on self-reports of a diagnosis and AQ scores. Second, the SA-VIQT, a quick screening measure of VIQ, is not as rigorous as a full-scale verbal IQ measure such as the Wechsler scales. Third, although our results provided promising evidence of the RMET’s concurrent validity with the A-ToM-Q, we note that the A-ToM-Q’s stimulus videos depicting social interactions include (inter alia) the target individuals’ facial expressions. Thus, it is possible that cues from the eye region may contribute to a degree of shared variance between RMET and A-ToM-Q scores. One way to examine this possibility would be to isolate or pixelate the eye region of the characters in the A-ToM-Q stimuli.
Conclusions
Our results indicate that both the multiple-choice and free-report versions of the RMET differentiated autistic and non-autistic adults irrespective of verbal ability. However, given its ease of administration, the multiple-choice format offers clear practical advantages over the free-report format.
Supplemental Material
Supplemental material, sj-docx-1-aut-10.1177_13623613231167226 for Response format changes the reading the mind in the eyes test performance of autistic and non-autistic adults by Alliyza Lim, Neil Brewer, Denise Aistrope and Robyn L Young in Autism
Footnotes
Author contributions
N.B. and R.L.Y. developed the study concept and design. A.L. and D.A. collected the data. A.L. analyzed the data under the guidance of N.B. and wrote the original draft. N.B. and R.L.Y. provided critical manuscript revisions. All authors approved the final version of the paper for submission.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Research supported by ARC DP 190100162 and the Hamish Ramsay Fund.
Supplemental material
Supplemental material for this article is available online.
References
