Abstract
People with autism spectrum disorders (ASDs) have difficulty with socio-emotional functioning; however, research on facial emotion recognition (FER) remains inconclusive. Individuals with ASD might be using atypical compensatory mechanisms that are exhausted in more complex tasks. This study compared response accuracy and speed on a forced-choice FER task using neutral, happy, sad, disgust, anger, fear and surprise expressions under both timed and non-timed conditions in children with and without ASD (n = 18). The results showed that emotion recognition accuracy was comparable in the two groups in the non-timed condition. However, in the timed condition, children with ASD were less accurate in identifying anger and surprise compared to children without ASD. This suggests that people with ASD have atypical processing of anger and surprise that might become challenged under time pressure. Understanding these atypical processes, and the environmental factors that challenge them, could be beneficial in supporting socio-emotional functioning in people ASD.
Introduction
Among the core diagnostic criteria comprising autism spectrum disorders (ASDs) are difficulties with social communication and interaction, difficulties with socio-emotional reciprocity, non-verbal communication and developing and maintaining interpersonal relationships (American Psychiatric Association, 2013). Although difficulty with facial emotion recognition (FER) is not a required Diagnostic and Statistical Manual of Mental Disorders (DSM-5) criterion, it is central skill for successful social interaction and is often listed as a diagnostic marker of ASD (Uljarevic & Hamilton, 2013). However, the nature or even the existence of such an ‘impairment’ is debated. Although impaired processing and recognition of facial emotions have often been reported in people with autism (Capps et al., 1992; Griffiths et al., 2019; Hobson et al., 1986; Weeks & Hobson, 1987; Yeung et al., 2014), some studies of ASD have not found difficulties with these skills (Ozonoff et al., 1991; Ponnet et al., 2004; Rump et al., 2009; Tracy et al., 2011) or have reported only partial difficulties with specific negative emotions (Ashwin et al., 2006; Enticott et al., 2014). In a meta-analysis, Uljarevic and Hamilton (Uljarevic & Hamilton, 2013) reviewed 48 studies, reporting that when happiness was used as a baseline, only the recognition of fear was impaired in those with ASD, and even that difference disappeared after Bonferroni correction. In an online study, Ola and Gullon-Scott (Ola & Gullon-Scott, 2020) examined female adults with ASD and found that FER difficulties were related to co-occurring alexithymia rather than ASD itself.
One of the main factors researchers have suggested contributes to the discrepancy in findings about FER impairments in ASD are experimental task differences (Harms et al., 2010). Diverse experimental tasks have been used in the literature – for example a delayed matching task (Celani et al., 1999), labelling faces (Yeung et al., 2020), morphing the six basic emotions (Bal et al., 2010), mental rotation (Nagy et al., 2018; Tantam et al., 1989) or a forced choice between two possibilities (Spezio et al., 2007). Matching tasks may allow people with ASD to use compensatory strategies (Hariri et al., 2000; Teunisse & de Gelder, 2001), forced-choice labelling tasks may allow for guessing (Baron-Cohen et al., 1997) and free labelling tasks have proven to be affected by language difficulties in ASD rather than FER per se (Uljarevic & Hamilton, 2013). Although they have received less research, the length of stimulus exposure and processing times have also revealed diverse findings. Children, adolescents and adults with ASD demonstrated longer reaction times (RTs) in FER tasks (Bal et al., 2010; Dalton et al., 2005; Sucksmith et al., 2013), whereas Tracy et al. (Tracy et al., 2011) found that children with ASD had comparably rapid RTs to children without ASD. When Clark et al. (Clark et al., 2008) used very brief time durations (15 and 30 ms) to prevent participants from using compensatory cognitive mechanisms, at 15 ms, all of the groups performed equally around the chance level across tasks. However, participants with ASD recognised emotional expressions less accurately than the participants without ASD when the stimuli were presented for 30 ms. When Celani et al. (Celani et al., 1999) extended this time to 750 ms, children with ASD were less accurate at recognising facial emotions compared to children without ASD and children with Down’s syndrome. When Rump et al. (Rump et al., 2009) used 500 ms exposure of video recordings of facial expressions, only the youngest children (5–7 years) and the adults with ASD performed poorly compared to those without ASD. In contrast, other studies reported that a decreased presentation length of visual stimuli affected autistic and non-autistic populations in a similar manner. Tracy et al. (Tracy et al., 2011) presented facial expressions of anger, contempt, disgust, fear, happiness, pride, sadness and surprise for 1,500 ms and asked participants to respond within this time frame, and found no group differences in emotion recognition accuracy or speed. When only responses within the first 800 ms were considered, the performance of the two groups remained comparable. The diverse findings in FER in ASD may be due to methodological diversity – that is, in certain situations, individuals with ASD are less able to use atypical compensatory mechanisms in more complex experimental tasks (Grossman & Tager-Flusberg, 2008; Grossman et al., 2000) and under time pressure (Celani et al., 1999; Clark et al., 2008; Rump et al., 2009). People with ASD were found to use explicit and more effortful compensatory mechanisms in FER tasks compared to people without ASD who use faster, implicit mechanisms instead (Harms et al., 2010). Such compensatory mechanisms include following explicit rules to associate certain facial features with emotions, such as a frown with anger, when determining a facial expression (Rutherford & McIntosh, 2007; Teunisse & de Gelder, 2001). Combined ERP and eye-tracking revealed an experience-dependent specialisation for face processing that results in an automatic and efficient neural network in children without ASD (Meaux et al., 2014). Due to differences in perceptual processes, it was suggested that children with ASD spend less time attending to socially relevant information, possibly resulting in atypical processing; this deficit becomes further compromised with age (Chawarska & Shic, 2009). Price and Friston (Price & Friston, 2002) suggested that decreased activity in brain regions responsible for socio-emotional functions is compensated by an increase in activity of other regions that could fulfil some of the functions using alternative pathways. In the case of ASD, a decreased connectivity between the posterior cingulate and precuneus areas (Di Martino et al., 2014) was found to be compensated by enhanced activities in the ventromedial prefrontal and anterior temporal areas within the social brain (Subbaraju et al., 2018). Such explicit compensatory mechanisms may be responsible for the mixed results in the above FER tasks with people with ASD when compared to those without ASD. One way to detect possible compensation strategies in experiments is to use a time limit that is sufficient for template-based implicit processing in people without ASD but might be insufficient for alternative compensatory mechanisms in ASD. The present study explored whether longer stimulus presentation times allow autistic individuals to use compensatory mechanisms in emotion recognition tasks and if time pressure resulted in decreased performance.
The study compared performances on a forced-choice FER task using the six basic emotions (Ekman et al., 1983) both under timed and non-timed conditions in children with ASD (ASD) and without ASD (NA). It hypothesised that children with ASD would underperform compared to those without ASD in the timed condition, but that their emotion recognition performance would be less affected in the non-timed condition. Performance was measured by the accuracy of the children’s responses and the speed of these responses (measured by RTs).
Methods
Participants
Eighteen participants took part in the study, aged between 12 and 17 years old. Nine of these participants were in the ASD group, officially diagnosed with Autistic Disorder by the National Health Service. The data were collected in 2012 and 2013 and all the children were diagnosed using the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV-TR) (4th Edition, revised text version) (American Psychiatric Association, 2000). All the children with autism attended special autism units in mainstream schools and were verbal. Nine children were in the control or non-ASD group (NA). The mean age of ASD group was 179 months, SD = 21.83 (age range: 155–209 months), eight males, one female and the mean age of NA groups was 188 months SD = 20.66 (age range: 151–212 months) four males and five females. Participants were all pupils at Monifieth High School, a mainstream school. They were recruited through distribution of consent forms to parents via the Headteacher. Approval for this study was applied for and approved by the University of Dundee’s, University Research Ethics Committee (UREC) and the Education Department of Angus Council. Monifieth High School also consented to the study. Signed informed consent was required from parents and children before participation in the study.
Materials
The Autism Screener Questionnaire (ASQ) (Berument et al., 1999) was used in order to confirm the diagnosis of autism in the two groups.
The FER task used photos taken from the Pictures of Facial Affect set (Ekman & Friesen, 1975). The current study used 66 photos from this set of 10 actors (five males and five females) each displaying a neutral expression as well as the six basic emotions of happiness, sadness, fear, anger, surprise and disgust (see the ID numbers of each picture in Table A1 of the Appendix). In order to prevent participants recognising the picture from other non-facial features, each picture was edited using ADOBE Photoshop 7.0 software. This removed hair, jewellery and background, leaving only the face visible (see Figure 1 for an example of the stimuli before and after editing). The FER experiment was conducted using SuperLab Pro (version 2.0) software run on a 15-inch Toshiba laptop. Verbal and Visuo-spatial ability was measure using the Vocabulary and the Block Design subtests from the Wechsler Intelligence Scale for Children (Wechsler, 1991).

Example of stimuli before and after editing. Unedited angry woman (left) and edited angry woman (right).
Procedure
Before testing could commence, the parents of the children returned the ASQ and a signed consent form in a sealed envelope. After parental consent, children were also required to sign the consent form before beginning the task. The order of the study was: the FER task, the Block Design subtest and then the Vocabulary subtest.
The Facial Emotion Recognition Task
The FER task had two parts, a timed and non-timed experiment. In the timed experiment participants had 1,200 ms to respond whereas in the non-timed experiment participants had no time limit to respond within. The same procedure was applied in both the timed and the non-timed experiments.
The experiment was broken down into four conditions; sex (male and female), ‘easy’ (happy and sad), ‘difficult 1’ (fear and surprise) and ‘difficult 2’ (anger and disgust) (see Figure 2 for illustration of the faces). In each condition, one stimulus was presented at a time in a forced-choice task. Participants were asked to decide which type of stimulus was displayed out of two possible options. Each condition was administered in four blocks and the key responses were randomised and counterbalanced for each emotion. There were 80 stimuli in each condition and 20 stimuli in each block. Over the two experiments (timed/non-timed) there were 640 stimuli. This is illustrated in Figure 3 (see Figure 3). The order of the conditions was randomised across participants. The order in which pictures appeared in each block was randomised by the Superlab Pro software.

Examples of the eight different stimuli used in their pairings. From top left to bottom right: male; female; happy; sad; fear; surprise; anger; disgust.

Diagram showing structure of experiments, conditions and stimuli used in both the timed and non-timed experiments.
Before each block of trials, instructions appeared on the screen to inform the participant of which key (‘l’ covered with blue sticker; ‘s’ covered with red sticker) corresponded to which emotion.
For the timed experiment, stimuli were presented on screen only for 1,200 ms, or if shorter, until the participant responded. For the non-timed experiment stimuli remained on the screen until the participant responded.
The responses and time taken (milliseconds) to respond were recorded for each condition. The FER task began with a short practice test encompassing examples from all conditions so that children familiarised themselves with the task. The practice session contained four example pictures from each block, resulting in 16 practice pictures in total.
The order of experiments – timed/non-timed – was randomised across participants.
In total, the two experiments and the block design and vocabulary subtests lasted for approximately 35 minutes. During this time, children were able to take short breaks in between the blocks and tasks if they wished.
Results
The two groups were comparable in terms of age, Vocabulary and Block Design performances (see Table 1).
Means and (standard deviations) for age, block design and vocabulary subtest scaled scores for the ASD and NA groups.
Abbreviations: ASD = autism spectrum disorders; ASQ = Autism Screener Questionnaire; NA = non-ASD.
An 8 × 2 mixed design analysis of variance was conducted on the effect of the nature of the stimuli (emotions: man, woman, happy, sad, fear, surprise, anger, disgust) and the groups ‘Group’ (children with autism spectrum disorders: ASD, and children without: NA) on the number of the correct responses. When the Mauchly’s test of sphericity was significant, the Greenhouser-Geisser correction was used to interpret the results. When significant emotion*group interaction was found, post-hoc pairwise comparisons were used with Bonferroni corrections. p < .05 was accepted as significant throughout.
Timed Condition
Correct Responses
There was a significant main effect of emotions, F (3.77, 60.5) = 37.8, p < .01,
Post-hoc pairwise comparison using Bonferroni correction showed that children with ASD had a significantly lower number of correct responses for anger and surprise compared to children without ASD. The other comparisons were non-significant, although, on a tendency level (p < .06), children with ASD also had a lower mean of correct recognition of the ‘Woman’ faces compared to children without ASD (see Table 2). Given the small sample sizes, boxplots 1–8 (see Figure 4) show the possible outliers for each emotion in the two groups. Inspection of the boxplots indicates potential outliers in the ASD group in the man and the anger emotions. Given the expected variability in the ASD group, these data have been kept in the analysis.

Boxplot diagrams of the data in the eight conditions in the two groups in the timed experiment, percentage of correct responses.
Mean percentages of correct responses (mean) and standard deviations (SD) based on type of emotion and group in the timed condition.
Abbreviations: ASD = autism spectrum disorders; NA = non-ASD.
Response Times
There was a significant main effect of emotion, F (2.56, 41.01) = 17.138, p < .001,
Mean reaction times (mean) and standard deviations (SD) based on type of emotion and group in the timed experiment.
Abbreviations: ASD = autism spectrum disorders; NA = non-ASD.
Non-Timed Condition
Correct Responses
There was a significant main effect of emotion, F (3.89, 62.16) = 11.17, p < .001,
Mean percentage of correct responses (mean) and standard deviations (SD) based on type of emotion and group in the non-timed experiment.
Abbreviations: ASD = autism spectrum disorders; NA = non-ASD.
Response Times
There was a significant main effect of emotion, F (1.66, 26.48) = 14.16, p < .001,
Mean reaction times (mean) and standard deviations (SD) based on type of emotion and group in the non-timed experiment.
Abbreviations: ASD = autism spectrum disorders; NA = non-ASD.
Discussion
This study investigated whether recognition of six basic emotions was affected by task difficulty – namely, time pressure. It hypothesised that children with and without ASD would perform comparably when there was no time pressure for their responses. As predicted, the accuracy of emotion recognition was comparable in the two groups in the non-timed condition. In the timed condition, however, children with ASD were less accurate in identifying anger and surprise compared to children without ASD. These results could partially reconcile studies with contradictory outcomes regarding emotion recognition in children with ASD. Further, when the groups were matched in terms of age and IQ, differences in emotion recognition were undetectable. Finally, difficulties in the accuracy of emotion recognition only appeared when children with ASD were under time pressure. It was suggested that instead of the theory that those with ASD cannot recognise facial emotions, it might be that they use ‘atypical’ mechanisms to achieve FER (Hobson, 1986; Homer & Rutherford, 2008). While performance on FER tasks might be comparable, people with ASD displayed increased activity in the anterior cingulate cortex (ACC) and precuneus and decreased activity in the amygdala and fusiform gyrus when performing emotion recognition tasks compared to non-autistic controls (Ashwin et al., 2007; Wang et al., 2004). The ACC and precuneus are related to self-monitoring and attentional load, indicating that emotion recognition tasks might be more cognitively effortful for people with ASD compared to people without ASD (Harms et al., 2010) – when task demands increase, performance diminishes. When Clark et al. (Clark et al., 2008) restricted stimulus presentation to 30 ms, they observed a marked emotion recognition deficit in adults with ASD. However, when Tracy et al. (Tracy et al., 2011) used a 1,500 ms time limit for stimulus presentation, there were no group differences. The current study used a 1,200 ms limit for stimulus presentation and response; it found that children with ASD only decreased accuracy for anger and surprise reactions compared to children without ASD.
Several studies have reported an overall decreased performance in recognition of negative emotions such as anger, fear and other ‘cognitive’ emotions in children with ASD (Ashwin et al., 2006; Baron-Cohen et al., 1993; Humphreys et al., 2007; Smith et al., 2010; Taylor et al., 2015). The present study’s lack of differences in accuracy under no time pressure supports those studies in which children with and without ASD recognised emotions equally well (Jones et al., 2011; Loveland et al., 2008). This study refines this previous work, reconciling those that found (vs. those that did not find) differences in FER.
Atypical processing of anger was reported by several studies (Ashwin et al., 2007; Lindner & Rosén, 2006). A recent study by Leung et al. (Leung et al., 2019) found no group differences with RTs in recognising angry and happy faces, but the neural responses using magnetoencephalography (MEG) showed otherwise. Children without ASD showed greater right middle temporal and superior temporal activation in response to angry faces than to happy faces, but children with ASD showed greater fronto-temporal activation in response to happy faces than to angry faces. Additionally, the posterior cingulate and thalamus were less active in response to both angry and happy faces in children with ASD compared to children without ASD. Leung et al. (Leung et al., 2019), who studied children 7–10 years old, speculated that the results might reflect a maturational delay (Lindner & Rosén, 2006) in children with ASD and that the recognition of anger improves later in adolescence. Although our sample size is very small, the children were 12–17 years of age and the recognition of anger in the ASD group was still lagging behind the performance of children without ASD, making the above maturational theory less plausible. In a study by Malaia et al. (Malaia et al., 2019), children and adolescents with ASD failed to show the early distinctive neural responses to anger and fear that are typical to those without ASD, and Wright et al. (Wright et al., 2012) found a decrease of gamma power for anger, disgust and sadness in the left supramarginal and left precentral gyrus in ASD, while people without ASD showed increases of gamma power. Such results support an atypical neural processing hypothesis in emotion processing in ASD. In our study, employing time pressure might have challenged the atypical, less efficient neural processes in ASD, resulting in less accuracy in responses to angry faces in the timed experiment, but no difference in the recognition in the non-timed experiment.
It has been suggested that the recognition of surprise, a cognitive emotion that is related to understanding other people’s beliefs and intentions, is more difficult for people with ASD (Baron-Cohen et al., 1993; Gross, 2004; Smith et al., 2010), although many studies have found no evidence for such impairment (Humphreys et al., 2007; Uljarevic & Hamilton, 2013). Nagy et al. (Nagy et al., 2018) even found a faster recognition of surprise in ASD. The study, however, used rotated images, thus the advantage in the recognition was likely related to a faster mental rotation of surprised faces in ASD (Nagy et al., 2018).
Jones et al. (Jones et al., 2011), using facial and vocal stimuli, found evidence for an impaired recognition of surprise and suggested that as children with ASD typically respond to surprising events with anxiety, parents and caretakers might avoid exposing them to surprise (Jones et al., 2011). Whether children with autism use atypical ways (Hernandez et al., 2009; Pelphrey et al., 2002) or just reduced time to explore socially relevant stimuli and the decreased recognition of surprise is a result of the lack of experience with that emotion is still not known. It is possible that atypical mechanisms in the looking patterns of people with ASD are related to the fact that they spend less time exploring the relevant stimuli, but the way they are processing them is comparable to people without ASD (Pelphrey et al., 2002; Van Der Geest et al., 2002). Neuroimaging studies, using functional magnetic resonance imaging (fMRI), however, have shown atypical activation in the amygdala and fusiform gyrus in ASD during FER (Ashwin et al., 2007; Critchley et al., 2000). There is growing evidence for an atypical functionality in the posterior cingulate cortex and ventromedial prefrontal cortex in ASD (Lau et al., 2020), and the disruption of their connectivity is related to a cognitive inflexibility, a central feature of ASD (Watanabe et al., 2019). The posterior cingulate cortex also mediates between visual and emotional processing (Vogt et al., 2006) and its disruption results in difficulties in children with ASD in socio-emotional processes.
A major limitation is the very small sample in the study; recruitment sources to reach children with ASD in Angus, Scotland were limited. Even with the small sample, the effect sizes for the main effects and the interactions were moderate to large; according to Cohen (1988), however, the tendency results (p < .10) could indicate that some of the effects might have been missed. It is also important to note that only the ASQ (Berument et al., 1999) was used to confirm group membership. While all children with ASD had confirmed diagnoses as a basis of their admission to the special autism unit from which they were recruited and their teachers confirmed the diagnoses prior to the testing, the experimenters had no direct access to the clinical notes on the diagnoses. Future studies could utilise diagnostic tools, including the Autism Diagnostic Observation Schedule-Generic (ADOS-G); (Lord et al., 2000) and the Autism Diagnostic Interview-Revised (Lord et al., 1994). Future studies could include full IQ testing to ensure that no IQ-related differences contribute to outcome measures. The use of still pictures in a forced-choice computerised task in a laboratory environment may pose limitations. Loveland (Loveland, 1991) suggested that people with ASD may have difficulties not with emotion recognition (Loveland et al., 1997) but with perceiving and understanding the significance or ‘affordances’ of their social environment (Gibson, 1979), thus the use of ecologically valid facial-emotional stimuli would lead to more robust results according to the impaired social affordances hypothesis. Further studies using neuroimaging methods and using both time pressure and non-timed conditions could provide a more direct link between atypical processing and socio-emotional behavioural difficulties in children with ASD. Moreover, it is important to note that people with ASD are a heterogeneous population, with no unifying theory to account for the symptoms in the symptom groups as outlined by the DSM-5 (American Psychiatric Association, 2013; Hassan & Mokhtar, 2019). It means that the underlying mechanisms of any social difficulties might be variable, and the conclusions drawn from a small sample study should be regarded as exploratory.
In summary, the results of the study suggest that people with ASD may have atypical processing of facial emotions of anger and surprise and these processes might become challenged under time pressure. Understanding these atypical processes and the environmental factors that challenge them could be beneficial in supporting socio-emotional functioning in people ASD.
Footnotes
Acknowledgements
The authors sincerely thank all the pupils, parents and the staff at Monifieth High, in particular Bev Dobson, for the help in years 2012–2013.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship and/or publication of this article.
Appendix
See Table A1.
