Sage Journals: Discover world-class research

Abstract

Current challenges in early identification of autism spectrum disorder lead to significant delays in starting interventions, thereby compromising outcomes. Digital tools can potentially address this barrier as they are accessible, can measure autism-relevant phenotypes and can be administered in children’s natural environments by non-specialists. The purpose of this systematic review is to identify and characterise potentially scalable digital tools for direct assessment of autism spectrum disorder risk in early childhood. In total, 51,953 titles, 6884 abstracts and 567 full-text articles from four databases were screened using predefined criteria. Of these, 38 met inclusion criteria. Tasks are presented on both portable and non-portable technologies, typically by researchers in laboratory or clinic settings. Gamified tasks, virtual-reality platforms and automated analysis of video or audio recordings of children’s behaviours and speech are used to assess autism spectrum disorder risk. Tasks tapping social communication/interaction and motor domains most reliably discriminate between autism spectrum disorder and typically developing groups. Digital tools employing objective data collection and analysis methods hold immense potential for early identification of autism spectrum disorder risk. Next steps should be to further validate these tools, evaluate their generalisability outside laboratory or clinic settings, and standardise derived measures across tasks. Furthermore, stakeholders from underserved communities should be involved in the research and development process.

Lay abstract

The challenge of finding autistic children, and finding them early enough to make a difference for them and their families, becomes all the greater in parts of the world where human and material resources are in short supply. Poverty of resources delays interventions, translating into a poverty of outcomes. Digital tools carry potential to lessen this delay because they can be administered by non-specialists in children’s homes, schools or other everyday environments, they can measure a wide range of autistic behaviours objectively and they can automate analysis without requiring an expert in computers or statistics. This literature review aimed to identify and describe digital tools for screening children who may be at risk for autism. These tools are predominantly at the ‘proof-of-concept’ stage. Both portable (laptops, mobile phones, smart toys) and fixed (desktop computers, virtual-reality platforms) technologies are used to present computerised games, or to record children’s behaviours or speech. Computerised analysis of children’s interactions with these technologies differentiates children with and without autism, with promising results. Tasks assessing social responses and hand and body movements are the most reliable in distinguishing autistic from typically developing children. Such digital tools hold immense potential for early identification of autism spectrum disorder risk at a large scale. Next steps should be to further validate these tools and to evaluate their applicability in a variety of settings. Crucially, stakeholders from underserved communities globally must be involved in this research, lest it fail to capture the issues that these stakeholders are facing.

Keywords

ASD assessments computer digital gamified low-resource mHealth scalable smartphone tablet virtual reality

Introduction

Autism spectrum disorder (ASD), which affects 1 in 132 people globally with little regional variation (Baxter et al., 2015), is characterised by persistent difficulties in social communication and behavioural flexibility (American Psychiatric Association, 2013). ASD is often comorbid with epilepsy, gastrointestinal disorders, sleep disorders and other neurodevelopmental disorders or conditions, such as intellectual disability and attention-deficit hyperactivity disorder (Doshi-Velez et al., 2014; Levy et al., 2010). Fine and gross motor atypicalities and sensory sensitivity are also commonly observed in individuals with ASD (American Psychiatric Association, 2013).

Early childhood, as a period of rapid brain development, presents great opportunity and risk in shaping the developmental potential of all children, including those with neurodevelopmental disorders, such as ASD (Black et al., 2017). Early detection of ASD and intervention when the brain is most plastic lead to the best outcomes (Estes et al., 2015; Flanagan et al., 2012; Kasari et al., 2012). However, current challenges in diagnosing ASD in low-resource settings lead to significant delays in detection, and therefore in triaging to appropriate interventions (Mukherjee et al., 2014; Patra et al., 2020). For example, inadequate parental and community awareness about the red flags of autism compromise help-seeking behaviours (Divan et al., 2021). The available diagnostic tools demand administration by skilled and trained specialists, a scarce resource in most settings (Durkin et al., 2015). Moreover, these specialists are concentrated in urban areas or expensive private clinics inaccessible to the large majority of the population. Standardised assessment methods for ASD are lengthy, proprietary, globally priced and therefore not feasible for large-scale deployment (Durkin et al., 2015). However, the more scalable autism screening measures that depend on parent-report questionnaires are often unreliable, as they assume parental knowledge about autism symptoms is often lacking in communities with low maternal education and limited awareness about child development (Dawson & Sapiro, 2019; Khowaja et al., 2015). All these factors contribute to a failure in timely identification of children with autism, resulting in a large ‘detection gap’ (Dasgupta et al., 2016), with consequent delays in receiving a diagnosis and being placed on appropriate care pathways (Bhavnani et al., 2022).

Therefore, there is a critical need to develop scalable tools for autism risk assessment in the early years to leverage into improved outcomes throughout the life course. Digital tools have tremendous potential to address the scalability issue as portable computers and smart devices are now highly accessible across the globe, even in low-resource settings (Istepanian & AlAnzi, 2020). Over 5 billion people, representing more than two-thirds of the global population, have access to smart phones (World Health Organization Global Observatory for eHealth, 2011). The potential for these mHealth tools to be administered in children’s natural environments, such as homes and schools (Sapiro et al., 2019), and reports generated through automated analysis of objective and high-resolution dimensional data make them feasible for administration by non-specialist providers, including parents. This natural environmental setting also garners more representative behavioural observations. By leveraging the multitude of sensors, such as cameras, audio recorders and touch-sensitive displays, digital tools can measure a wide range of autism-relevant phenotypes, including differences in social-emotional, motor and language skills, helping to capture the heterogeneity of the autism phenotype and providing a comprehensive view of the child’s strengths and weaknesses. Alongside clinical practice, this potential for task-sharing for ASD risk screening (Naslund et al., 2019) protects the time and efforts of highly skilled specialists towards diagnosis and treatment of the small fraction who screen positive. Finally, direct assessment of child behaviour through performance-based tasks picks up quantitative information complementary to parent reports that depend on awareness about autism-related behaviours (Dawson & Sapiro, 2019).

Recent reviews have summarised the evidence on the use of digital tools for autism assessment based on parent-report questionnaires (Marlow et al., 2019; Stewart & Lee, 2017), and the more technologically challenging eye-tracking (Alcañiz et al., 2022; Mastergeorge et al., 2021; Papagiannopoulou et al., 2014), electroencephalography (O’Reilly et al., 2017) and magnetic resonance imaging (Sato & Uono, 2019) methods. However, these tools are not ideal for screening in low-resource settings either because of their dependence on parent reports which may be unreliable, the requirement for expensive equipment and software typically administered in controlled laboratories or the need for high levels of manual input and expertise in analysing the data. In contrast, digital tasks administered using more accessible and portable devices, such as computers, tablets and smartphones, and amenable to automated analysis of child responses and behaviours, have a much greater potential to scale since they are suitable for task-sharing approaches (Naslund et al., 2019). However, a comprehensive review of the characteristics and utility of scalable digital tools for direct assessment of autism risk during early childhood is critically missing. This omission is especially significant in terms of their potential to be further developed into valid screening tools deployable at scale in low-resource settings.

This review attempts to bridge this gap by addressing the following questions:

What types of digital tasks are being used for direct assessment of autism risk during early childhood, and which diagnostic (DSM-5) criteria and specific ASD-related phenotype do they target?

How well are these tools (and specific metrics derived therefrom) able to discriminate between ASD and typically developing (TD) groups in case–control studies?

What are the implementation strategies of these tools in relation to hardware and configuration, passive or active task, personnel and time taken for administration?

Methods

Search

While this review focuses on scalable digital tools to assess autism risk during early childhood (0–8 years), it is based on a subset of papers identified from a more comprehensive search of peer-reviewed articles describing scalable digital tools for assessment of autism and attention deficit hyperactivity disorder across 0–18 years. Four databases (PubMed, PsycInfo, Scopus and Web of Science) were searched in two phases to retrieve relevant articles. During the first phase conducted in May 2018, no date restrictions were applied. The second phase, specific to this review topic, updated the original search by including relevant articles published from June 2018 through October 2020. Specific keywords used for Phases 1 and 2 are presented in Supplementary Table 1.

Study selection and data extraction

Search results from selected databases were imported into the Rayyan software (https://rayyan.ai/) (Ouzzani et al., 2016). Titles and abstracts of the imported articles were screened by three reviewers (D.M., V.R. and J.D.) during Phase 1 and two reviewers during Phase 2 (D.M. and G.L.E.) using the inclusion/exclusion criteria described below. Screening results were ‘unblinded’ for group review weekly, and conflicts were resolved through group consensus. Full texts of included articles were downloaded and screened for eligibility. Data were extracted from included articles.

Eligibility criteria

Scalable digital tools were defined as those that collected and analysed data in a digital format using desktop or mobile devices (laptop, tablets, smartphones or any other mobile smart device). Included studies either required the child to engage actively with tasks presented on the device or used the device to acquire data from the child passively (e.g. via voice or video recording).

The inclusion criteria were (a) peer-reviewed primary research articles published in the English language; (b) case–control study design with at least two groups – ASD and TD comparison group (papers with additional atypical comparison groups, such as neurodevelopmental disorders other than ASD, were included) and (c) mean age of the participant groups ⩽ 8 years (defined as early childhood by the World Health Organization, 2020). The exclusion criteria were (a) digital tools that collected only parent-report data since this review focused on digital tools for direct child assessment; (b) tools that required manual coding of child behaviour post data collection since this method is time-consuming and subjective, therefore unlikely to scale in low-resource settings with limited numbers of trained specialists and (c) studies that only reported the acceptability and feasibility testing of the tool or used a small sample (N < 5 per group) since one of the primary objectives of this review was to evaluate the discriminative ability of these novel tools for early identification of ASD risk.

Analysis

For each included study, data were tabulated to describe the task(s) presented to the child, the experimental setup, device(s) used and the format in which the child’s response was recorded. The primary metric(s) used to determine group differences also were tabulated, along with the main findings (Table 1). A brief description of the participants (mean and standard deviation or range of the age distribution, sample size and gender distribution) was included in the table. Papers were grouped based on the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) diagnostic criteria they covered, along with a mention of the specific developmental domain/phenotype assessed and the country in which the research was based. A more detailed description of the study groups and methods is presented in Supplementary Table 2 (demographic details, inclusion/exclusion criteria, standardised tools used to diagnose children with autism, level of functioning of the participants and sample size of children recruited versus completing the tasks, along with reasons for loss to follow-up).

Table 1.

Characteristics and discriminating ability of scalable digital tools to assess ASD risk during early childhood.

Citation, specific measure	Participant details^a (Mean age, sample size, gender distribution)	Device specifications	Experimental setup	Primary metric (s) and summary results
DSM-5 criteria: Social communication/Social interaction
Anzulewicz et al. (2016) Specific measure:‘Sharing’ and fine-motor drawing movementsCountry: United Kingdom	1. ASD group^a M_age (SD) = 53 (11) months, N = 35, %male = 67.572. TD groupM_age (SD) = 55 (11) months, N = 45, %male = 71.11	Tablet computer (iPad mini) running standard iOS version 7.0	Task description: Two commercially available computerised games.Child response: Game 1 (Sharing): Slice a piece of food by tapping on it and distributing it evenly among four characters. Characters expressed positive/negative emotions based on equal/unequal distribution of food.Game 2 (Creativity): Trace a picture followed by colouring using finger motions.#trials: 1Setting: Not specifiedDuration: 5 min	In total, 262 features from touch data (gestures on screen) and tablet’s inertial movement sensors acquired.Machine learning (ML) using touch and sensor data to predict child’s diagnostic group (TD vs ASD).Significant differences in the following features:1. Impact Force and Gesture Pressure: ASD > TD2. Distribution of forces into the device: (gyroscope data) patterns of force distribution different3. Mean gesture velocity: ASD > TD4. Mean area occupied by a gesture: ASD > TD5. Gestures in distal parts of the screen: ASD > TD6. Minimum duration of a screen tap: ASD < TD7. ML prediction accuracy: Max AUC = 0.93, sensitivity = 0.83, specificity = 0.85
Ruta et al. (2017) Specific measure:Social preferenceCountry: Italy	1. ASD groupM_age (SD) = 39.9 (11.5) months, N = 21, %male = 85.72. TD groupM_age (SD) = 45.5 (10.7) months, N = 37, %male = 48.6	Tablet computer (iPad)	Task description: Deliberate choice task with two pictures – non-social (toy-train) and social (smiling face) hidden under buttons. Button position was randomised in each trial. Control trials comprised scrambled images of the same pictures.Child response: Tap on button to show picture of choice.#trials: 8 per conditionSetting: Laboratory/HomeDuration: 5 min	1. Proportion of button taps to access social image: Significant difference (ASD < TD)No difference in control trials (scrambled images)
Chetcuti et al. (2019)Specific measure:Social imitation of simple versus complex motor tasksCountry: Australia	1. ASD groupM_age (SD) = 41.77 (8.5) months, N = 35, %male = 802. TD groupM_age (SD) = 44.7 (13.64) months, N = 20, %male = 65	Tablet computer (iPad) with iPad application Slide & Spin	Task description: Four on-screen targets presented on the screen, manipulable using tap, drag, swipe and rotate actions.Child response: Imitate five-action sequence under four conditions (2 social × 2 motor complexity). Social comprised a socially responsive versus aloof instructor. Motor complexity involved low (five consecutive taps) versus high (multiple motor actions) complexity sequences.#trials: 1 per conditionSetting: Laboratory/HomeDuration: Not specified	1. Number of correct imitations by condition: Significant difference in high-motor demand task (ASD < TD). No difference in low-motor demand task. No difference based on social condition.
Carlsson et al. (2018) Specific measure:False-belief understandingCountry: Sweden	1. ASD group^a M_age (SD) = 91.19 (10.8) months, N = 52, %male = 79.412. TD groupM_age (SD) = 89.99 (10.8) months, N = 98, %male = 51.02	Tablet computer (specifications not provided)	Task description: Watch a short film adapted from Sally–Anne task. Facilitating effect of language support assessed using three auditory conditions: (1) narrative; (2) silent and (3) interference.Child response: Respond to questions by tapping on one of two yellow circles (correct and incorrect ROI). Data saved in the device.#trials: 2 per conditionSetting: Clinic (ASD); Quiet school room (TD)Duration: 3–4 min	1. Task completion: 100% in TD group, 75% in ASD group2. Accuracy (two successful trials per condition): Significant difference (ASD < TD) in narrative and silent conditions; same trend (but not significant) in interference condition
Jones et al. (2018) Specific measure:Statistical learningCountry: USA	1. ASD group^a M_age (SD) = 64.67 (16.07) months, N = 56, %male = 80.362. TD groupM_age (SD) = 60.11 (16.19) months, N = 68, %male = 55.88	Tablet computer (iPad)	Task description: Statistical learning task: sequence of two images (cue followed by target/distractor) presented for 2 s each. Cues were either high frequency (HF) or low frequency (LF), indicating likeliness of the next image to be a target or distractor.Child response: Tap on the target image, avoid distractor image.#trials: 7 runs of 24 trials each; 84 trials preceded by LF cue and HF cue, respectively.Setting: LaboratoryDuration: Not specified	1. Accuracy: No difference.2. Reaction time: Significant difference (ASD > TD). Unique RT patterns in TD (quadratic pattern in LF but not HF) versus ASD (linear pattern in both conditions).3. Bayes classification to determine degree to which ASD child’s RT pattern was similar to TD group: ASD children with less severe autism symptoms (Social Responsiveness Scale-2) had similar learning profile to TD group.
Campbell et al. (2019) Specific measure:Social attention and response to nameCountry: USA	1. ASD groupM_age (SD) = 26.19 (4.07) months, N = 22, %male = 77.272. TD groupM_age (SD) = 21.91 (3.78) months, N = 82, %male = 58.54. Includes eight children with a diagnosis of language or developmental delay sufficient to qualify for speech or developmental therapy	Tablet computer (iPad)	Task description: Set of developmentally appropriate videos presented on the screen (cascading bubbles, mechanical bunny, animal puppets interacting with each other).During three of the videos, ‘Call Name’ appeared on the screen. The examiner, standing behind the child, called the child’s name loudly.Child response: The front camera on the tablet recorded the child’s video in response to name being called. ML algorithm automatically tracked head position.#trials: 3 prompts to call child’s nameSetting: LaboratoryDuration: 5 min	1. Task engagement (time looking at videos or people in the room): Significant difference (ASD < TD)2. Orienting to name:a. At least once across three trials: No differenceb. Multiple times across three trials: Significant difference (ASD < TD)c. Latency to orient to name: Significant difference (ASD > TD)
Gale et al. (2019) Specific measure:Social preferenceCountry: Norway	1. ASD groupM_age (SD) = 59.3 (18.3) months, N = 27, %male = 77.82. TD groupM_age (SD) = 34.5 (12.3) months, N = 40, %male = 55	Tablet computer (Samsung Galaxy)	Task description: Study 1 and 2: Blurred videos of social (faces of people and dogs) and non-social (abstract moving geometric patterns) stimuli presented side-by-side. Study 3: Any one stimulus presented to assess reinforcement strength.Child response: Study 1 and 2: Tap on (blurred) video of choice. Chosen video grew larger and became clearly visible for 2 s. Study 3: Tap on stimuli (social or non-social) multiple times (progressively increasing – 2, 4, 6 times across trials – to view video clearly).#trials: 8 sessionsSetting: child’s home, nursery or clinicDuration: 90 s for each session	1. Study 1 and 2: Proportion of taps on social videos: Significant difference (ASD < TD)2. Study 3 (reinforcement strength):a. Proportion of taps to access non-social stimuli: Significant difference (ASD > TD).b. Proportion of time spent on session showing non-social stimuli: Significant difference (ASD > TD)c. Breakpoint of non-social video session (number of times the child clicked the non-social video before giving up): Significant difference (ASD > TD).
Carpenter et al. (2021)Specific measure:social-emotional reciprocity/facial expressionsCountry:USA	1. ASD groupM_age (SD) = 26.2 (4.1) months, N = 22, %male = 77.272. TD groupM_age (SD) = 21.7 (3.8) months, N = 74, %male = 583. DD (Non-ASD delay)M_age (SD) = 23.9 (3.7) months, N = 8, %male = 62	Tablet computer (iPad)	Task description: Same as Campbell et al. (2019). Set of developmentally appropriate videos presented on the screen. Social (woman singing nursery rhymes) and non-social (noise-making toy) videos were also presented on the left and right sides of the screen, respectively.Child response: Front camera captured child’s video. ML algorithm predicted the probability of facial expressions (positive, neutral or other) for each 3-s window.#trials: 1Setting: ClinicDuration: 5 min	ASD group displayed increased frequency of neutral expression compared to the non-ASD group.1. Area under the ROC curve (predict ASD diagnosis based on two metrics – mean probability of facial expressions for 3-s windows and frames not attending for each movie): (1) Bubbles_1: 0.75, Bunny: 0.81, Puppets: 0.78, Rhymes: 0.83Bubbles_2: 0.79.Model for ‘Rhymes’ movie yielded the strongest predictive ability
Zhao and Lu (2020)Specific measure:Imitation of facial expressionsCountry: China	1. ASD groupM_age (SD) = NA (NA) months; Range: 36–72 months, N = 10, %male = 502. TD groupM_age (SD) = NA (NA) months; Range: 60–96 months, N = 10, %male = 50	Varying: mobile phones (both Android and iOS), personal computer (Windows 10) with Windows server	Task description: A software prompted children to imitate seven different facial expressions through pictures and sounds (happy, sad, angry, disgust, surprise, fear, neutral).Child response: Camera recorded children’s facial expressions. ML algorithm estimated probabilities of the child making each of the seven expressions.#trials: 3 per expressionSetting: Quiet school roomDuration: Not specified	1. Accuracy to correctly imitate on-screen expressions: Significant difference (ASD < TD).ASD group performance most compromised for disgust, neutral, surprise and fear.
Bovery et al. (2021) Specific measure:Social preference and attentionCountry: USA	1. ASD groupM_age (SD) = 26.19 (4.07) months, N = 22, %male = 77.272. TD groupM_age (SD) = 21.91 (3.78) months, N = 82, %male = 58.54Includes eight children with a diagnosis of language delay or developmental delay sufficient to qualify for speech or developmental therapy	Tablet computer (iPad fourth generation). Front camera recorded video at 1280 × 720 resolution and 30 frames/s	Task description: 1-min video containing social (singing women) and non-social (dynamic toys with sound) stimuli. Both types of stimuli changed at pre-specified times within the movie (a different woman or toy appeared at different times).Child response: Webcam captured children’s videos while they watched the movie. ML algorithm 1 – automatically detected head position. ML algorithm 2 – predicted where on the screen the child looked (left, right or indeterminate)#trials: 1Setting: ClinicDuration: 1 min	1. Attention on screen (number of frames in which child looked at the screen): Significant difference (ASD < TD).2. Attention to social versus non-social: No difference3. Attention shift to novel stimuli when stimuli changes: Significant difference (ASD < TD).
Lu et al. (2019) Specific measure:Abstract rule learning in social and non-social contextsCountry: China	1. ASD group^a M_age (SD) = 69.96 (9.72) months, N = 28, %male = 82.142. TD groupM_age (SD) = 66.36 (5.04) months, N = 28, %male = 82.14	Laptop computer (specifications not provided)	Task description: Gamified tasks based on distrust and deception tasks. Token hidden in one of three boxes. Player incorrectly indicated to opponent the box with the hidden token. Two conditions: non-social – computer as opponent, and social – computer-controlled avatar as opponent that children believed were real people. Games included (1) recognising and avoiding misleading cues (distrust) and (2) providing misleading cues (deception).Child response: Distrust task: correct response was to choose any of the two boxes not indicated by the computer (computer always indicated the wrong box). Deception task: correct response was to indicate a box other than the one in which token was hidden (Computer always chose the indicated box).#trials: 10 eachSetting: Not specifiedDuration: Not specified	1. Number of correct trials:Distrust task – Significant difference (ASD < TD) in both social and non-social conditions.Deception task – Significant difference (ASD < TD) in social, but not in non-social condition.2. Speed of learning: Significant difference (ASD < TD) in both tasks in social condition. No difference in non-social condition.
Nakai et al. (2014) Specific measure:Speech intonationCountry: Japan	1. ASD group^a M_age (SD) = 87.23 (6.76) months, N = 26, %male = 76.922. TD groupM_age (SD) = 83.48 (9.13) months, N = 37, %male = 54.05	Lavalier Microphone (Sony, ECM-77B/9X), Audio Capture (ROLAND, EDIROL UA-25EX),Mobile Note PC (TOSHIBA, Dynabook SS RX2/T7G)	Task description: 50 picture cards of animals and objects displayed on the screen. Child had to name them.Child response: Microphone fixed to child’s clothing recorded verbal responses. Audio data of correct responses isolated for analysis.#trials: 2Setting: Not specifiedDuration: Not specified	1. Pitch coefficient of variation (each word): No difference in preschool children (4–6 years). Significant difference in school-age children (7–9 years).2. Pitch range and SD (each word): No difference3. Whole speech pitch metrics: No difference.
Wijesinghe et al. (2019) Specific measure:Speech characteristicCountry: Sri Lanka	1. ASD groupAge range in both groups: 18–36 monthsM_age (SD) = NA (NA) months, N = 8, %male = NA2. TD groupM_age (SD) = NA (NA), N = 9, %male = NA	Voice recorder (specifications not provided)	Task description: Voice recorder placed either within a pocket in the child’s clothing or within a metre from the child for periods varying from 2 to 10 h. Recording of conversations between the index child with a familiar adult in a familiar environment was captured.Child response: Audio signals were segmented as ‘silent’ and ‘non-silent’, and further to ‘vocal’ and ‘noise’ segments from non-silent segments. Child utterance data measured from vocal segments only.#trials: Not specifiedSetting: Child’s familiar environmentDuration: 2–10 h	1. ML accuracy: ML model with feature set (duration of each utterance category (child uttering a meaningful word, meaningless word, vegetative sounds, adult utterances, noises, silences, total child duration and total audio duration per 10 min) not effective in classifying children).
Gyori et al. (2008)Specific measure:Social-emotional reciprocity/facial expressionsCountry: Hungary	1. ASD groupM_age (SD) = 58.38 (8.45) months, N = 13, %male = 69.22. TD groupM_age (SD) = 57.15 (6.74) months, N = 13, %male = 46.2	Desktop computer. Webcam below the monitor captured emotional expressions. Noldus FaceReader (v5.1, Noldus Information Technology) for emotional states analysis	Task description: Gamified task to assess ability to use deception and sabotage as social strategies in competitive and co-operative contexts. Game included tasks to evoke emotional, behavioural and gaze responses.Child response: Webcam captured child’s video; analysed by the Noldus FaceReader#trials: Not specifiedSetting: LaboratoryDuration: ~24 min	1. Mean intensities of emotions: No difference
Lin et al. (2013) Specific measure:LanguageCountry: Taiwan	1. ASD group^a M_age (SD) = 66.11 (8.90) months, N = 35, %male = NA2. TD groupM_age (SD) = 60 (12) months, N = 300, %male = 49	Computer. Online (worldwide web) language assessment tool connected to backend server. ‘Offline’ version available which allows temporary storage of data on device for later upload.	Task description: Six language tests presented in auditory and visual formats – (1) decoding (DE), (2) homographs (HOM), (3) visual vocabulary comprehension (VVC), (4) auditory vocabulary comprehension (AVC), (5) visual sentence comprehension (VSC) and (6) auditory sentence comprehension (ASC).Child response: Test administrator recorded correct/incorrect responses using keyboard key presses or mouse clicks.#trials: DE = 50, HOM = 14, VVC = 53, AVC = 38, VSC = 15, ASC = 15. However, 104/186 items retained in the final test.Setting: Quiet room (clinic or school)Duration: ~35 min	1. Accuracy on each sub-test: Significant difference in DE, HOM, VVC, VSC (ASD > TD) and ASC (ASD < TD). No difference in AVC.In DE and VVC, largest difference seen at 4 years (ASD > TD), reduces at 6 years.For ASC, differences at 4 years (ASD < TD) further enhanced at 6 years.
Chaminade et al. (2015)Specific measure:Biological motion/anthropomorphic biasCountry: France	1. ASD groupM_age (SD) = 61 (25) months, N = 12, %male = 91.6%2. TD groupM_age (SD) = 39 (7) months. In total, 12 of the youngest TD children mentally age-matched to ASD sample, N = 24, %male = 58.3%	Computer touchscreen (specifications not provided)	Task description: Gamified task where four characters (two humans and two cartoons) are associated with two kinds of motions: biological and artificial. From these set of combinations, two videos were presented simultaneously on the left and right of the touchscreen.Child response: Children touched the video they liked more. The touched video grew in size while the other disappeared.#trials: 16/session, up to 4 sessions/participant.Setting: LaboratoryDuration: Not specified	1. Proportion of choices to biological motion (human): Significant difference (ASD < TD)2. Proportion of choices to biological motion (cartoon): No difference.Results similar for mentally and chronologically age-matched samples
Deschamps et al. (2014) Specific measure:Empathy and prosocial behaviourCountry: The Netherlands	1. ASD group^a M_age (SD) = 81.59 (6.95) months, N = 22, %male = 81.822. TD groupM_age (SD) = 86.39 (6.71) months, N = 29, %male = 82.76	Computer screen (specifications not provided)	Task description: Ball-throwing computer game against two computer-controlled players who gave rewards when ball was passed to them. When rewards ran out (final round), one player showed progressively distressed facial expressions each time the ball was not passed to them. Two conditions – girl and boy as the distressed player.Child response: Choose player to pass ball.#trials: 20 in final round, 10 in previous roundsSetting: Quiet school roomDuration: Not specified	1. Number of ball throws to distressed player: No difference
Aresti-Bartolome et al. (2015) Specific measure:Social interaction and eye contactCountry: Spain	1. ASD group^a M_age (SD) = NA. Age range: 36–96 months, N = 20, %male = NA2. TD groupM_age (SD) = NA. Age range: 36–96 months, N = 20, %male = NA Gender-matched to the ASD group	21″ touch screen. Tactile pointer of 40 cm. Games configured by APNABI Association	Task description: Gamified task with three levels of difficulty involving collecting as many pre-specified items within a 3-min period as possible by touching the screen with a pointer. Game stopped every 30 s or when an error was made.Child response: For game to continue, child had to interact with test administrator, who recorded the latency of the interaction through key presses on the keyboard (separate keys for interactions with and without eye contact).#trials: 1 per levelSetting: ClassroomDuration: 12 min	1. Latency of interaction with administrator: Significant difference (ASD > TD).2. Number of interactions with eye contact: Significant difference (ASD < TD)3. Task completion (% not completing levels): 10, 14, 15% in ASD versus 0, 0, 5% in TD4. Number of pre-specified items collected per level: Significant difference (ASD < TD)
P. Li et al. (2016) Specific measure:Selective trustCountry: China	1. ASD group^a M_age (SD) = 73.55 (9.71) months, N = 30, %male = 86.672. TD groupM_age (SD) = 70.31 (7.79) months, N = 30, %male = 86.67	Computer screen (specifications not provided)	Task description: Virtual candy hidden in one of the two boxes. Boxes had pictures of faces with contrasting themes: own versus other race, attractive versus unattractive, trustworthy versus not trustworthyChild response: Tap on box with preferred face characteristic.#trials: 30 trials, 10 per condition.Setting: Not specifiedDuration: Not specified	1. Accuracy of choosing own race, more attractive, or more trustworthy faces: (a) Race: No difference; (b) Attractiveness: Significant difference (ASD > TD) for more attractive face; (c) Trustworthiness: No difference
Borsos and Gyori (2017)Specific measure:Social-emotional reciprocity/facial expressionsCountry: Hungary	1. ASD groupM_age (SD) = 58.38 (8.45) months, N = 13, %male = 69.22. TD groupM_age (SD) = 57.15 (6.74) months, N = 13, %male = 46.2	Desktop computer. Webcam below the monitor captured emotional expressions. Noldus FaceReader (v5.1, Noldus Information Technology) for emotional states analysisStandard desktop-mounted eye-tracking device (EyeFollower 2 by LC Technologies) to detect attention to screen.	Task description and child response: Same as Gyori et al. (2008).For each frame, data were classified as invalid if the Noldus FaceReader was unable to identify the face or unable to assign an emotion to the frame.#trials: 3 game sections played one time eachSetting: LaboratoryDuration: ~24 min	1. Mean of static (i.e. frame-by-frame) intensities of emotional states: Significant difference in scared and surprised emotions (ASD > TD).2. Speed of emotion expression change: Significant difference in ‘surprised’ emotion (ASD > TD).3. Valid data ratio (ratio of valid vs invalid data): No significant difference.
Martin et al. (2018) Specific measure:Social preference and head postural responseCountry: USA	1. ASD groupM_age (SD) = 60.8 (16.52) months, N = 21, %male = 80.952. TD groupM_age (SD) = 51.23 (15.35) months, N = 21, %male = 66.67	19″ video monitor with a camera mounted on the top edge	Task description: Six videos containing social and non-social stimuli presented on a screen. One video each was purely non-social and purely social. Remaining four had a mix of both.Child response: Camera recorded children’s videos while they observed the stimuli. A computer vision algorithm (Zface) tracked pitch, yaw and roll of head movement frame-by-frame.#trials: 1Setting: LaboratoryDuration: 16 min	1. Proportion of successfully tracked frames: No significant difference2. Number and duration of epochs (successfully tracked consecutive frames) per video: No significant difference3. Angular displacement: Significant difference in yaw (ASD > TD) in social video. No difference for pitch and roll. No difference between groups in non-social video.4. Angular velocity: Significant difference in yaw and roll in social video (ASD > TD), but no difference in pitch. No difference between groups in non-social video.
Li et al. (2020)Specific measure:False-belief understandingCountry: China	1. ASD group^a M_age (SD) = 77.04 (9) months, N = 17, %male = 82.42. TD groupM_age (SD) = 77.04 (9) months, N = 17, %male = 58.82	Notebook computer (specifications not provided)	Task description (1): Truth value judgement (TVJ) task: One puppet made statements about pictures shown on a screen. Statements contained factive (‘knows’), counter-factive (‘pretend’) and non-factive (‘thinks’) verbs.Child response: Judge the truth of the puppet’s statement by pressing one of the three buttons – ‘yes’, ‘no’ and ‘maybe’.# trials: 10 ‘pretend’ and 15 (‘knows’, ‘thinks’)Task description (2): First- and second-order false-belief tasks presented as series of pictures with voiceovers.Child response: Answer questions to demonstrate FB understanding. Each correct answer awarded one point.#trials: 8Setting: Kindergarten/primary school/training centreDuration: Not specified	1. Mean (SD) of TVJ task score: Significant difference (ASD < TD) for ‘pretend’ condition; approaching significance (ASD < TD) in ‘know’ condition. No difference in ‘think’ condition.2a. Mean (SD) of first-order FB tasks: Significantly different (ASD < TD)2b. Mean (SD) of second-order FB tasks: Significantly different (ASD < TD)
Li et al. (2019)Specific measure:Social gazeCountry: China	1. ASD groupM_age (SD) = NA (NA) months; Range: 48–84 months, N = 136, %male = NA2. TD groupM_age (SD) = NA (NA) months; Range: 72–96 months, N = 136, %male = NA	Computer screen (specifications not provided). Head movements recorded using 360 intelligent camera	Task description: Children seated in front of a screen with their mother’s picture displayed.Child response: Camera recorded children’s videos while they observed the stimuli. ML approach used to track trajectory of eye movements in the first 2001 frames of the video. Trajectory data divided into angle and length information and used to classify children as ASD or TD.#trials: 1Setting: Primary and special education schoolsDuration: 10 min	1. No. of frames with eyes not visible: Significant difference (ASD > TD)2. Accuracy of ML algorithm to classify children into diagnostic groups: 92.6% using all features (both length and angle information)
Shahab et al. (2017) Specific measure:Joint attention and imitationCountry: Iran	1. ASD group^a M_age (SD) = 58.8 (9.96) months, N = 14, %male = NA2. TD groupM_age (SD) = 60 (10.8) months, N = 21, %male = NA	Virtual-reality (VR) setup	Task description: VR setup to play the xylophone and drums, led by robots controlled remotely via an operator. Step 1: Virtual robot showed child how to play a virtual drum and the xylophone, using VR controllers as mallets. Step 2: Children asked to describe what they saw in the virtual room. In case of unable to name objects present in the room, robots pointed to objects to direct the child’s attention. Midway through task, children requested to wear VR headset.Child response: Imitate robot’s actions. Child behaviour recorded by two video cameras. One point awarded for naming each picture in virtual room (4 total).#trials: 1Setting: LaboratoryDuration: ~ 10 min	1. Score in the drum and xylophone imitation tasks: Significant difference (ASD < TD).2. Picture naming: Significant difference (ASD < TD)3. Task engagement (defined as (1) duration engaged with the game and (2) duration for which the child wore the VR headset): Significant difference (ASD < TD) for both metrics.
Jyoti et al. (2020)Specific measure:Joint attentionCountry: India	1. ASD group^a M_age (SD) = 76.8 (14.76) months, N = 20, %male = 652. TD groupM_age (SD) = 80.4 (10.38) months, N = 20, %male = 65	VR-enabled HCI-based task platform. Touch-sensitive monitor to record child responses.	Task description: Joint attention (JA) task with 3D avatar within a VR setup. The avatar provided cues through (1) eye gaze alone; (2) head turn with gaze; (3) finger pointing, head turn and gaze and (4) all of the above and sparkling of the target contingent with JA cues. JA cues offered randomly.Child response: Touching the target object indicated by JA cue on a touch-sensitive monitor. Performance recorded on confirmation of choice.#trials: 8 (2 per cue)Setting: LaboratoryDuration: 20 min	1. Average performance score on JA task: Significant difference (ASD < TD) in two cues – eye gaze alone and eye gaze with head turn.Ceiling effect (100%) for TD group irrespective of JA cue type. ASD group performance improved with increasing information in the JA cue.2. Average reaction time: Significant difference (ASD > TD). Approximately three times longer in ASD. Average response time decreases as cue information increases.3. Effectiveness index (EI): EI = PI (scaled performance score) + (1-RTI) (scaled reaction time). Significant difference (TD > ASD). EI consistently > 1.5 across all cues for TD. For ASD group, EI progressively increases from 0.5 to 1.5 for JA cues with increasing information.
DSM-5 criteria: Stereotypical, repetitive or restricted behaviours or interests
Moradi et al. (2017) Specific measure:Patterns of play movementsCountry: Iran	2.ASD group^a M_age (SD) = 57.24 (12.36) months, N = 25, %male = 84%2. TD groupM_age (SD) = 60.24 (8.76) months, N = 25, %male = 64%	Wii remote (with a 3-axis Micro-Electro-Mechanical System (MEMS) ADXL330 accelerometer) embedded into a toy car	Task description: Children played with a toy car embedded with an accelerometer.Child response: Accelerometer recorded car’s movement in 3D. Data transferred using Bluetooth or Wi-Fi. Typical data file consisted of ~5000 samples of time and acceleration data in 3D representing child’s play with the car.#trials: 1Setting: quiet roomDuration: ~5 min	1. Accuracy, sensitivity, specificity of ML (SVM) methods: The full feature set (44 – see select examples below) discriminated between groups with reasonable accuracy (62%), sensitivity (65%) and specificity (61%).Feature examples: Play time; In each three dimension – correlation of acceleration between two of the three axes, mean and variance of acceleration, dominant frequencies of acceleration direction, total acceleration signal energy, number of jolts in the forward direction.
Motor (not covered in DSM-5 criteria)
Rafique et al. (2019) Specific measure:Fine-motor drawing movementsCountry: Pakistan	1. ASD groupM_age (SD) = NA (NA) months, Age range in both groups: 60–144 months, M: 89 months, N = 22, %male = 77.22. TD groupM_age (SD) = NA (NA) months, N = 22, %male = 54.5	Android phone (version 6.0.1)	Task description: Trace and colour a dotted square shape. Smartphone was placed on a flat table, so that the sensor values recorded force applied on the phone. Smartphone recorded four inertial and six touch data.Child response: Perform drawing task.#trials: Not specifiedSetting: School roomDuration: Not specified	1. Accuracy of ML methods to classify children into diagnostic groups using top 10 features of the creativity game reported by Anzulewicz et al. (2016): Accuracy > 85%.
Mahmoudi-Nejad et al. (2017) Specific measure:Fine-motor tracing movementsCountry: Iran	1. ASD groupM_age (SD) = NA (NA) months. Range: 48–84 months, N = 5, %male = 100%2. TD groupM_age (SD) = NA (NA) months. Age-matched with ASD group, N = 7, %male = 57.14%	Tablet computer and smartphone (specifications not provided)	Task description: Follow a pre-specified path marked with pink flowers to take a bee to its hive. Pink flower turns green (win) when bee touches it. Untouched flowers turned red (fail). Haptic feedback provided if defined trajectory is not followed.Child response: Trace path to take bee to hive. Touch data recorded on the tablet. Children awarded 0–4 stars/trial depending on their accuracy (win/total). Data analysed only from Level 1 since ASD group could not play beyond this level.#trials: 8 sub-levels within 4 difficulty levels.Setting: Not specifiedDuration: Not specified	45 features extracted from touch data in two categories – point-based and progress-based.Points-based: Calculated using two adjacent points the child touched (e.g. distance, velocity, acceleration, time, curvature, error)Progress-based: Indicator of attempts made (e.g. score, levels played, completion time).ML methods (Linear SVM) used to classify children into their respective groups. Three features – Total score, Average velocity and Average curvature – discriminated between ASD and TD groups.
Dawson et al. (2018) Specific measure: Rate of head movementsCountry: USA	1. ASD groupM_age (SD) = 26.19 (4.07) months, N = 22, %male = 77.272. TD groupM_age (SD) = 21.91 (3.78) months, N = 82, %male = 58.54Includes eight children with a diagnosis of language delay or developmental delay sufficient to qualify for speech or developmental therapy.	Tablet computer (iPad). Front camera recorded video at 1280 × 720 resolution and 30 frames/s	Task description: Same as Campbell et al. (2019).Child response: Camera captured child video while they viewed stimuli on tablet screen. ML algorithm detected and tracked head position. Three head pose angles – yaw (left-right), pitch (up-down) and roll (left-right tilt) calculated per frame. Child was considered to be looking at the screen when yaw ⩽ 20°#trials: 1Setting: ClinicDuration: 5 min	1. Rate ratio of head movement of ASD with TD as reference: Rate ratio calculated from the association between head movement rate (for every one-third second across the movie) and ASD diagnosis.Robust differences observed in four out of five movies (except bubbles) (ASD > TD). These four movies had complex stimuli compared to bubbles video.Associations unchanged with eight DD participants removed from the TD group during sensitivity analysis.
Fleury et al. (2013) Specific measure:Fine-motor drawing movementsCountry: Canada	1. ASD group^a M_age (SD) = 81.6 (19.08) months, N = 15, %male = 86.962. TD groupM_age (SD) = 63 (12) months, N = 19, %male = 85	Wacom Cintiq 15X digitising tablet and pen	Task description: Draw circles on a touchscreen using a stylus. Six conditions based on hand-use (dominant vs non-dominant) and three circle drawing styles (continuous drawing without pausing: comfortable vs fast pace; and discontinuous by pausing at the top before starting again).Child response: Draw circles as per the six specified conditions.#trials: 1 each (6 total)Setting: Not specifiedDuration: 14 min	1. Mean circle drawing time: No difference2. Coefficient of variation (CV) of circle drawing time: Significant difference in discontinuous condition (ASD > TD)3. Power spectral density (PSD), root mean squared (RMS) fluctuations during drawing and statistical persistence (Hurst coefficient): No difference
Dowd et al. (2012) Specific measure:Motor planning and executionCountry: Australia	1. ASD group^a M_age (SD) = 74.39 (16.8) months, N = 11, %male = 72.732. TD groupM_age (SD) = 79.19 (18) months, N = 12, %male = 75	17″ LCD touch screen (MicroTouch 3MM170), connected to a HP Compaq 6910p laptop	Task description: Point-to-point movement task. Join two points presented on a vertical plane using a stylus. Start position at the bottom centre of the screen. End position at the top of the screen either left, centre or right from start. Some trials had a visual distractor near the target.Child response: Trace a line from start to end. Stylus movement was sampled at 125 Hz.#trials: 25Setting: LaboratoryDuration: Not specified	1. Various kinematic variables:a. Significant difference in variability of movement preparation time (ASD > TD). No difference in any other variableb. Interaction effects observed for distractor condition: longer and more variable total movement time in TD. No difference in ASD.
Crippa et al. (2013) Specific measure:Hand-eye coordinationCountry: Italy	1. ASD group^a M_age (SD) = 74.4 (25.2) months, N = 14, %male = 85.7%2. TD groupM_age (SD) = 75.6 (27.6) months, N = 14, Age- and gender-matched	100 Hz 17″ LCD touch screen (Elo TouchSystem) at 1024 × 768 pixel resolution.50X Tobii for eye-tracking, AB Danderyd, Sweden, with 5-point calibration.Video recorder	Task description: Gap overlap task.Child response: Two conditions – (1) ‘press’: participants indicated the target’s left/right position on screen using a button box; and (2) ‘touch’: participants touched the targets on the screen. Movements started from a set position by lifting the hand off a pad on the table.#trials: 3 sessions of 16 trials each.Setting: LaboratoryDuration: Not specified	1. Reaction time for button press in ‘press’ condition: No difference2. Accuracy of touching target in ‘touch’ condition: No difference3. Eye-hand coordination (Pearson correlation between eye fixation latency on target and hand response (key presses and touch times): Strong and significant correlation in TD. Weak (‘press’) or no (‘touch’) correlation in ASD.
				4. Differences in gap and overlap conditions: No difference in ‘press’ condition. Significant gap effect in TD but not ASD in ‘touch’ condition.
Jung et al. (2006) Specific measure:Visuomotor coordinationCountry: South Korea	1. ASD group^a M_age (SD) = 72 (0) months, N = 12, %male = 83.332. TD groupAge range: 60–72 months, N = 20, %male = NA, M_age (SD) = NA	VR setup: Pentium IV PC, one projector, one screen (200 X150 cm), one infrared reflector, one digital camera and tangible devices (stick)	Task description: Burst virtual balloons in a VR setup. Auditory and visual reinforcements provided when successful. Number of balloons and the type of reinforcement changed by level.Child response: Burst virtual balloons by moving a real stick.#trials: 10 sessionsSetting: LaboratoryDuration: Not specified	1. Accuracy: No difference (may be due to high variability in ASD group).2. Reaction time: Significant difference (ASD > TD)3. Movement of stick to pop balloons: Significant difference (ASD < TD)4. PCA index: Significant difference indicating TD more efficient in popping balloons using combination of three variables.
Alcañiz Raya et al. (2020) Specific measure:Gross-motor movementsCountry: Spain	1. ASD group^a M_age (SD) = 61.56 (16.2) months, N = 24, %male = 87.52. TD groupM_age (SD) = 58.32 (10.92) months, N = 25, %male = 64	RGB-D camera (includes depth information in video recordings): RealSense – camera D435 (FRAMOS, Munich, Germany) and Intel RealSense SDK 2.0 (Intel RealSenseTechnology, Santa Clara, CA, USA)VR setup: CAVE-Automatic Virtual Environment (CAVETM): semi-immersive room with 3–6 rear-projected surfaces	Task description: VR setup showing street intersection with three types of stimuli conditions: visual (V): avatar walks into the scene and waves to participant; visual–auditory (VA): avatar dances to a song for 10 s; visual–auditory–olfactory (VAO): two avatars bite into a buttered muffin, while participant is able to smell, see and hear actions.Child response: Imitate avatars while a camera recorded movements. OpenPose algorithm used to detect body joints in each frame. Joint displacement computed across two consecutive frames and then averaged across a condition.#trials: 9 (3 per condition)Setting: LaboratoryDuration: 14 min	1. Average movement of each joint in each condition: Significant difference (ASD > TD) in all three conditions.Condition V: leg, head and trunkCondition VA: leg and headCondition VAO: headML models used to discriminate between TD and ASD groups.2. ASD classification accuracy:Head metrics performed the best in all conditions. Highest accuracies observed when using head metrics in VAO condition and all joints in V condition (89.36%). Least accuracy (70.2%) in model including all joints and all conditions.
Cognitive (not covered in DSM-5 criteria)
Chen et al. (2019)Specific measure:Executive functioningCountry: China	1. ASD group^a M_age (SD) = 54 (11) months, N = 40, %male = NA2. TD groupM_age (SD) = 55 (7) months, N = 51, %male = NA. Groups were gender-matched.	Tablet computer (PC or PAD)	Task description: Series of gamified tasks presented on a screen. Included joint attention tasks, responding to social requests, matching shapes, categorisation, visual search and visuomotor coordination.Child responses: Tap on target objects on the screen during gameplay. Responses stored on the tablet. One point awarded for each correct answer.#trials: 1Setting: Therapeutic centre and kindergartenDuration: 15–20 min	1. Completion rate (proportion of participants completing game): Significant difference (ASD < TD for children > 48 months)2. Efficiency (ratio of average score to average completion time): Significant difference (ASD < TD; visual search and visuomotor coordination in ⩽ 48 months only; Categorisation and matching shapes in > 48 months)
Hetzroni et al. (2019) Specific measure:Relational (abstract) thinkingCountry: Israel	1. ASD group^a M_age (SD) = 78.84 (10.44) months, N = 24, %male = 752. TD groupM_age (SD) = 67.56 (3.24) months, N = 24, %male = 41.663. IDD groupM_age (SD) = 142.56 (27.48) months, N = 24, %male = 62.5	Portable computer (specifications not provided)	Task description: Pictures of animals presented in various relational configurations (example – two koalas as mirror images). In a subset of trials, another panel with the same configuration, but with a different animal, was shown to strengthen concept. Influence of level of familiarity determined using local (known), foreign (partially known) and made-up (unknown) animal pictures.Child response: Tap on one of the two options that match the configuration of the target panel. Correct responses awarded one point.#trials: 8Setting: quiet roomDuration: 30–40 min	1. Accuracy (proportion of correct choices): Significant difference (IDD < ASD < TD in single panel condition). No benefit of second panel (to strengthen concept) to ASD group.2. Influence of familiarity: Performance unaffected by familiarity in ASD group. In the TD and IDD groups, better performance with known target images.
Veenstra et al. (2012) Specific measure:Executive functioningCountry: The Netherlands	1. ASD group^a M_age (SD) = 61.2 (9.6) months, N = 13, %male = NA2. TD groupM_age (SD) = 45.6 (1.68) months, N = 5, %male = NA	Computer screen (specifications not provided)	Task description: Web-based gamified Go/No-go task (www.samenslim.nl).Child response: Through mouse clicks during gameplay.#trials: 2–3 sessions, max of 7 games/session.Setting: quiet room in a medical day-care centre or playgroupDuration: Not specified	Significant differences observed in all metrics:1. Accuracy: ASD < TD2: No-go (Number of clicks when no clicks should have been made): ASD < TD3. Missing go (No clicks during clicking moments): ASD > TD4. Go (Number of clicks during clicking moments): ASD < TD5. Reaction time: ASD > TD6. Repeated clicks (Repeated clicks on the same objects): ASD > TD7. Variability across levels: ASD < TD
Gardiner et al. (2017) Specific measure:Executive functioningCountry: Canada	1. ASD groupM_age (SD) = 66.88 (13.41) months, N = 24, %male = 83.332. TD groupM_age (SD) = 58.47 (15.87) months, N = 19, %male = 57.89	Touchscreen monitor (specification not provided)	Task description: Computerised tasks assessing executive functioning:1. spatial working memory (Boxes task)2. cognitive flexibility, inhibition and working memory (Go/No-Go, Preschool-Continuous Performance Test (PCPT))3. Multicomponent planning task (Monkey Tower) – adaptation of Tower of Hanoi taskChild response: Tap on screen during gameplay#trials: 1Setting: LaboratoryDuration: 90–120 min	1. Accuracy on Boxes, Go/No-Go, PCPT: No difference2. Number of correct trials in multicomponent planning task: Approaching significance (ASD < TD; p = 0.036).

ASD: autism spectrum disorder; TD: typically developing; AUC: area under the ROC curve; ROI: region of interest; RT: reaction time; DD: developmental delay; ROC: Receiver Operating Characteristic; FB: false belief understanding; PI: performance index (scaled); RTI: reaction time index (scaled); PCA: Principal Component Analysis; IDD: Intellectual and Developmental Disabilities.

The following colour coding has been used in the column named ‘Device specifications’ to indicate the feasibility of the device for use in low-resource settings: Green = most feasible; Red = least feasible.

Colour coding

Figure added to Table 1 legend: Colour coding to indicate feasibility of administering the identified digital tools in low-resource settings.

The time taken to complete the assessment, when specified in the article, is included in the column titled ‘Experimental setup’ in red font colour, along with the number of trials of the assessment and the experimental setting (laboratory, clinic, school, home, etc).

To maintain consistency, age when reported in years was converted to months by multiplying by 12.

Risk of bias

A list of questions was compiled from two risk of bias assessment tools – Joanna Briggs Institute Critical Appraisal tools: Checklist for Case–Control Studies (Joanna Briggs Institute, 2017) and the QualSyst tool: Checklist for assessing the quality of quantitative studies from the Alberta Heritage Foundation for Medical Research Health Technology Assessment Initiative Series (Kmet et al., 2004). Some questions from the compiled list were adapted; the final set of questions used is listed in Supplementary Table 3.

Community involvement statement

As the reported study is a review a posteriori of the reported research, there was no community involvement.

Results

Study selection

A total of 51,953 titles and 6884 abstracts were screened for relevance across the two phases (Figure 1). However, 567 full-text articles were screened for eligibility, of which 38 met inclusion criteria. The most common reason for exclusion was the age criterion (mean age > 8 years; n = 193). Other reasons for exclusions were the following: the primary focus of the article was feasibility testing, protocol description or delivery of an intervention (n = 114); absence of a TD comparison group (n = 51); sample size of less than five in at least one group (n = 40); task not administered on digital devices or child responses being coded manually (n = 35) (Figure 1).

Figure 1.

Study selection flow diagram.

Description of the study participants

Together, these studies analysed results from 889 ASD participants, 1348 TD participants and 32 participants with a neurodevelopmental disorder other than ASD (intellectual disability (N = 24), developmental delay or language delay (N = 8)). The proportion of males in the ASD group (79.3%) exceeded that in the TD (60.4%) or NDD (62.3%) comparison groups. The mean age of the ASD group was also higher (65.3 months) compared to the TD group (60.9 months). Among 31 studies reporting the mean age, the TD comparison group was younger than the ASD group in the majority of studies (21/31; 67.7%), to allow the groups to be developmentally age-matched (Table 1 – Participant details and Figure 2). In all but three papers (Bovery et al., 2021; Gale et al., 2019; Ruta et al., 2017), the participants were > 36 months of age, indicative of most tools’ applicability beyond infancy and toddlerhood (Figure 2(a)). Gamified tasks, tasks presented on virtual-reality (VR) platforms and those assessing speech and language, were applied to children with mean age > 5 years. Tasks involving video recording of children’s behaviours while they interacted with stimuli presented on a screen or with the experimenter were typically applied to children with mean age below 5 years (Figure 2(b)). It is to be noted that developmental age equivalents and individual skill profiles and preferences are more significant than chronological ages in determining applicability.

Figure 2.

Age distribution of participants across studies and types of tasks applied to them. (a) Mean (dot) and standard deviation (error bar) of the chronological age (CA) of participants in months is represented on the X-axis (red = ASD; blue = TD; green = neurodevelopmental disorders not including ASD). Y-axis lists the included studies (in the order presented in Table 1). Most studies used CA-matched samples, except for Hetzroni et al. (2019) in which the other NDD group was significantly older as a result of being developmentally age-matched to the ASD and TD groups. Some studies reported the range of one or more participant groups instead of the mean and SD. In those cases, the range was represented as a horizontal line (e.g. Zhao et al., 2020; 10th row) using the same colour scheme. (b) Box plot demonstrating the age group to which different types of tasks were applied. The vertical line within the box represents the median age of participants to which the tasks were administered. The whiskers represent the 25th and 75th percentiles of participant age, respectively.

Overview of scalable digital tools for early assessment of ASD risk

This is an emerging field, 28 of the 38 included articles (73.6%) having been published in the past 5 years (Supplementary Figure 2). The tools identified were predominantly at the pilot or ‘proof-of-concept’ stage and typically administered in laboratory or clinic settings by research staff. Studies demonstrated high levels of heterogeneity across tasks used to assess diagnostic discriminative ability, the type of technology used to implement them, primary metrics evaluated and developmental domains assessed. Tasks were presented on portable technologies, such as laptops (H. Li & Leung, 2020; Lu et al., 2019), tablet computers (Anzulewicz et al., 2016; Bovery et al., 2021; Campbell et al., 2019; Carlsson et al., 2018; Carpenter et al., 2021; Chen et al., 2019; Chetcuti et al., 2019; Dawson et al., 2018; Fleury et al., 2013; Gale et al., 2019; Jones et al., 2018; Mahmoudi-Nejad et al., 2017; Ruta et al., 2017), smartphones (Mahmoudi-Nejad et al., 2017; Rafique et al., 2019; Zhao & Lu, 2020), intelligent toys (Moradi et al., 2017) and digital audio recorders (Nakai et al., 2014; Wijesinghe et al., 2019), and non-portable technologies, such as desktop computers (Aresti-Bartolome et al., 2015; Borsos & Gyori, 2017; Chaminade et al., 2015; Crippa et al., 2013; Deschamps et al., 2014; Dowd et al., 2012; Gardiner et al., 2017; Gyori et al., 2018; Hetzroni et al., 2019; J. Li et al., 2020; P. Li et al., 2016; Lin et al., 2013; Martin et al., 2018; Veenstra et al., 2012) and VR platforms of varying sophistication (Jung et al., 2006; Jyoti & Lahiri, 2020; Alcañiz Raya et al., 2020; Shahab et al., 2017).

In total, 21 studies (55.3%) used gamified tasks (Anzulewicz et al., 2016; Aresti-Bartolome et al., 2015; Carlsson et al., 2018; Chaminade et al., 2015; Chen et al., 2019; Chetcuti et al., 2019; Crippa et al., 2013; Deschamps et al., 2014; Dowd et al., 2012; Fleury et al., 2013; Gale et al., 2019; Gardiner et al., 2017; Hetzroni et al., 2019; Jones et al., 2018; P. Li et al., 2016; H. Li & Leung, 2020; Lu et al., 2019; Mahmoudi-Nejad et al., 2017; Rafique et al., 2019; Ruta et al., 2017; Veenstra et al., 2012), making these the most common type of performance-based tasks to detect autism risk in early childhood. Other types of assessments included video recording of children’s behaviours while they viewed or interacted with stimuli presented on a screen (n = 9; 23.7%) (Borsos & Gyori, 2017; Bovery et al., 2021; Campbell et al., 2019; Carpenter et al., 2021; Dawson et al., 2018; Gyori et al., 2018; J. Li et al., 2020; Martin et al., 2018; Zhao & Lu, 2020), tasks using VR platforms (n = 4; 10.5%) (Jung et al., 2006; Jyoti & Lahiri, 2020; Alcañiz Raya et al., 2020; Shahab et al., 2017) and audio recording of children’s speech (n = 2; 5.2%) (Nakai et al., 2014; Wijesinghe et al., 2019). One study used a toy car with an embedded accelerometer to record the child’s movement characteristics while they played with the toy (Moradi et al., 2017).

Together, these technologies targeted both criteria set within the DSM-5 for ASDs (Table 1). Several technologies also assessed neurodevelopmental domains not included within the DSM-5 criteria, but known to be affected in many children with ASD. Examples include deficits in motor and cognitive abilities (Figure 3). Two papers included a non-ASD NDD comparison group in the study design (Carpenter et al., 2021; Hetzroni et al., 2019); both demonstrated specificity to ASD symptoms. Eight studies (21.1%) used machine learning (ML) to identify nonlinear combinations of metrics as discriminants (Bovery et al., 2021; Campbell et al., 2019; Carpenter et al., 2021; Dawson et al., 2018; J. Li et al., 2020; Martin et al., 2018; Alcañiz Raya et al., 2020; Zhao & Lu, 2020).

Figure 3.

DSM-5 criteria and developmental area(s) assessed by scalable digital tools.

The majority of studies were conducted in high-income countries (26/38); however, three recent studies from India (Jyoti & Lahiri, 2020), Pakistan (Rafique et al., 2019) and Sri Lanka (Wijesinghe et al., 2019) and three from Iran (Shahab et al., 2017; Moradi et al., 2017; and Mahmoudi-Nejad et al., 2017), all low- and middle-income countries (LMICs), represent an encouraging trend for global mental health research.

Assessment of the risk of bias

More than 90% of the included articles (34 of 38) clearly described the aims and objectives, details of implementation and used valid statistical methods to report their results (Figure 4). They also reported the reasons for the loss of participants when applicable, and the results supported the conclusions. However, participant demographic details were only reported by 3/38 (7.9%) papers (Supplementary Table 2: Additional participant details), which omission precluded the determination of adequate matching of participant characteristics across groups in a case–control study design. However, 12/38 (31.6%) papers did not describe the study setting or population from which the TD group was recruited. Groups were adequately matched on gender and developmental age only in 9/38 (23.7%) studies. Gender distribution across groups was largely mismatched, with the ASD group typically having a greater proportion of males compared to the TD group (Table 1). In total, 21/38 studies did not clearly describe the inclusion/exclusion criteria for participant recruitment across both the groups. Whereas, 17/38 articles used standardised diagnostic or screening tools to select participants in the ASD group. Meanwhile, 36/38 papers did not include an NDD group without ASD to demonstrate the specificity of the tasks and metrics to ASD symptoms. Finally, while all the included studies were at the proof-of-concept stage, limitations related to small sample sizes, lack of generalisability and inadequate matching of samples were described only in 20/38 (52.6%) studies (Figure 4). Of note, none of the included articles explicitly reported on any measure related to reliability (intra- and inter-assessor reliability, test–retest reliability) or validity (face, construct, content and criterion). While greater understanding of each tool’s validity and reliability is important for their ultimate use, this state of development is to be expected for an emerging field, as the main focus of these initial studies is to demonstrate feasibility and explore the discriminative ability of these tools.

Figure 4.

Risk of bias of included studies.

Characteristics of digital ASD assessment tools

Detailed characteristics of individual tasks, details of implementation and their discriminative ability as reported by the studies are presented in Table 1. Detailed description and comparisons of tasks used to address the primary research questions are presented in the Supplementary material. Based on the evidence, the potential of these tools to screen for autism risk in low-resource settings is discussed below.

Tasks using portable technologies (tablet computers, smartphones, toy cars and digital audio recorders)

All tasks using mobile technology could be completed in 8 min on average (range = 1–20 min). Except for tasks assessing accuracy on executive functioning skills (Chen et al., 2019; Jones et al., 2018), all other tasks and metrics could discriminate between ASD and TD at a group level (details in Supplementary material). Tasks tapping the social and motor domains were particularly reliable, as discriminative ability was demonstrated by a total of 13 studies led by different study groups using a variety of tasks, metrics and devices. This included seven tasks tapping the social domain (Bovery et al., 2021; Campbell et al., 2019; Carlsson et al., 2018; Gale et al., 2019; H. Li & Leung, 2020; Lu et al., 2019; Ruta et al., 2017) encompassing social versus non-social stimulus preference and theory-of-mind, and six tapping fine- and gross-motor domains (Anzulewicz et al., 2016; Chetcuti et al., 2019; Dawson et al., 2018; Fleury et al., 2013; Mahmoudi-Nejad et al., 2017; Rafique et al., 2019). Also within the social domain, two studies assessed group differences in facial expressions in two ways – evoked expressions while watching animated videos (Carpenter et al., 2021) versus imitating the facial expressions presented on the screen (Zhao & Lu, 2020). Therefore, while similar data capture and analysis methods were used and significant group differences were reported by both, a direct comparison of the tasks and metrics for this particular construct was not possible. One study each used a toy car and digital audio recorder to assess autism risk. The former was moderately successful while the latter failed – however, more replications of these tasks are required to determine their utility.

Tasks using non-portable technology (desktop computers and VR platforms)

Tasks presented on desktop computers were highly heterogeneous in terms of the ASD phenotype assessed, making it difficult to synthesise results across studies. Except for three studies assessing EF, all others assessed a unique skill using different tasks and metrics. Consistent with tasks presented on mobile devices, accuracy on EF tasks showed mixed results in the desktop technology format as well. Results related to reaction time were consistent, with the ASD group reported to be slower in providing responses in EF tasks. Most tasks tapping the social (Aresti-Bartolome et al., 2015; Chaminade et al., 2015; Martin et al., 2018) and motor (Crippa et al., 2013; Dowd et al., 2012) domains continued to demonstrate significant group differences. The two studies assessing speech and language (Lin et al., 2013; Nakai et al., 2014) used very different metrics to assess group differences (pitch characteristics vs accuracy), so no comparison was possible. Similarly, the results from facial expression analysis using the Noldus FaceReader (Borsos & Gyori, 2017; Gyori et al., 2018) were too preliminary to determine their utility for use as autism screening measures.

The VR format of ASD risk assessment, although showing promising results, depended on sophisticated devices and administration in laboratory settings by trained research staff. However, the discriminative ability of these tasks tapping joint attention, motor imitation and visuomotor coordination continues to highlight the promise of the social and motor domains to identify autism risk. Tasks using desktop technology were completed in 23 min on average, about thrice as long as those on portable devices (8 min). VR tasks took 14.6 min on average (see Supplementary material).

Discussion

Tasks can be brief, portable and largely automated

This study identifies and characterises digital tools that have the potential to be applied in direct assessment of autism risk in early childhood in low-resource settings. Because the availability of skilled human resources is a major limitation in these settings (Divan et al., 2021), we focused on tools that require minimum assessor judgement during administration, and whose data analysis could be automated with no to minimal manual inputs. Two main modalities of direct child assessments were identified – gamified tasks and video or audio recordings of the participant while they viewed/responded to stimuli on the screen or VR platforms, or interacted with research staff or family members. Tasks were presented on both portable and non-portable technologies, namely laptops, tablet computers and smartphones on the one hand, and desktop computers and VR platforms on the other. However, some tasks presented on non-portable technologies but requiring child responses on touchscreens, or in which children’s videos were captured using webcams, are easily adaptable to portable devices (Jyoti & Lahiri, 2022). The majority of the assessments were administered in laboratory or clinic settings, but some were also deployed in homes, schools and daycares. While trained research staff administered these tasks in all studies, they typically provided only simple instructions and demonstrations before participants became able to engage with the tasks independently – extending these tasks’ promise in the hands of non-specialists. Finally, tasks delivered on portable technologies could be completed in less than 10 min on average, and most others within 30 min. Therefore, once validated, the types of tasks identified and their potential to be delivered on low-cost devices by non-specialists pave a promising path for ASD risk detection in low-resource settings, which bear the largest burden of cases worldwide (Baxter et al., 2015). Six recent studies’ being based in LMICs is an encouraging trend towards this direction.

Social and motor skills discriminate best

While specific tasks and metrics targeted multiple developmental domains, those tapping social and motor domains were the most promising. Most studies assessing these domains reported significant group differences using a variety of technologies, tasks and metrics. Lower preference for social stimuli emerged as one of the most reliable metrics. The ASD group consistently preferred non-social stimuli irrespective of the format in which these were presented, be it static images or videos presented on tablet or desktop screens. This general result of aversion to social stimuli aligns with the literature, including decades of eye-tracking literature demonstrating reduced time spent looking at social stimuli in the ASD group (Papagiannopoulou et al., 2014). Individual studies reported unique tasks and metrics relevant to the social domain that successfully discriminated between the ASD and TD groups; examples include anthropomorphic bias and the applied theory-of-mind ability to deceive and to distrust opponents. The ASD group was also found to take longer to orient to the person calling their name, or to initiate social interactions. These are examples of digital tasks tapping literature-backed autism-relevant phenotypes that historically have been assessed by in-person interactions with trained staff (Baron-Cohen et al., 1985; Bruinsma et al., 2004; Federici et al., 2020; Zwaigenbaum et al., 2005). Their success in digital and gamified formats is encouraging for their potential to scale as screening tools in low-resource settings.

Similarly, a variety of tasks tapping the motor domain found consistent differences between the ASD and TD groups. Tasks assessing fine-motor abilities, which were largely administered on portable technology including tablets and smartphones, found differences in the pattern of interactions with smart devices. Kinematic analyses contrasting autistic versus non-autistic touchscreen movements in gamified tasks have demonstrated greater variance in speed and direction with longer, less straight movement paths, amid a less fluid, more piecewise movement style (Weisblatt et al., 2019) and greater spatiotemporal error across a range of tasks (Dubey et al., 2021). A challenge in quantifying visuomotor behaviour is the choice of specific derived metrics from a large number of possible ones, for example, acceleration, jerk, direction, variances and maxima thereof, duration and extent of movement. An important element of the future research agenda will be to combine motor and developmental literatures so as to give the autism field a standard set of parameters with which movements can be described and compared across studies. Discriminative results reviewed here focus thematically on inter-trial variability in response duration in motor planning tasks, accuracy in imitation of complex motor gestures, and visuomotor coordination, but the devil is in the details of specific derived measures.

However, studies assessing gross motor movements were typically based on video recordings of child behaviour while viewing on-screen stimuli or imitating avatars presented on VR platforms. The most consistent result was differences in head movements, indicative of poor postural head control in the ASD group. These results are consistent with the literature on autism-related motor phenotypes, for example, stride length variability (Kindregan et al., 2015), and differences across a range of behaviours including arm movements, gait (Lum et al., 2021), postural stability and oculomotor coordination (Fournier et al., 2010; Johnson et al., 2016), all of which are suggestive of an overall deficit in motor planning (Rinehart et al., 2006) and coordination. Mechanisms proposed for these observed autistic deficits are an inability to chain together sequential motor events (Cattaneo et al., 2007; von Hofsten & Rosander, 2012) and difficulty incorporating visual error feedback in an online movement process (Haswell et al., 2009).

Another consistent discriminating metric was slower reaction times in the ASD group, measured as the latency to respond selectively to pre-specified target objects. This slowing manifested in a range of tasks including tablet-based gamified tasks assessing executive functioning, VR-based tasks of joint attention and visuomotor coordination, and video recording of child behaviour to assess latency in orienting to name. The validity of this metric is supported by the literature demonstrating slower reaction times in older children (Herrero et al., 2015) and adults (Lartseva et al., 2014; Schmitz et al., 2007; van den Boomen et al., 2019) with ASD using computerised simple and choice reaction tasks. Although pathologically slowed reaction time can flag developmental issues in general, by itself it would be too blunt an instrument to discriminate autism from other neurodevelopmental conditions. Similarly, task completion and task engagement, the latter defined variably as number of frames in which an ML algorithm was able to compute relevant metrics from children’s facial landmarks, or the duration for which the child played the game, were found to be a useful discriminating metric in six different studies, but cannot point to autism in particular.

Accounting for heterogeneities, sex differences, ages and stages, and available resources

Arriving at a set of screening tasks that can discriminate autism in all its forms and presentations is not as simple as deciding which tasks work. One needs to know, rather, which tasks work for which subtypes of this heterogeneous condition, at what ages and developmental stages, and in which real-world circumstances dependent on culture and context. Given the early stage of development, most studies are limited in their generalisability to broader samples and contexts. The absence of demographic details in the majority of papers not only precludes determining the comparability between the ASD and TD groups within and across studies but also obscures our understanding of the heterogeneity of the study samples and the subpopulations for whom these tools are applicable. For instance, motor tasks might be especially predictive in some individuals who are not beginning to speak on time (Belmonte et al., 2013), and conversely, assays of social responsiveness might be more predictive in others. Furthermore, while gender specificities in ASD prevalence and symptoms exist (Halladay et al., 2015), gender differences were not addressed in any of the studies. Therefore, further confirmatory studies are required to test the validity, reliability and specificity of these tools before they can be deployed as screening measures. One group has already started planning Phase 3 trials using large samples in different contexts (Millar et al., 2019), which is an important stride in the right direction. The US Food and Drug Administration (FDA) recently authorised marketing of the Cognoa ASD Diagnosis Aid, software using ML algorithms to help predict the risk of autism based on parent reports, videos of child behaviour and health provider inputs, as an adjunct though not a substitute to the regular diagnostic process. A similar open-source effort has shown > 78% accuracy in discriminating between autism, intellectual disability and typical development in 2- to 7-year-old toddlers in low-resource settings (Dubey et al., 2021).

These studies demonstrate great potential pending further validation. All social and motor tasks met our minimum criteria for use in low-resource settings – (1) independent of assessor judgement and hence with the potential to be easily administered by non-specialist providers and (2) capturing data in digital format that could be objectively analysed. Furthermore, they could be completed in less than 20 min, largely on easily accessible and affordable devices. The potential for portability of some of the tasks administered on non-portable technology (desktops, VR) is high since most tasks were in a format that could be adapted to portable technologies (e.g. Jyoti & Lahiri, 2022). With respect to the type of tasks, computer vision analysis of child behaviour is more versatile as it is applicable to the full spectrum of children of varying ages and abilities. Gamified tasks, however, the majority identified by our search, are only suitable for older or less impaired children who can understand and follow game instructions.

Advantages of digital tools for ASD risk assessment and implications for global health research

These novel tools are uncovering nuanced differences in child behaviour using objective and automated measures. Examples include inter-trial variability of a few seconds during motor planning, and differences in force, pressure and patterns of tap-and-drag gestures in tablet-based gamified tasks. The Research Domain Criteria framework theorises that early deviations from normative developmental trends may be predictive of later disorders (Cuthbert, 2014, 2020), including autism (Tunç et al., 2019). Digital tools provide the most feasible method to develop global normative trends of ASD-relevant phenotypes, which could be used to flag children with differential trajectories. Their potential to be administered by non-specialists make them amenable to task-sharing and stepped-care approaches, paving the way for large-scale ASD screening across diverse locations including low-resource settings. Given the current state of development of the tools, magnitude of demand and heterogeneity within the ASD population, digital tools can potentially aid in the initial screening of autism risk at scale. However, a confirmatory diagnosis should only be made by a clinician.

Notwithstanding their potential, it is important to recognise that a ‘digital divide’ currently exists between high- and low-resourced settings, the developmental benefits of technological advances disproportionately accruing to educated and resource-rich communities that have access to smart devices, adequate power supplies and the know-how of Internet services (World Bank, 2016). Therefore, the feasibility of using various types of digital platforms for ASD risk assessment must be carefully considered with respect to the specific LMIC context and setting in which they will be used. Portable computers and mobile devices provide the highest levels of accessibility, affordability and potential for scale across all settings of the global South. Therefore, tools that are adapted for delivery on these platforms would be most feasible to use across all settings and ensure that the technological advances do not inadvertently increase the digital divide in the global ASD community (Kumm et al., 2022).

Methodological considerations to advance the field

Based on the key findings and limitations discussed above, we provide the following directions for future studies. Tasks tapping the social and motor domains that are available in gamified formats or that estimate metrics using computer vision analysis, and provide objective measures analyzable using standard and machine learning methods, should be prioritised for further development. The main goal should be to further validate these tasks and metrics using prospective cross-sectional study designs and larger samples, and to standardise derived measures across tasks. Studies should focus on establishing reliability (intra- and inter-assessor reliability, test–retest reliability) and validity (face, construct, content and criterion) of these tools and report these metrics in future publications. Larger samples would allow assessing heterogeneity using dimensional measures (Stevens et al., 2019) and refining ML algorithms when applicable. Once a tool or task is validated for a single population, its acceptability, feasibility and validity should be assessed in diverse settings and geographies, checking the consistency of psychometric properties across contexts. Once metrics describing reliability, validity, sensitivity and specificity are available for these tools, we can begin to make judgements about their use as screening or diagnostic tools.

For this reason of cultural context, especially, stakeholder families and community members must be involved in the design and execution of such research (Staniszewska et al., 2018), ideally with representation on the research team itself. Stakeholder and community involvement heightens recruitment and retention in general (Crocker et al., 2018), and in autism studies in particular (McKinney et al., 2021), makes tools globally relevant, especially to low-resource and underserved communities (Witham et al., 2020) and must be an integral part of the research design process from the stage of conceptualisation, rather than left as an afterthought. None of the studies reviewed has reported a strategy for stakeholder and community involvement. This must change.

The social and motor tasks currently being administered on desktop computers and VR platforms should be translated for delivery on portable devices to improve access, and then subjected to feasibility and validation testing in different settings riding the wave of widespread mHealth technology use worldwide (Abaza & Marschollek, 2017; Osei & Mashamba-Thompson, 2021). Since research is linked to local capacity building (Durkin et al., 2015), greater testing of these tools in diverse low-resource settings will also generate the necessary awareness and skills, in the community and in researchers, to build momentum towards universal screening of children’s development. Finally, longitudinal studies should be designed to evaluate the developmental trajectories using metrics of the most promising tools and to develop a deeper understanding of the normative trends of ASD-relevant phenotypes.

We identified a few standalone studies that assessed ASD-related behaviours using very specific and unique tasks that could significantly discriminate between groups. Examples include studies assessing evoked and imitated facial expressions, and speech and language. We recommend replication of these studies by different groups in different populations to further test their validity and reliability.

Box 1.

Considerations for future studies evaluating digital tools for ASD risk assessments.

1. Specificity to developmental delay in general, not to autism in particular: Although the addition of other NDD comparison groups can demonstrate digital assessment tools’ specificity to ASD symptoms, at the stage of community screening and referral, it may be impractical to focus specifically on autism risk alone: a child who is at risk of a developmental delay but not autistic still needs a referral.2. Participant characteristics: Future studies should report participants’ demographic characteristics in detail. Studies would also benefit from using standardised measures to rule out ASD symptoms in the TD group, especially as we move towards more dimensional and nuanced measures proposed to characterise heterogeneity within groups. Mental age-matched TD comparison groups should be included in studies involving ASD participants with comorbid ID.3. Choice of device: While Android devices could be prioritised since they are cheaper and more widely available in LMICs, care must be taken to ensure that the sensitivity and accuracy of the device-derived metric is appropriate for the task delivered on the device, and that the delivery of stimuli and collection of responses will be robust to the fast pace of hardware development and marketing.4. Individual risk measures: Finally, while group differences are adequate to evaluate the potential of these novel technologies, the analytical methods should be refined to allow quantifying individual risk. Bayesian classification and ML methods have been employed by a few studies reviewed. Discriminative features from multiple developmental domains could serve as individual features in an ML algorithm designed to predict autism risk (Dawson & Sapiro, 2019; Jin et al., 2015; Kang et al., 2020; Liu et al., 2016). This multidimensional strategy would be akin to the standard practice of observing multiple behaviours for ASD diagnosis. Therefore, while any one of the features may not be enough to capture the full heterogeneity of the spectrum, the combination as determined through an ML approach may achieve higher degrees of sensitivity or specificity.5. Ecological validity: Taking advantage of rapid advancements in computer vision, future assessments should focus on computing social and motor metrics relevant to the autism phenotype from brief, automated tasks portable into homes, schools or other ecologically valid settings. Some examples include reciprocal social interactions, repetitive behaviours and sensory sensitivity during regular interactions of the child with their peers, teachers and parents in home or school settings or during solitary play, captured using cameras on tablet computers or smartphones.6. Patient and public involvement: Stakeholders and community members (e.g. community health workers) must be involved in planning and execution of the research, from the beginning.

ASD: autism spectrum disorder; TD: typically developing; NDD: neurodevelopmental disorders; ID: Intellectual Disability.

Limitations

As this review was limited to case–control studies, digital tools piloted or validated using other types of study designs will have been missed. Second, since we only included peer-reviewed published articles in English, emerging technologies that may have been presented in conferences or in other languages are not included. Third, the search was last updated in October 2020. This review does not cover new tools and new data (including some of our own) published beyond this date, posing a limitation in view of the rapid pace at which new technologies are introduced and evaluated in this dynamic area of research.

Conclusion

This review identifies and characterises digital tools for direct observational assessment of autism risk in early childhood that have the potential to scale in low-resource settings. This characterisation encompasses tasks and their associated metrics, developmental domains assessed, discriminative ability and details of implementation. Tasks assessing social and motor domains were found to be particularly promising and reliable in discriminating between ASD and TD groups. Their implementation on readily accessible technologies – half of them on portable devices, such as tablet computers and smartphones – coupled with objective output measures make them suitable for task-sharing with non-specialist providers. Novel methods, such as computer vision and ML, are increasingly being coupled with these tasks to allow for objective and automated analysis of data, leading to more in-depth and nuanced understanding of ASD symptoms and furthering their potential for task-sharing approaches and identification of autism risk at the individual level. The time is ripe for the field to move beyond pilot studies and small samples to large-scale, multinational validation studies, using prospective cross-sectional or longitudinal designs conceived, developed and implemented in collaboration with stakeholders and communities.

Supplemental Material

sj-pdf-1-aut-10.1177_13623613221133176 – Supplemental material for Digital tools for direct assessment of autism risk during early childhood: A systematic review

Supplemental material, sj-pdf-1-aut-10.1177_13623613221133176 for Digital tools for direct assessment of autism risk during early childhood: A systematic review by Debarati Mukherjee, Supriya Bhavnani, Georgia Lockwood Estrin, Vaisnavi Rao, Jayashree Dasgupta, Hiba Irfan, Bhismadev Chakrabarti, Vikram Patel and Matthew K Belmonte in Autism

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: D.M. was supported by the Department of Science and Technology-INSPIRE Faculty Award (2016/DST/INSPIRE/04-I/2016/000001). G.L.E. was supported by a Sir Henry Wellcome Fellowship (Wellcome Trust Grant No. 204706/Z/16/Z). The authors acknowledge funding from the Medical Research Council UK (STREAM, MR/S036423/1 awarded to B.C.).

Ethical approval

No ethical approval was required for the conduct of this study.

ORCID iDs

Georgia Lockwood Estrin

Bhismadev Chakrabarti

Matthew K Belmonte

Data availability statement

All relevant data used to prepare this review article are reported in this manuscript as tables and figures.

Supplemental material

Supplemental material for this article is available online.

References

Abaza

Marschollek

(2017). mHealth application areas and technology combinations: A comparison of literature from high and low/middle income countries. Methods of Information in Medicine, 56(S01), e105–e122. https://doi.org/10.3414/ME17-05-0003

Alcañiz

Chicchi-Giglioli

I. A.

Carrasco-Ribelles

L. A.

Marín-Morales

Minissi

M. E.

Teruel-García

Sirera

Abad

(2022). Eye gaze as a biomarker in the recognition of autism spectrum disorder using virtual reality and machine learning: A proof of concept for diagnosis. Autism Research, 15(1), 131–145. https://doi.org/10.1002/aur.2636

Alcañiz Raya

Marín-Morales

Minissi

M. E.

Teruel Garcia

Abad

Chicchi Giglioli

I. A

. (2020). Machine learning and virtual reality on body movements’ behaviors to classify children with autism spectrum disorder. Journal of Clinical Medicine, 9(5), Article 1260. https://doi.org/10.3390/jcm9051260

American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders (5th ed.).

Anzulewicz

Sobota

Delafield-Butt

J. T.

(2016). Toward the autism motor signature: Gesture patterns during smart tablet gameplay identify children with autism. Scientific Reports, 6(1), Article 31107. https://doi.org/10.1038/srep31107

Aresti-Bartolome

Garcia-Zapirain

(2015). Cognitive rehabilitation system for children with autism spectrum disorder using serious games: A pilot study. Bio-Medical Materials and Engineering, 26(Suppl. 1), S811–S824. https://doi.org/10.3233/BME-151373

Baron-Cohen

Leslie

A. M.

Frith

(1985). Does the autistic child have a ‘theory of mind’ ? Cognition, 21(1), 37–46. https://doi.org/10.1016/0010-0277(85)90022-8

Baxter

A. J.

Brugha

T. S.

Erskine

H. E.

Scheurer

R. W.

Vos

Scott

J. G.

(2015). The epidemiology and global burden of autism spectrum disorders. Psychological Medicine, 45(3), 601–613. https://doi.org/10.1017/S003329171400172X

Belmonte

M. K.

Saxena-Chandhok

Cherian

Muneer

George

Karanth

(2013). Oral motor deficits in speech-impaired children with autism. Frontiers in Integrative Neuroscience, 7, Article 47. https://doi.org/10.3389/fnint.2013.00047

10.

Bhavnani

Lockwood Estrin

Arora

Kumar

Kakra

Vajaratkar

Juneja

Gulati

Patel

Green

Divan

(2022). ‘I was confused . . . and still am’ Barriers impacting the help-seeking pathway for an autism diagnosis in urban North India: A mixed methods study. Journal of Autism and Developmental Disorders, 52(4), 1778–1788. https://doi.org/10.1007/s10803-021-05047-z

11.

Black

M. M.

Walker

S. P.

Fernald

L. C. H.

Andersen

C. T.

Digirolamo

A. M.

Mccoy

D. C.

Fink

Shawar

Y. R.

Shiffman

Devercelli

A. E.

Wodon

Q. T.

Vargas-Barón

Grantham-Mcgregor

(2017). Early childhood development coming of age: Science through the life course. The Lancet, 389(10064), 77–90. https://doi.org/10.1016/S0140-6736(16)31389-7

12.

Borsos

Gyori

(2017). Can automated facial expression analysis show differences between autism and typical functioning? Studies in Health Technology and Informatics, 242, 797–804. https://doi.org/10.3233/978-1-61499-798-6-797

13.

Bovery

Dawson

Hashemi

Sapiro

(2021). A scalable off-the-shelf framework for measuring patterns of attention in young children and its application in autism spectrum disorder. IEEE Transactions on Affective Computing, 12(3), 722–731. https://doi.org/10.1109/taffc.2018.2890610

14.

Bruinsma

Koegel

R. L.

Koegel

L. K.

(2004). Joint attention and children with autism: A review of the literature. Mental Retardation and Developmental Disabilities Research Reviews, 10(3), 169–175. https://doi.org/10.1002/mrdd.20036

15.

Campbell

Carpenter

K. L.

Hashemi

Espinosa

Marsan

Borg

J. S.

Chang

Qiu

Vermeer

Adler

Tepper

Egger

H. L.

Baker

J. P.

Sapiro

Dawson

(2019). Computer vision analysis captures atypical attention in toddlers with autism. Autism, 23(3), 619–628. https://doi.org/10.1177/1362361318766247

16.

Carlsson

Miniscalco

Gillberg

Asberg Johnels

J. A.

(2018). Assessing false-belief understanding in children with autism using a computer application: A pilot study. Journal of Psycholinguistic Research, 47(5), 1085–1099. https://doi.org/10.1007/s10936-018-9579-2

17.

Carpenter

K. L. H.

Hahemi

Campbell

Lippmann

S. J.

Baker

J. P.

Egger

H. L.

Espinosa

Vermeer

Sapiro

Dawson

(2021). Digital behavioral phenotyping detects atypical pattern of facial expression in toddlers with autism. Autism Research, 14(3), 488–499. https://doi.org/10.1002/aur.2391

18.

Cattaneo

Fabbri-Destro

Boria

Pieraccini

Monti

Cossu

Rizzolatti

(2007). Impairment of actions chains in autism and its possible role in intention understanding. Proceedings of the National Academy of Sciences of the United States of America, 104(45), 17825–17830. https://doi.org/10.1073/pnas.0706273104

19.

Chaminade

Rosset

Da Fonseca

Hodgins

J. K.

Deruelle

(2015). Anthropomorphic bias found in typically developing children is not found in children with autistic spectrum disorder. Autism, 19(2), 248–251. https://doi.org/10.1177/1362361313512425

20.

Chen

Wang

Zhang

Wang

Liu

(2019). A pilot study on evaluating children with autism spectrum disorder using computer games. Computers in Human Behavior, 90, 204–214. https://doi.org/10.1016/j.chb.2018.08.057

21.

Chetcuti

Hudry

Grant

Vivanti

(2019). Object-directed imitation in autism spectrum disorder is differentially influenced by motoric task complexity, but not social contextual cues. Autism, 23(1), 199–211. https://doi.org/10.1177/1362361317734063

22.

Crippa

Forti

Perego

Molteni

(2013). Eye-hand coordination in children with high functioning autism and Asperger’s disorder using a gap-overlap paradigm. Journal of Autism and Developmental Disorders, 43(4), 841–850. https://doi.org/10.1007/s10803-012-1623-8

23.

Crocker

J. C.

Ricci-Cabello

Parker

Hirst

J. A.

Chant

Petit-Zeman

Evans

Rees

(2018). Impact of patient and public involvement on enrolment and retention in clinical trials: Systematic review and meta-analysis. British Medical Journal, 363, Article k4738. https://doi.org/10.1136/bmj.k4738

24.

Cuthbert

B. N.

(2014). The RDoC framework: Facilitating transition from ICD/DSM to dimensional approaches that integrate neuroscience and psychopathology. World Psychiatry, 13(1), 28–35. https://doi.org/10.1002/wps.20087

25.

Cuthbert

B. N.

(2020). The role of RDoC in future classification of mental disorders. Dialogues in Clinical Neuroscience, 22(1), 81–85. https://doi.org/10.31887/DCNS.2020.22.1/bcuthbert

26.

Dasgupta

Bhavnani

Estrin

G. L.

Mukherjee

Banerjee

Belmonte

M. K.

Chakrabarti

Divan

Dawson

Johnson

M. H.

McPartland

J. C.

Singh

N. C.

Patel

(2016). Translating neuroscience to the front lines: Point-of-care detection of neuropsychiatric disorders. The Lancet Psychiatry, 3(10), 915–917. https://doi.org/10.1016/S2215-0366(16)30186-9

27.

Dawson

Campbell

Hashemi

Lippmann

S. J.

Smith

Carpenter

Egger

Espinosa

Vermeer

Baker

Sapiro

(2018). Atypical postural control can be detected via computer vision analysis in toddlers with autism spectrum disorder. Scientific Reports, 8(1), Article 17008. https://doi.org/10.1038/s41598-018-35215-8

28.

Dawson

Sapiro

(2019). Potential for digital behavioral measurement tools to transform the detection and diagnosis of autism spectrum disorder. JAMA Pediatrics, 173(4), 305–306. https://doi.org/10.1001/jamapediatrics.2018.5269

29.

Deschamps

P. K. H.

Been

Matthys

(2014). Empathy and empathy induced prosocial behavior in 6- and 7-year-olds with autism spectrum disorder. Journal of Autism and Developmental Disorders, 44(7), 1749–1758. https://doi.org/10.1007/s10803-014-2048-3

30.

Divan

Bhavnani

Leadbitter

Ellis

Dasgupta

Abubakar

Elsabbagh

Hamdani

S. U.

Servili

Patel

Green

(2021). Annual Research Review: Achieving universal health coverage for young children with autism spectrum disorder in low- and middle-income countries: A review of reviews. Journal of Child Psychology and Psychiatry, 62(5), 514–535. https://doi.org/10.1111/jcpp.13404

31.

Doshi-Velez

Kohane

(2014). Comorbidity clusters in autism spectrum disorders: An electronic health record time-series analysis. Pediatrics, 133(1), e54–e63. https://doi.org/10.1542/peds.2013-0819

32.

Dowd

A. M.

McGinley

J. L.

Taffe

J. R.

Rinehart

N. J.

(2012). Do planning and visual integration difficulties underpin motor dysfunction in autism? A kinematic study of young children with autism. Journal of Autism and Developmental Disorders, 42(8), 1539–1548. https://doi.org/10.1007/s10803-011-1385-8

33.

Dubey

Bishain

Dasgupta

Bhavnani

Belmonte

M. K.

Gliga

Mukherjee

Estrin

G. L.

Johnson

M. H.

Chandran

Patel

Gulati

Divan

Chakrabarti

(2021). Using mobile health technology to assess childhood autism in low-resource community settings in India : An innovation to address the detection gap. https://doi.org/10.1101/2021.06.24.21259235

34.

Durkin

M. S.

Elsabbagh

Barbaro

Gladstone

Happe

Hoekstra

R. A.

Lee

L.-C. C.

Rattazzi

Stapel-Wax

Stone

W. L.

Tager-Flusberg

Thurm

Tomlinson

Shih

(2015). Autism screening and diagnosis in low resource settings: Challenges and opportunities to enhance research and services worldwide. Autism Research, 8(5), 473–476. https://doi.org/10.1002/aur.1575

35.

Estes

Munson

Rogers

S. J.

Greenson

Winter

Dawson

(2015). Long-term outcomes of early intervention in 6-year-old children with autism spectrum disorder. Journal of the American Academy of Child and Adolescent Psychiatry, 54(7), 580–587. https://doi.org/10.1016/j.jaac.2015.04.005

36.

Federici

Parma

Vicovaro

Radassao

Casartelli

Ronconi

(2020). Anomalous perception of biological motion in autism: A conceptual review and meta-analysis. Scientific Reports, 10(1), Article 4576. https://doi.org/10.1038/s41598-020-61252-3

37.

Flanagan

H. E.

Perry

Freeman

N. L.

(2012). Effectiveness of large-scale community-based intensive behavioral intervention: A waitlist comparison study exploring outcomes and predictors. Research in Autism Spectrum Disorders, 6(2), 673–682. https://doi.org/10.1016/j.rasd.2011.09.011

38.

Fleury

Kushki

Tanel

Anagnostou

Chau

(2013). Statistical persistence and timing characteristics of repetitive circle drawing in children with ASD. Developmental Neurorehabilitation, 16(4), 245–254. https://doi.org/10.3109/17518423.2012.758184

39.

Fournier

K. A.

Hass

C. J.

Naik

S. K.

Lodha

Cauraugh

J. H.

(2010). Motor coordination in autism spectrum disorders: A synthesis and meta-analysis. Journal of Autism and Developmental Disorders, 40(10), 1227–1240. https://doi.org/10.1007/s10803-010-0981-3

40.

Gale

C. M.

Eikeseth

Klintwall

(2019). Children with autism show atypical preference for non-social stimuli. Scientific Reports, 9(1), Article 10355. https://doi.org/10.1038/s41598-019-46705-8

41.

Gardiner

Hutchison

S. M.

Müller

Kerns

K. A.

Iarocci

(2017). Assessment of executive function in young children with and without ASD using parent ratings and computerized tasks of executive function. The Clinical Neuropsychologist, 31(8), 1283–1305. https://doi.org/10.1080/13854046.2017.1290139

42.

Gyori

Borsos

Stefanik

Jakab

Varga

Csákvári

(2018). Automated vs human recognition of emotional facial expressions of high-functioning children with autism in a diagnostic-technological context: Explorations via a bottom-up approach. Lecture Notes in Computer Science, 10896, 466–473. https://doi.org/10.1007/978-3-319-94277-3_72

43.

Halladay

A. K.

Bishop

Constantino

J. N.

Daniels

A. M.

Koenig

Palmer

Messinger

Pelphrey

Sanders

S. J.

Singer

A. T.

Taylor

J. L.

Szatmari

(2015). Sex and gender differences in autism spectrum disorder: Summarizing evidence gaps and identifying emerging areas of priority. Molecular Autism, 6, Article 36. https://doi.org/10.1186/s13229-015-0019-y

44.

Haswell

C. C.

Izawa

Dowell

L. R.

Mostofsky

S. H.

Shadmehr

(2009). Representation of internal models of action in the autistic brain. Nature Neuroscience, 12(8), 970–972. https://doi.org/10.1038/nn.2356

45.

Herrero

Brusque Crocetta

Massetti

Pena

Moraes

Lopes Trevizan

Guarnieri

Pena

Rezende

Villaça

K. P.

Bandeira De Mello Monteiro

(2015). Total reaction time performance of individuals with autism after a virtual reality task. International Journal of Neurorehabilitation, 2(5), Article 189. https://doi.org/10.4172/2376-0281.1000189

46.

Hetzroni

O. E.

Hessler

Shalahevich

(2019). Learning new relational categories by children with autism spectrum disorders, children with typical development and children with intellectual disabilities: Effects of comparison and familiarity on systematicity. Journal of Intellectual Disability Research, 63(6), 564–575. https://doi.org/10.1111/jir.12598

47.

Istepanian

R. S. H.

AlAnzi

(2020). Mobile health (m-health): Evidence-based progress or scientific retrogression. In Dagan Feng

(ed.), Biomedical information technology (2nd ed., pp. 717–733). Academic press. https://doi.org/10.1016/b978-0-12-816034-3.00022-5

48.

Jin

Wee

C. Y.

Shi

Thung

K. H.

Yap

P. T.

Shen

(2015). Identification of infants at high-risk for autism spectrum disorder using multiparameter multiscale white matter connectivity networks. Human Brain Mapping, 36(12), 4880–4896. https://doi.org/10.1002/hbm.22957

49.

Joanna Briggs Institute. (2017). Critical Appraisal Checklist for Case Control Studies. https://jbi.global/sites/default/files/2019-05/JBI_Critical_Appraisal-Checklist_for_Case_Control_Studies2017_0.pdf

50.

Johnson

B. P.

Lum

J. A. G.

Rinehart

N. J.

Fielding

(2016). Ocular motor disturbances in autism spectrum disorders: Systematic review and comprehensive meta-analysis. Neuroscience and Biobehavioral Reviews, 69, 260–279. https://doi.org/10.1016/j.neubiorev.2016.08.007

51.

Jones

R. M.

Tarpey

Hamo

Carberry

Brouwer

Lord Rebecca

(2018). Statistical learning is associated with autism symptoms and verbal abilities in young children with autism. Journal of Autism and Developmental Disorders, 48(10), 3551–3561. https://doi.org/10.1007/s10803-018-3625-7

52.

Jung

K. E.

Lee

H. J.

Lee

Y. S.

Lee

J. H.

(2006). Efficacy of sensory integration treatment based on virtual reality – Tangible interaction for children with autism. Psychnology Journal, 4(2), 145–159.

53.

Jyoti

Lahiri

(2020). Human-Computer Interaction based Joint Attention cues: Implications on functional and physiological measures for children with autism spectrum disorder. Computers in Human Behavior, 104, Article 106163. https://doi.org/10.1016/j.chb.2019.106163

54.

Jyoti

Lahiri

(2022). Portable joint attention skill training platform for children with autism. IEEE Transactions on Learning Technologies, 15(2), 290–300. https://doi.org/10.1109/TLT.2022.3169964

55.

Kang

Han

Song

Niu

(2020). The identification of children with autism spectrum disorder by SVM approach on EEG and eye-tracking data. Computers in Biology and Medicine, 120, Article 103722. https://doi.org/10.1016/j.compbiomed.2020.103722

56.

Kasari

Gulsrud

Freeman

Paparella

Hellemann

(2012). Longitudinal follow-up of children with autism receiving targeted interventions on joint attention and play. Journal of the American Academy of Child and Adolescent Psychiatry, 51(5), 487–495. https://doi.org/10.1016/j.jaac.2012.02.019

57.

Khowaja

M. K.

Hazzard

A. P.

Robins

D. L.

(2015). Sociodemographic barriers to early detection of autism: Screening and evaluation using the M-CHAT, M-CHAT-R, and follow-up. Journal of Autism and Developmental Disorders, 45(6), 1797–1808. https://doi.org/10.1007/s10803-014-2339-8

58.

Kindregan

Gallagher

Gormley

(2015). Gait deviations in children with autism spectrum disorders: A review. Autism Research and Treatment, 2015, Article 741480. https://doi.org/10.1155/2015/741480

59.

Kmet

L. M.

Lee

R. C.

Cook

L. S.

(2004). Standard quality assessment criteria for evaluating primary research papers from a variety of fields. Alberta Heritage Foundation for Medical Research. https://www.ihe.ca/download/standard_quality_assessment_criteria_for_evaluating_primary_research_papers_from_a_variety_of_fields.pdf

60.

Kumm

A. J.

Viljoen

de Vries

P. J.

(2022). The digital divide in technologies for autism: Feasibility considerations for low- and middle-income countries. Journal of Autism and Developmental Disorders, 52(2), 2300–2313. https://doi.org/10.1007/s10803-021-05084-8

61.

Lartseva

Dijkstra

Kan

C. C.

Buitelaar

J. K.

(2014). Processing of emotion words by patients with autism spectrum disorders: Evidence from reaction times and EEG. Journal of Autism and Developmental Disorders, 44(11), 2882–2894. https://doi.org/10.1007/s10803-014-2149-z

62.

Levy

S. E.

Giarelli

Lee

L. C.

Schieve

L. A.

Kirby

R. S.

Cunniff

Nicholas

Reaven

Rice

C. E.

(2010). Autism spectrum disorder and co-occurring developmental, psychiatric, and medical conditions among children in multiple populations of the United States. Journal of Developmental and Behavioral Pediatrics, 31(4), 267–275. https://doi.org/10.1097/DBP.0b013e3181d5d03b

63.

Leung

M. T.

(2020). Relations between verb factivity and first-order and second-order false belief understanding: Evidence from Mandarin-speaking typically developing children and children with autism spectrum disorders. Clinical Linguistics and Phonetics, 34(1–2), 185–200. https://doi.org/10.1080/02699206.2019.1628810

64.

Zhong

Han

Ouyang

Liu

(2020). Classifying ASD children with LSTM based on raw videos. Neurocomputing, 390, 226–238. https://doi.org/10.1016/j.neucom.2019.05.106

65.

Zhang

(2016). Brief report: Sensitivity of children with autism spectrum disorders to face appearance in selective trust. Journal of Autism and Developmental Disorders, 46(7), 2520–2525. https://doi.org/10.1007/s10803-016-2761-1

66.

Lin

C. S.

Chang

S. H.

Liou

W. Y.

Tsai

Y. S.

(2013). The development of a multimedia online language assessment tool for young children with autism. Research in Developmental Disabilities, 34(10), 3553–3565. https://doi.org/10.1016/j.ridd.2013.06.042

67.

Liu

(2016). Identifying children with autism spectrum disorder based on their face processing abnormality: A machine learning framework. Autism Research, 9(8), 888–898. https://doi.org/10.1002/aur.1615

68.

Fang

(2019). The perceived social context modulates rule learning in autism. Journal of Autism and Developmental Disorders, 49(11), 4698–4706. https://doi.org/10.1007/s10803-019-04174-y

69.

Lum

J. A. G.

Shandley

Albein-Urios

Kirkovski

Papadopoulos

Wilson

R. B.

Enticott

P. G.

Rinehart

N. J.

(2021). Meta-analysis reveals gait anomalies in autism. Autism Research, 14(4), 733–747. https://doi.org/10.1002/aur.2443

70.

Mahmoudi-Nejad

Moradi

Pouretemad

H. R.

(2017). The differences between children with autism and typically developed children in using a hand-eye-coordination video game. Lecture Notes in Computer Science, 10586, 256–264. https://doi.org/10.1007/978-3-319-67585-5_27

71.

Marlow

Servili

Tomlinson

(2019). A review of screening tools for the identification of autism spectrum disorders and developmental delay in infants and young children: Recommendations for use in low- and middle-income countries. Autism Research, 12(2), 176–199. https://doi.org/10.1002/aur.2033

72.

Martin

K. B.

Hammal

Ren

Cohn

J. F.

Cassell

Ogihara

Britton

J. C.

Gutierrez

Messinger

D. S.

(2018). Objective measurement of head movement differences in children with and without autism spectrum disorder. Molecular Autism, 9, Article 14. https://doi.org/10.1186/s13229-018-0198-4

73.

Mastergeorge

A. M.

Kahathuduwa

Blume

(2021). Eye-tracking in infants and young children at risk for autism spectrum disorder: A systematic review of visual stimuli in experimental paradigms. Journal of Autism and Developmental Disorders, 51(8), 2578–2599. https://doi.org/10.1007/s10803-020-04731-w

74.

McKinney

Weisblatt

E. J. L.

Hotson

K. L.

Bilal Ahmed

Dias

BenShalom

Foster

Murphy

Villar

S. S.

Belmonte

M. K.

(2021). Overcoming hurdles to intervention studies with autistic children with profound communication difficulties and their families. Autism, 25(6), 1627–1639. https://doi.org/10.1177/1362361321998916

75.

Millar

McConnachie

Minnis

Wilson

Thompson

Anzulewicz

Sobota

Rowe

Gillberg

Delafield-Butt

(2019). Phase 3 diagnostic evaluation of a smart tablet serious game to identify autism in 760 children 3-5 years old in Sweden and the United Kingdom. BMJ Open, 9(7), Article e026226. https://doi.org/10.1136/bmjopen-2018-026226

76.

Moradi

Amiri

S. E.

Ghanavi

Aarabi

B. N.

Pouretemad

H. R.

(2017). Autism screening using an intelligent toy car. Lecture Notes in Computer Science, 10586, 817–827. https://doi.org/10.1007/978-3-319-67585-5_79

77.

Mukherjee

S. B.

Aneja

Krishnamurthy

Srinivasan

(2014). Incorporating developmental screening and surveillance of young children in office practice. Indian Pediatrics, 51(8), 627–635. https://doi.org/10.1007/s13312-014-0465-1

78.

Nakai

Takashima

Takiguchi

Takada

(2014). Speech intonation in children with autism spectrum disorder. Brain and Development, 36(6), 516–522. https://doi.org/10.1016/j.braindev.2013.07.006

79.

Naslund

J. A.

Shidhaye

Patel

(2019). Digital technology for building capacity of nonspecialist health workers for task sharing and scaling up mental health care globally. Harvard Review of Psychiatry, 27(3), 181–192. https://doi.org/10.1097/HRP.0000000000000217

80.

O’Reilly

Lewis

J. D.

Elsabbagh

(2017). Is functional brain connectivity atypical in autism? A systematic review of EEG and MEG studies. PLOS ONE, 12(5), Article e0175870. https://doi.org/10.1371/journal.pone.0175870

81.

Osei

Mashamba-Thompson

T. P.

(2021). Mobile health applications for disease screening and treatment support in low-and middle-income countries: A narrative review. Heliyon, 7(3), Article e06639. https://doi.org/10.1016/j.heliyon.2021.e06639

82.

Ouzzani

Hammady

Fedorowicz

Elmagarmid

(2016). Rayyan – a web and mobile app for systematic reviews. Systematic Reviews, 5(1), Article 210. https://doi.org/10.1186/s13643-016-0384-4

83.

Papagiannopoulou

E. A.

Chitty

K. M.

Hermens

D. F.

Hickie

I. B.

Lagopoulos

(2014). A systematic review and meta-analysis of eye-tracking studies in children with autism spectrum disorders. Social Neuroscience, 9(6), 610–632. https://doi.org/10.1080/17470919.2014.934966

84.

Patra

Patro

B. K.

Padhy

S. K.

(2020). Symptom recognition to diagnosis: Pathway to care for autism in a tertiary care medical centre. Journal of Neurosciences in Rural Practice, 11(1), 164–169. https://doi.org/10.1055/s-0040-1701778

85.

Rafique

Fatima

Dastagir

Mahmood

Hussain

(2019, November 1–2). Autism identification and learning through motor gesture patterns [Conference session]. 2019 International Conference on Innovative Computing (ICIC), Lahore, Pakistan, pp. 1–7. https://doi.org/10.1109/ICIC48496.2019.8966740

86.

Rinehart

N. J.

Bellgrove

M. A.

Tonge

B. J.

Brereton

A. V.

Howells-Rankin

Bradshaw

J. L.

(2006). An examination of movement kinematics in young people with high-functioning autism and Asperger’s disorder: Further evidence for a motor planning deficit. Journal of Autism and Developmental Disorders, 36(6), 757–767. https://doi.org/10.1007/s10803-006-0118-x

87.

Ruta

Fama

F. I.

Bernava

G. M.

Leonardi

Tartarisco

Falzone

Pioggia

Chakrabarti

(2017). Reduced preference for social rewards in a novel tablet based task in young children with autism spectrum disorders. Scientific Reports, 7, Article 3329. https://doi.org/10.1038/s41598-017-03615-x

88.

Sapiro

Hashemi

Dawson

(2019). Computer vision and behavioral phenotyping: An autism case study. Current Opinion in Biomedical Engineering, 9, 14–20. https://doi.org/10.1016/j.cobme.2018.12.002

89.

Sato

Uono

(2019). The atypical social brain network in autism: Advances in structural and functional MRI studies. Current Opinion in Neurology, 32(4), 617–621. https://doi.org/10.1097/WCO.0000000000000713

90.

Schmitz

Daly

Murphy

(2007). Frontal anatomy and reaction time in Autism. Neuroscience Letters, 412(1), 12–17. https://doi.org/10.1016/j.neulet.2006.07.077

91.

Shahab

Taheri

Hosseini

S.R.

Mokhtari

Meghdari

Alemi

Pouretemad

Shariati

Pour

A.G.

(2017). Social virtual reality robot (V2R): A novel concept for education and rehabilitation of children with autism [Conference session]. 5th RSI International Conference on Robotics and Mechatronics (ICRoM), Tehran, Iran, pp. 82–87. https://doi.org/10.1109/ICRoM.2017.8466148

92.

Staniszewska

Denegri

Matthews

Minogue

(2018). Reviewing progress in public involvement in NIHR research: Developing and implementing a new vision for the future. BMJ Open, 8(7), Article e017124. https://doi.org/10.1136/bmjopen-2017-017124

93.

Stevens

Dixon

D. R.

Novack

M. N.

Granpeesheh

Smith

Linstead

(2019). Identification and analysis of behavioral phenotypes in autism spectrum disorder via unsupervised machine learning. International Journal of Medical Informatics, 129, 29–36. https://doi.org/10.1016/j.ijmedinf.2019.05.006

94.

Stewart

L. A.

Lee

L. C.

(2017). Screening for autism spectrum disorder in low- and middle-income countries: A systematic review. Autism, 21(5), 527–539. https://doi.org/10.1177/1362361316677025

95.

Tunç

Yankowitz

L. D.

Parker

Alappatt

J. A.

Pandey

Schultz

R. T.

Verma

(2019). Deviation from normative brain development is associated with symptom severity in autism spectrum disorder. Molecular Autism, 10, Article 46. https://doi.org/10.1186/s13229-019-0301-5

96.

van den Boomen

Fahrenfort

J. J.

Snijders

T. M.

Kemner

. (2019). Slow segmentation of faces in autism spectrum disorder. Neuropsychologia, 127, 1–8. https://doi.org/10.1016/j.neuropsychologia.2019.02.005

97.

Veenstra

van Geert

P. L. C.

van der Meulen

B. F.

(2012). Distinguishing and improving mouse behavior with educational computer games in young children with autistic spectrum disorder or attention deficit/hyperactivity disorder: An executive function-based interpretation. Mind, Brain, and Education, 6(1), 27–40. https://doi.org/10.1111/j.1751-228X.2011.01131.x

98.

von Hofsten

Rosander

. (2012). Perception-action in children with ASD. Frontiers in Integrative Neuroscience, 6, Article 115. https://doi.org/10.3389/fnint.2012.00115

99.

Weisblatt

E. J.

Langensiepen

C. S.

Cook

Dias

Plaisted Grant

Dhariwal

Fairclough

M. S.

Friend

S. E.

Malone

A. E.

Varga-Elmiyeh

Rybicki

Karanth

Belmonte

M. K.

(2019). A tablet computer-assisted motor and language skills training program to promote communication development in children with autism: Development and pilot study. International Journal of Human-Computer Interaction, 35(8), 643–665. https://doi.org/10.1080/10447318.2018.1550176

100.

Wijesinghe

Samarasinghe

Seneviratne

Yogarajah

Pulasinghe

(2019). Machine learning based automated speech dialog analysis of autistic children [Conference session]. 11th International Conference on Knowledge and Systems Engineering (KSE), Da Nang, Vietnam, pp. 1–5. https://doi.org/10.1109/KSE.2019.8919266

101.

Witham

M. D.

Anderson

Carroll

Dark

P. M.

Down

Hall

A. S.

Knee

Maier

R. H.

Mountain

G. A.

Nestor

Oliva

Prowse

S. R.

Tortice

Wason

Rochester

(2020). Developing a roadmap to improve trial delivery for under-served groups: Results from a UK multi-stakeholder process. Trials, 21, Article 694. https://doi.org/10.1186/s13063-020-04613-7

102.

World Bank. (2016). World Development Report 2016 – Digital dividends. Accessed from documents.worldbank.org/curated/en/896971468194972881/pdf/102725-PUB-Replacement-PUBLIC.pdf

103.

World Health Organization. (2020). Improving early childhood development: WHO guideline. https://apps.who.int/iris/handle/10665/331306

104.

World Health Organization Global Observatory for eHealth. (2011). mHealth: New horizons for health through mobile technologies: Second global survey on eHealth. https://apps.who.int/iris/handle/10665/44607

105.

Zhao

(2020). Research and development of autism diagnosis information system based on deep convolution neural network and facial expression data. Library Hi Tech, 38(4), 799–817. https://doi.org/10.1108/LHT-08-2019-0176

106.

Zwaigenbaum

Bryson

Rogers

Roberts

Brian

Szatmari

(2005). Behavioral manifestations of autism in the first year of life. International Journal of Developmental Neuroscience, 23(2–3), 143–152. https://doi.org/10.1016/j.ijdevneu.2004.05.001

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

2.61 MB