Abstract
Musical experiences are ubiquitous in early childhood. Beyond potential benefits of musical activities for young children with typical development, there has long been interest in harnessing music for therapeutic purposes for individuals with autism spectrum disorder (ASD). However, there is debate as to the effectiveness of these approaches and thus a need to identify mechanisms of change (or active ingredients) by which musical experiences may impact social development in young children with ASD. In this review, we introduce the PRESS-Play framework, which conceptualizes musical activities for young children with ASD within an applied behavior analysis framework consistent with the principles of naturalistic developmental behavioral interventions. Specifically, the PRESS-Play framework proposes that musical activities support key elements of evidence-based approaches for social engagement including predictability, reinforcement, emotion regulation, shared attention, and social play context, providing a platform for delivery and receipt of social and behavioral instruction via a transactional, developmental approach. PRESS-Play considers that these factors may impact not only the child with ASD but also their interaction partner, such as a parent or peer, creating contexts conducive for validated social engagement and interaction. These principles point to focused theories of change within a clinical-translational framework in order to experimentally test components of social-musical engagement and conduct rigorous, evidence-based intervention studies.
Keywords
Introduction
Social musical experiences are both natural and ubiquitous in early childhood interactions between parents and their children, and children and their peers (Politimou et al., 2018; Trainor & Cirelli, 2015; Trehub & Cirelli, 2019). Beginning in infancy, singing and musical play capture and maintain children’s attention, modulate their arousal levels, and provide redundant and highly salient cues (Cirelli et al., 2019; Nakata & Trehub, 2004; Trehub et al., 2016). In addition, children use child-directed singing as a meaningful marker of social attention and affiliation, showing increased attention to, social preference for, and prosocial behavior toward people who sing songs from their parents’ repertoire (Cirelli & Trehub, 2019; Cirelli, Trehub, & Trainor, 2018; Mehr et al., 2016). Shared musical activities that promote synchronous movement between children and their peers or adults support prosocial behaviors such as helping and cooperation (Cirelli et al., 2014; Kirschner & Tomasello, 2009). Behavior analysis of music engagement during early typical development suggests that musical interaction directly and incidentally scaffolds children’s social engagement with their parents and their peers.
Beyond potential social benefits of music for typically developing (TD) children and families, there has long been substantial interest in harnessing music for therapeutic goals in children with disabilities (see Alley, 1979) including autism spectrum disorder (ASD; see Alvin & Warwick, 1978). ASD is a common (1 in 59 children) and lifelong neurodevelopmental disorder characterized by impairments in social communication and interaction as well as restricted and repetitive behaviors (American Psychiatric Association, 2013; Baio et al., 2018). The earliest reports on ASD suggested interest in and propensity for music (Kanner, 1943). Subsequent behavioral studies generally reflected relatively preserved skills in areas such as pitch perception (e.g., Heaton, 2003) and musical emotion recognition (e.g., Quintin et al., 2011). More recent neuroimaging studies reported typical neural processing of musical or sung stimuli in children with ASD (Caria et al., 2011; Sharda et al., 2015). Some children with ASD have selective preferences for musical activities (e.g., listening to music, playing an instrument) and will repeatedly engage in and gain proficiency with musical activities (Bennett & Heaton, 2012; Happe & Frith, 2010; Kanner, 1943).
Music perception and production engages multiple areas and networks of the brain involving sensory, motor, emotional, attentional, and social processes leading to hypotheses that musical experiences may transfer to non-musical skills due to shared neural resources (e.g., Patel, 2011; Wan & Schlaug, 2010). These hypotheses have been applied to music therapy approaches for children with ASD, particularly through the lens of musical experiences impacting neural pathways involved in attention, sensory, motor, and emotional processes that support social communication (Janzen & Thaut, 2018; Overy & Molnar-Szakacs, 2009; Sharda et al., 2018; Trevarthen, 2002; Wan et al., 2011; see Janzen & Thaut (2018), Geretsegger et al. (2014) or Srinivasan & Bhat (2013) for reviews of music therapy approaches and findings). There are credible, albeit limited, studies supporting musical engagement for non-musical social and communicative goals for children with ASD, including those drawn from music therapy (Geretsegger et al., 2014; Kim et al., 2008; Sharda et al., 2018) and music education (Cook et al., 2018; Kern et al., 2007; McFerran et al., 2016) literatures. At the same time, a large-scale improvisational music therapy trial did not yield effects on autism symptoms relative to a non-music treatment (e.g., Bieleninik et al., 2017; though note concerns with intervention fidelity and outcome measures) and debates continue over the effectiveness of specific therapeutic approaches (Accordino et al., 2007; Sharda et al., 2019; Sreewichian, 2019). There is an urgent need to objectively identify the core components (also termed active ingredients) of interventions for children with ASD (Pellecchia et al., 2015). Identifying the active ingredients that underlie the mechanisms of change by which musical experiences may impact social engagement will help to refine and advance musical interventions for young children with ASD (Cheever et al., 2018; Collins & Fleming, 2017).
The long-standing focus in behavioral treatments for young children with ASD is expanding from analog, clinician-implemented techniques to include parent-implemented interventions and naturalistic developmental behavioral interventions (NDBIs) that incorporate behavior analytic principles and the developmental context of skill development (e.g., Pivotal Response Treatment (Koegel et al., 2016); see Schreibman et al., 2015 for a review). Such approaches teach parents skills to support their interactions with their children during natural play interactions and daily routines, providing a scaffolding therapeutic context for the development of foundational social skills. In the current paper, we introduce the PRESS-Play framework, which draws upon behavioral and neuroscientific principles to consider musical engagement as a potential NDBI
In the PRESS-Play framework, we propose that the essential elements of musical engagement experiences that align with evidence-based NDBI approaches for social engagement skill development in young children with ASD include: (1)
The PRESS-Play Framework
Social Play
Musical play is a natural and ecologically valid play context for parents and children across developmental ability levels and disability typologies. Musical play occurs in all cultures, starting with parents’ use of infant-directed singing from early infancy, building up to dyadic, triadic, and group musical play games of early childhood (Custodero, 2006; Ilari et al., 2011; Politimou et al., 2018; Trehub & Cirelli, 2018; Trevarthen, 1999). As musical activities are widely employed, natural, interactive, and child-directed, they may serve as an intervention context congruent with NDBIs. Contexts that are natural, motivating, and easily learned may assist the child with ASD and also their interaction partner (such as a parent or peer) in the seamless delivery of strategies that provide opportunities and natural reinforcers for the goal of social engagement.
Like other forms of play (e.g., symbolic play, see Kasari et al., 2012), musical play activities involve social attention, language, symbolic play, and motor activities, providing opportunities to target goals across multiple domains simultaneously within a naturalistic, shared, transactional context. The play activities can be readily adapted to focus on different skills within a given repertoire or adjusted to different developmental levels for a child with ASD. For example, an initial focus might be to use animated singing and movement activities to capture a child’s attention, later to engage in imitation of hand movements that accompany a song, and later to incorporate pretend play when acting out the song; whichever context is engaging, motivating, and developmentally appropriate for a child with ASD (see Figure 1). The naturally exaggerated interaction style of rhythmic facial expressions and movements used during child-directed singing corresponds to techniques of being animated and expressive when engaged in therapy with children with ASD and may incidentally render these important social and linguistic cues more salient and teachable than in other social contexts.

An example of the use of social musical interactions to support a child’s development of gestures and imitation. Arrows depict the transactional relationship between the child and parent in advancing the activity and associated skills over time.
Musical activities also provide a highly familiar and routinized play context with clear expectations and predictable routines. For example, a nursery rhyme like “Happy and You Know It” follows the same general structure whether it is sung by a parent, teacher, therapist, or peer, and includes music, vocabulary, gestures and social engagement in a predictable, repeated, and, for many children, motivating and engaging platform. This, in turn, may provide a transferrable naturally occurring context for a child with ASD to generalize skills to different interaction partners or settings, which may be helped because the interaction partner knows what is expected of them, as well. The use of regular and consistent behavioral and linguistic strategies has long been known to support skill generalization across settings and communication partners for children with ASD (Kaiser et al., 2000; Reisinger, 1978). Thus, musical activities may incidentally provide a context that naturally delivers previously-identified evidence-based strategies often used when targeting social engagement and language development.
Predictability
Musical routines involve predictable and salient rhythm and timing patterns. Beginning in infancy, parents’ use of rhythmic infant-directed speech and song attracts and maintains their child’s attention and creates engaging and motivating patterns in gaze and in vocalizations (Cirelli, Trehub, Trainor, 2018; Nakata & Trehub, 2004; Trehub & Gudmundsdottir, 2015). Social musical play heightens the structure and predictability of these
Attentional entrainment supported by the predictability of interactive musical activities thus has elements that directly and indirectly support social engagement and communication development in typical and atypical populations. The predictability provided by the rhythmic structure of musical activities may modulate attention to scaffold the timing of attention to important social events and draw attention to multiple linguistic elements embedded in the songs. For example, typically developing infants and toddlers modulate their gaze to the eyes of a singer when observing infant-directed singing based on the rhythmic structure (Lense & Jones, 2016).
An array of disruptions in attention are reported in children with ASD, including impairments in attention attainment, disengagement, restricted attention on atypical environmental elements, and reduced initiation of and response to joint attention, which involves using or following cues to share attention with another person and relate to social and language development (Jones et al., 2017; Mansour et al., 2017) so that directly and/or indirectly supporting attention is likely to be a useful ingredient in intervention. Similarly, many individuals with ASD exhibit rhythmic timing difficulties in nonmusical contexts, particularly attending to socially-embedded activities such as coordination of vocal turn-taking, prosodic stress patterns, and synchrony of speech and gestures (Marchena & Eigsti, 2010; Paul et al., 2005; Ramsay, 2015; Shriberg et al., 2001). Although studies of rhythm skills in individuals with ASD are limited, a growing body of work has focused on the incidental benefits of the rhythmic elements of music for interventions in this population (e.g., Srinivasan et al., 2016; Srinivasan et al., 2015; Yoo & Kim, 2018). A rhythmically predictable signal that structures a musical interaction may help both partners in an interaction time the delivery and receipt of meaningful information while providing important redundancy in social and linguistic contexts and cues (i.e., co-occurrence of multiple cues providing the same information). Indeed, increased eye contact and turn-taking have been observed during therapeutic musical activities vs. non-musical play in children with ASD (Kim et al., 2008; LaGasse, 2015; Srinivasan et al., 2016) and limited studies have reported improvements in joint attention following participation in music therapy (Kim et al., 2008; Srinivasan et al., 2016), which could be direct or indirect byproducts of the inherent predictability and salience of musical activities.
Musical activities also support sensory-motor integration due to the tight coupling of, for example, auditory and motor activity (Zimmerman & Lahav, 2012). Musical beat and rhythm perception are reliant on auditory-motor pathways and support entrainment and coordination of motor activity with systematic and predictable auditory and visual routines (Grahn et al., 2011; Grahn & McAuley, 2009). Music training has long been studied as an example of multisensory auditory-motor neuroplasticity in TD individuals (Hyde et al., 2009), but musical activities have only recently been examined in individuals with ASD in the context of exploring the role of simultaneous (and integrated) auditory-motor coordination for learning. For example, Auditory-Motor Mapping Training, which is based on Melodic Intonation Therapy previously developed for stroke patients (Schlaug et al., 2009), employs the explicit practice of linking motor movements (tapping on tuned drums) with sung (or intoned) speech to improve speech production in children with ASD (Chenausky et al., 2016; Chenausky et al., 2017; Wan et al., 2011). By tapping on the drums for each syllable produced, the training uses predictable musical routines to practice multisensory, auditory-motor skills important for speech development. In preliminary investigations, school-aged children participating in Auditory-Motor Mapping Training increased consonant and vowel speech production skills versus children in a control Speech Repetition therapy (Chenausky et al., 2016; Chenausky et al., 2017). In a separate, randomized controlled trial, school-aged children with autism participated in music therapy sessions that used music (musical instruments, songs, and rhythmic cues) to target communication, turn-taking, sensorimotor integration, and social appropriateness or a parallel non-music play therapy (Sharda et al., 2018). Children in the music therapy condition exhibited increased auditory-motor functional connectivity post-intervention compared to children in a parallel non-music play therapy (Sharda et al., 2018). Functional connectivity levels between multiple auditory and subcortical areas (thalamus, striatum) post-music therapy was associated with parent-reported communication skills (Sharda et al., 2018).
Improvements in intrapersonal movement coordination may also support increased interpersonal movement coordination. Musical activities may in part be a helpful therapeutic context because they provide a readily available (and ubiquitous) context with a clearer interaction signal (e.g., due to
Predictable rhythmic stimuli are also salient to infants and young children in part because they specify information cross-modally (Bahrick & Lickliter, 2004; Lewkowicz, 2003). For example, during child-directed singing, parents often use large visual facial expressions, movements and gestures, touch, and vestibular inputs (e.g., bouncing or rocking), all of which are temporally synchronized to the auditory rhythmic signal. These non-auditory rhythmic streams are also important for maintaining attention: TD infants attend longer to visual-only examples of singing vs. speech production (Trehub et al., 2013). Rhythmic entrainment not only modulates attention cross-modally (Bolger et al., 2013; Escoffier et al., 2015; Miller et al., 2013) but may also impact integration of cross-modal information (Su, 2014). Though speculative, given established difficulties with cross-modal integration in people with ASD (Stevenson et al., 2014), musical play may provide a context for supporting the modulation and integration of attention cross-modally in children with ASD.
Reinforcement and Emotion
Social musical experiences are motivating and autoreinforcing for many individuals (Lim, 2010). Positive emotional experience that many individuals have with music may facilitate music’s reward value for learning social goals. For example, in a series of studies set in different sound environments, infants who were moved (by bouncing the infant up and down) synchronously with an adult demonstrated more prosocial helping behaviors toward the adult versus infants who were moved asynchronously (Cirelli et al., 2014; Cirelli et al., 2017). Infants exhibited greater spontaneous helping, as well as reduced fussiness, when the bouncing activity was accompanied by music rather than nonrhythmic nature sounds suggesting that music facilitated a more positive social experience (Cirelli et al., 2017). TD children demonstrate increases in smiles directed to peers during synchronous (vs. asynchronous) rhythm and movement games (Tunçgenç & Cohen, 2018).
Despite frequent disruptions in auditory sensitivities in individuals with ASD, music is reported to be a pleasant sensory experience, including as part of social interactions such as music and movement games with caregivers (Dickie et al., 2009; Lense, 2018). For at least some individuals with ASD, musical games and materials may therefore be potent natural reinforcers (i.e., reinforcers that are inherent to the child’s goal) that can support learning, increasing motivation to engage in therapeutic activities and resulting in increased practice of skills (see Ferster, 2017). Increased frequency of social behaviors such as eye contact and turn-taking are reliably observed during therapeutic musical play in children with ASD (Kim et al., 2008; LaGasse, 2015). Increased participation and reciprocity may in turn be reinforcing for the play partner leading to synergistic increases in interaction behavior, which is consistent with NDBI principles (Chevallier et al., 2012). The use of music as a natural reinforcer can also be paired with using music as an extrinsic positive reinforcer (e.g., access to a preferred musical toy or video or song as a consequence for completing an unrelated behavioral task).
Emotional and arousal modulation by music are the primary reasons people engage with music (Juslin & Västfjäll, 2008). From infancy, parents’ singing is a potent mood regulator. Infants will remain calm longer during parental singing vs. speaking, and maternal singing more effectively calms distressed infants than maternal speaking (Corbeil et al., 2016; Trehub et al., 2015). Singing to their infants also modulates parents’ arousal levels as indicated by physiological markers (cortisol, skin conductance) and self-report ratings (Cirelli, Jurewicz, & Trehub, 2019; Fancourt & Perkins, 2018). Difficulties with emotion/arousal regulation are common in people with ASD (Mazefsky et al., 2013) and may relate to factors such as altered arousal levels (e.g., hyperarousal) (Patriquin et al., 2019), impairments in describing and identifying emotions (alexithymia) (Kinnaird et al., 2019), reduced use of adaptive coping strategies (Samson et al., 2015), and comorbid mental health problems such as anxiety (Conner et al., 2020; Mazefsky et al., 2013). However, physiological responses to emotional music appear to be similar in individuals with ASD and TD individuals (Allen et al., 2013) and individuals with ASD report using music for arousal regulation (Allen et al., 2009).
In addition to the positive emotional experiences of music providing reinforcement, musical activities may also facilitate opportunities for learning and social connection due to arousal/emotion modulation. Emotional regulation is important for learning and memory including in relation to social communication skills (Prizant et al., 2003). Arousal regulation also promotes social bonding and cooperation independently and in interaction with movement synchrony (Jackson et al., 2018). Such arousal impacts of music may be important not only for a child with ASD but also for their interaction partners. Parents of children with ASD experience higher parenting stress and physiological markers of stress dysregulation (Hayes & Watson, 2013; Seltzer et al., 2010). High stress levels may impact parents’ ability to learn and apply skills learned in parent-mediated therapies and there is increasing awareness of the need to support parents in interventions including outcomes related to parent-child dynamics, parenting stress, and quality of life (Strauss et al., 2012; Wainer et al., 2017). Thus, another mechanism by which musical activities may have a role in supporting social engagement is by regulating arousal in all interaction partners to facilitate opportunities to learn and apply specific therapeutic skills.
Shared Attention
Reductions in the amount of, and reductions in the quality of, the shared attention episodes that do occur, are a key trait in individuals with ASD (see Franchini et al., 2018). When music is a viable motivator-reinforcer, joint musical play is an opportunity for shared attention between interaction partners. As noted previously, therapeutic musical activities are associated with increased eye contact to play partners (peers, therapists) in children with ASD as compared to during non-music play (Kim et al., 2008; LaGasse, 2015). Increased attention to others may be associated with musical facilitation of joint movement activity. TD children have been found to make more eye contact with peers during a synchronous vs. non-synchronous rhythmic movement game (Tunçgenç & Cohen, 2018) and more eye contact with an adult during musical as compared to non-musical play (Beck & Rieser, 2017). During music and dancing activities, TD adults demonstrate increased attention to other adults moving synchronously versus asynchronously with them (Woolhouse et al., 2016).
Positive shared attention (between parent-child or peer partners) during music is a form of naturally occurring social engagement arising from joint attention and coordination that can be reinforcing in itself and the impact can be bidirectional across social partners. For example,
Summary, Future Directions, and Caveats
The PRESS-Play framework offers a set of principles for the use of music to support social engagement in young children via a developmental framework that connects with current theories of therapeutic practice for children with ASD, particularly NDBIs. These elements are measurable and can be tested within the context of experimental and clinical trials. Musical activities may provide a context for targeting social engagement because they include components that may serve to support not only a child with ASD but also their interaction partner, increasing opportunities for joint engagement. Moreover, music is viewed herein as a platform for delivering know teaching responses to children with ASD.
We also argue that PRESS-Play suggests areas for further research in both assessment and intervention studies in order to experimentally test mechanisms of active musical experiences such as scaffolding parent-child engagement and joint attention. Despite the longstanding interest and widespread belief in the positive effects of music therapy, knowledge of music cognition broadly and
From an intervention perspective, PRESS-Play highlights the need for specifying the developmental context of musical interactions, incorporating parents or peers into the therapeutic context, and considering the impact of the active ingredients on the parents or peers included in the interaction. This may be reflected in dyadic measurements such as interpersonal movement coordination as well as specific behaviors from both partners in the therapeutic experience (e.g., physiological arousal, eye contact, or imitation by the child
Despite the long-standing considerable interest and enthusiasm for harnessing music for therapeutic purposes for social engagement in individuals with ASD, and the plethora of testimonials (e.g., Schwartzberg & Silverman, 2017; Thompson, 2017), there are also many caveats to be considered. For example, while the predictability of musical activities is hypothesized to support engagement by guiding attention and expectations, a focus on predictable rhythms could simultaneously support a preference for repetitive activities which could possibly reduce social engagement. The use of children’s musical toys (e.g., children’s percussion instruments) or other instruments likely needs to be monitored as they may support a focus on repetitive cause-and-effect activities and an overemphasis on instruments or routinized songs could preclude use of flexible speech (by the parent or child). Similarly, the potentially reinforcing nature of musical activities must be individually evaluated using strict behavioral criteria as musical activities may not be enjoyable and motivating for all children. Musical activities must not come at the expense of participating in an array of experiences and modalities. It is also possible that musical interactions primarily impact the interaction partner rather than the child with ASD and while PRESS-Play views this as a potential method of change via creating a context conducive to interactions, at this time the direction of effects or potential for interaction cascades is unknown. Particularly when considering the heterogeneity among individuals with ASD, clearly elucidated theories of change will help determine for whom such approaches may be most effective.
Conclusions
PRESS-Play proposes that social musical activities for young children align with the principles of NDBI approaches to supporting social engagement in children with ASD. Musical activities can create a transactional platform for scaffolding foundational social skills and providing direct and indirect opportunities for the delivery of social and linguistic information consistent with the active ingredients of NDBIs. Social musical interactions incorporate predictability, reinforcement, emotion modulation, and shared attention within an ecologically and developmentally appropriate social play framework. PRESS-Play highlights the need to consider the potential impact of musical experiences for all interaction partners (not just the child with ASD) and that potential mechanisms of action may arise from creating a context conducive to the delivery and receipt of meaningful social engagement for all involved parties. These principles suggest avenues for systematic research to identify the active ingredients of musical experiences and select appropriate intervention targets within a clinical-translational framework, in order to move forward our understanding and application of musical experiences consistent with evidence-based practices for children with ASD.
Footnotes
Author contributorship
ML and SC conceived the article, researched the literature, and wrote and edited the manuscript and approved the final version.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Action editor
Katie Overy, University of Edinburgh, Reid School of Music.
Peer review
Adam Ockelford, University of Roehampton, Applied Music Research Centre.
One anonymous reviewer.
