Abstract
The dance art form involves audiovisual dual modality perception, and research is unclear regarding the original accompanying music, the audience's emotional response to the dance/music, and the impact of dance training on the audiovisual dual modality recognition and experience of emotion in dance. This study is a mixed experimental design that was adopted: 2 (presentation modality: audiovisual bimodal vs. visual modality only) × 2 (dance training experience: dance major vs. non-dance major) × 2 (emotional valence: happiness vs. sadness). The results show that audiovisual dual modality significantly impacts emotional experience but not emotional recognition. Additionally, individuals’ professional experience is critical for dance's emotional recognition and experience. Dancers can recognize and appreciate dance works through a single visual mode, which non-dancers are unable to do. The results provide insights into the relationships between dance training experience, audiovisual dual modality, and dance emotion.
Keywords
Dance is an art form involving audiovisual dual modality (i.e., visual and auditory) perception. It is also the most expressive form of emotional body language (Wigman, 1966), capable of expressing and communicating different emotions. Spectators visually perceive (visual modality) the images a dancer expresses, triggering emotions in the spectator (Langer, 1986; Ping, 2004).
Extensive research has been conducted on emotion in dance. However, most studies have focused on a single modality (Camurri et al., 2013). Specifically, only the visual perception of emotions expressed by bodily gestures and movements in dance has been investigated. For example, Zhang (2017) found that dance could promote positive emotions in the audience. Sawada et al. (2003a) found that bodily movements with different kinematic characteristics could express various emotions—slow movements indicate sadness, and rapid movements indicate anger (Sawada et al., 2003b). Using muted ballet videos, Christensen et al. (2016) investigated the emotional experiences that bodily movements elicit in viewers and found that the roundedness of movements was related to positive emotions and that dance triggered affective experiences in spectators by eliciting autobiographical memories. Van Meel et al. (1993) investigated the recognition of emotion in dance in children aged 5–12 years and found that five-year-old children performed more poorly in emotion recognition than other children and that children with and without dance training experience did not differ significantly in emotion recognition.
Compared with dance emotion, the research on music emotion is relatively rich. The concept of “musical mood” was proposed by Krumhansl (1997). The theory of emotivism defines musical emotion as the real emotion felt/experience experienced by listeners when they hear a piece of music, namely musical emotional experience. Cognitivism of musical emotions believes that music cannot make the audience experience strong emotions. Music can only express emotions as a representation of emotions. It is possible, for example, that a listener can identify the sad feelings that a piece of music is intended to convey but not feel sad when they hear the music. Therefore, cognitivists of musical emotion define musical emotion as listeners’ perception/recognition of the emotion intended to be expressed by music (Evans & Schubert, 2008; Juslin & Laukka, 2004; Lundqvist et al., 2009; Scherer et al., 2019; Zentner et al., 2008). It was also found that musical emotion recognition was stronger and more stable than musical emotion experience in self-reported results, but there was no difference in task difficulty between them (Dibben, 2004; Gabrielsson, 2002; Hunter & Schellenberg, 2010; Krahé et al., 2015; Schubert, 2007; Zentner et al., 2008). Although there are differences between music emotion recognition and music emotion experience, they are also highly correlated. On the one hand, emotional experience is accompanied by emotion recognition, which is mediated/moderated by emotion recognition (Hunter & Schellenberg, 2010). On the other hand, emotion recognition is not a simple cognitive process, and individual emotional experience can be used as effective information to assist emotion recognition (Dibben, 2004; Merrill et al., 2020; Schubert, 2013; Wöllner, 2012).
Dance, as an art form that integrates visual and auditory elements, underscores the importance of exploring embodied cognition theory. According to the embodied cognition paradigm, the environment, body, and cognition are all inextricably linked. Cognitive processing is influenced by the human body's morphological structure, sensory system, motor system, and nerve system (Anderson, 2003; Goldman & de Vignemont, 2009; Wilson, 2002; Ye, 2010). The visual, auditory, and emotional systems’ neurons are activated when a bear roars, according to James-Langer's embodied cognition theory. Then, when the same image is recalled, identical neuronal firing patterns continue even if the original scene is not present. This phenomenon arises because the sense of terror is made up of coordinated reactions in the visual, auditory, and emotional domains when an emotionally charged stimulus, such as a roaring bear, is perceived (Niedenthal, 2007). As a result, embodied emotion is a comprehensive condition that incorporates various senses. A key component of this process is multisensory integration, which is the convergence and interplay of data from several sensory channels. Studies have shown that humans receive information from the external world through multiple sensory modalities, and effective emotion processing through integrating emotional information from different modalities is an important social skill. Multimodal integration, also known as multisensory integration, refers to effectively combining combination cues from visual, auditory, and other modalities into coherent perceptual information (Ernst & Bülthoff, 2004; Kang & Luo, 2020; Lugo et al., 2008; Sun, 2018; Wen et al., 2009). When emotional visual and auditory information presented simultaneously is congruent, the auditory and visual modalities facilitate each other to enable more efficient emotion processing (i.e., the emotional congruent effect; Föcker et al., 2011; Hoenig et al., 2011). Among all sensory channels, visual and auditory stimuli contain the richest emotional information and have received more research attention. Dance is an audiovisual art, and for students majoring in dance, accompanying music is almost indispensable for routine training and performance of dance works. Therefore, music is important to the integrity of dance works and facilitates the appreciation, teaching and learning, and choreography of dance.
At present, few studies have examined the effects of audiovisual dual modality emotional mechanisms on the presentation and interpretation of emotion in dance. Previous studies usually considered dance and accompanying music as independent from each other. For example, it was found that preschool children could perceive emotions in dance movements and express movements of different emotions using the music of different emotions (Boone & Cunningham, 1998). Other studies have explored whether empirical evidence exists on emotional “matching” or emotional “congruence” between music and dance. A similar experiment was conducted by Howlin et al. (2020), who also discussed whether movement emotion matched music emotion. The findings from these studies show that the judgment of emotion perception using only the visual or auditory modality is consistent with that when using both visual and auditory modalities together (Kratus, 1993; Krumhansl & Schenck, 1997). However, when the emotion expressed by the visual and auditory modes was inconsistent, the participants’ pleasure was lowest. In addition, Reason et al. (2016) explored the influence of music on dance experience. Music, according to the results, can help audiences understand dance movements to some extent.
Notably, previous studies focused on the relationship between the audiovisual consistency between dance and music in terms of emotion but did not discuss whether the dance movements or the dance music triggered and affected the audience's emotions in the dance works. In addition, the music used in previous studies was not the original accompanying music (i.e., music composed specially for and matching the emotional characteristics of specific dance works). Using such music might result in a “split” between the music and dance and fail to effectively reveal the emotional expression intended by the dance. Thus, whether participants’ dance training experience affects the audiovisual dual modality recognition and experience of emotion in dance has not yet received research attention.
Based on previous research, this study investigated the accuracy rates and intensities of the recognition and experience of emotion in dance in dance- and non-dance majors using Chinese dance works with the original accompanying music. The study addressed three questions. 1. Can an appreciator recognize and experience the emotions intended by bodily dance movements through only a visual modality? 2. Do emotional valence and dance training experience affect the recognition and experience of emotion in dance?
Methods
Participants
To detect a medium-sized effect (f = 0.25) in a 2 × 2 × 2 mixed-design experiment, aiming for 95% power, at least 36 participants were needed(G*Power; Faul et al., 2007). In the present study, college students were recruited as participants (N = 73; Mage = 22.77 years, SD = 2.41). Among them, 40 (Mage = 22.38 years, SD = 2.16; men, n = 10: women, n = 30) were dance majors and received more than five years of dance study. There were 33 non-dance majors (Mage = 23.24 years, SD = 2.65; men, n = 5; women, n = 28) with no more than one year of learning experience in dance or music. All participants were right-handed and had normal or corrected-to-normal visual acuity and normal hearing. All participants volunteered for the experiment, and none had previously participated in a similar experiment. Students who participated in the experiment were paid 5 USD.
Experimental Design
A mixed experimental design was adopted: 2 (presentation modality: audiovisual bimodal [presence of accompanying music] vs. visual modality only [absence of accompanying music]) × 2 (dance training experience: dance major vs. non-dance major) × 2 (emotional valence: happiness vs. sadness). Presentation modality and emotional valance were within-subject variables. Dance training experience was a between-subject variable. Recognition and experience of emotion in dance (measured by accuracy rate and intensity) were dependent variables.
Experimental Materials
Three college-level dance teachers and the author selected dance videos for a preliminary list. Traditional Chinese dance works were used for evaluation. This type of dance has a distinctive style, strongly reflects traditional Chinese spirit, appeals to Chinese people's traditional aesthetic tastes, and is widely taught and performed in China. Therefore, the dance works selected for this experiment were of this type of dance. Excerpts with high emotional arousal were used as experimental materials. Thirty excerpts, each 30 s in length, were created from these dance videos using the professional video editing software Adobe Premiere Pro (PR). All dance videos were selected from classical theatrical pieces of previous sessions of the top professional dance competitions in China: the “Peach & Plum Cup” Dance Competition and the CCTV Dance Contest (Appendix 1). Among the excerpts, 15 express happiness, and 15 express sadness. All excerpts were converted into MP4 format using the following parameters: video compression: H.264; frame rate: 25 fps; target bit rate: 5.80 Mbps; maximum bit rate: 16.0 Mbp; video frame aspect ratio: 1920 × 1080 (1.0) stereophonic; 44100 Hz. Thirty-eight dance-major participants judged all dance video excerpts for their emotional type; the aforementioned participants didn't take part in the formal experiment. Excerpts in which emotional valence was correctly recognized by less than 80% of the dance majors were removed, leaving a list of 20 excerpts to be used in the subsequent experiment, with 10 for happiness and 10 for sadness (Appendix 2). Muted versions of the excerpts were created using PR software.
Experimental Procedure
The experiment was performed at a standardized behavior laboratory with good equipment and sound effects. E-Prime 2.0 was used for programming and execution. To avoid the effects of audiovisual bimodal information on visual modality, only visual-modality- (without music) excerpts were first presented in random order. Before the formal experiment began, participants were allowed to take part in three trials to familiarize themselves with the experimental procedure.
Participants were first provided the following instruction: “Welcome to the experiment. Next, you will be presented with several dance video clips. After watching each video clip, please judge the valence and rate the intensity of the emotional expressions intended by the video. Please select the valence and rate the intensity of emotion based on your own felt or experienced emotions after watching each video. The intensities of emotion recognition and experience are rated using a five-point scale (1 = very mild, 2 = mild, 3 = moderately intense, 4 = intense, 5 = very intense). Should you have any questions during the experiment, please contact the experimenter. If you have no questions, please press any key to continue.”
At the start of the experiment, a “+” was provided as a fixation point at the center of the screen for 500 ms to remind participants to focus. Participants were prompted to watch a 30 s dance video clip and judge the emotional valence of the video afterward viewing it. In order to distinguish between emotion recognition and emotion experience tasks, in the emotion recognition task, participants were prompted to enter the “1” key if happiness was perceived in a video and the “2” key if sadness was perceived. Participants were not informed whether they had selected the correct or incorrect key. After pressing the selected key, participants were prompted to rate the emotional intensity of the video using the five-point scale described above. Next, participants were prompted to judge the emotional valence they experienced while watching the video. Participants were prompted to press the “F” key for happiness and the “J” key for sadness. Finally, participants were prompted to rate the intensity of their emotional experience while watching each video using the same five-point scale. Participants were allowed to return to a neutral state of mind after watching a video before being prompted to press the space key to watch and evaluate the next video. Figure 1 illustrates the experimental procedure.

Flowchart of the experimental procedure.
Statistical Analysis
Data from the experiment were sorted and pooled; all trials were used to quantify intensity scores. A three-way (2 [presentation modality] × 2 [emotional valence] × 2 [dance training experience]) repeated measures ANOVA for emotion recognition and experience (accuracy rate and intensity) was performed using SPSS 16.0. a. Computed using alpha = 0.05; *p < .05; **p < .01; ***p < .001.
Results
Emotion Recognition Accuracy Rate
Dance training experience had no effect, according to F(1,71) = 1.31, p = .257. The emotion recognition accuracy rate in the non-dance-major group was slightly lower than in the dance-major group, although there was no discernible difference (M : 0.96 < 0.98). There was no effect of variable presentation modality, F(1,71) = 2.92, p = .092. The emotion recognition accuracy in the audiovisual bimodal was slightly higher than in visual modality only; however, there was no discernible difference (M : 0.97 = 0.97). The main effect of emotional valence had no effect, F(1,71) = 0.01, p = .906. The recognition accuracy for sadness was slightly lower than for happiness; however, the difference had no effect. (M : 0.97 = 0.97) The interaction between dance training experience and presentation modality was significant, F(1,71) = 4.07, p = .047*, η² = .054 (Figure 2). A simple effect analysis showed that in the non-dance-major group, emotion recognition accuracy in audiovisual bimodal was significantly higher than that in visual modality only (M : 0.98 > 0.95), F(1,71) = 6.34, p = .014, η² = .082. In the dance-major group, emotion recognition accuracy in audiovisual bimodal was lower than that in visual modality only; however, the difference had no effect (M : 0.98 = 0.98), F(1,71) = 0.05, p = .820. The interaction between emotional valence and dance training experience had no effect, F(1,71) = 0.01, p = .906. The interaction between presentation modality and emotional valence had no effect, F(1,71) = 2.00, p = .162. The interaction between presentation modality, emotional valence, and dance training experience had no effect, F(1,71) = 0.47, p = .495. Table 1 and Table 2 illustrate the data of emotion recognition accuracy.

Interaction of presentation modality and specialty type on emotion recognition accuracy.
Descriptive Statistics of Emotion Recognition Accuracy of Presentation Modality, Emotion Valence, and Professional Type.
Three-Factor ANOVAs for Presentation Modality × Emotional Valence × Professional Type.
Note. *p < .05; **p < .01; ***p < .001.
Emotion Recognition Intensity
The main effect of dance training experience was significant, F(1,71) = 7.90, p = .006, η² = .100, with emotion recognition intensity in the non-dance-major group being significantly higher than that in the dance-major group (M: 4.38 > 4.06). The main effect of the modality of presentation had no effect, F(1,71) = 2.78, p = .100. The emotion recognition intensity in audiovisual bimodal was slightly higher than in visual modality only; however, the difference had no effect (M: 4.26 > 4.18). The main effect of emotional valence had no effect, F(1,71) = .920, p = .341. The emotion recognition intensity for sadness was slightly higher than that for happiness; however, the difference had no effect (M: 4.25 > 4.19). The interaction between presentation modality and dance training experience was significant, F(1,71) = 14.66, p < .001***, η² = .171 (Figure 3). A simple effect analysis showed that, in the non-dance-major group, emotion recognition intensity in audiovisual bimodal was significantly higher than that in visual modality only (M: 4.51 > 4.25), F(1,71) = 13.78, p < .001***, η² = .163. In the dance-major group, emotion recognition intensity in audiovisual bimodal was lower than that in visual modality only; however, the difference had no effect (M: 4.01 < 4.11), F(1,71) = 2.58, p = .112. In the audiovisual bimodal, emotion recognition intensity in the non-dance-major group was significantly higher than that in the dance-major group (M: 4.51 > 4.01), F(1,71) = 17.44, p < .001***, η² = .197. In visual modality only, emotion recognition intensity in the non-dance-major group was slightly higher than that in the dance-major group; however, the difference had no effect (M: 4.25 > 4.11), F(1,71) = 1.25, p = .267. The interaction between emotional valence and dance training experience had no effect, F(1,71) = 1.60, p = .210. The interaction between presentation modality and emotional valence was significant, F(1,71) = 10.21, p = .002**, η² = .126 (Figure 4). A simple effect analysis showed that, for sadness, emotion recognition intensity in the audiovisual bimodal was slightly lower than that in the visual modality only; however, the difference had no effect (M: 4.23 < 4.26), F(1,71) = .211, p = .647. For happiness, emotion recognition intensity in the audiovisual bimodal was significantly higher than that in the visual modality only (M: 4.28 > 4.10), F(1,71) = 9.12, p = .004, η² = .114. In audiovisual bimodal, emotion recognition intensity for sadness was slightly lower than that for happiness; however, the difference had no effect (M: 4.23 < 4.28), F(1,71) = .609, p = .438. In visual modality only, emotion recognition intensity for sadness was significantly higher than that for happiness (M: 4.26 > 4.10), F(1,71) = 5.36, p = .024, η² = .070. The interaction between presentation modality, emotional valence, and dance training experience had no effect, F(1,71) = 1.63, p = .207. Table 3 and Table 4 illustrate the data on emotion recognition intensity.

Interaction of presentation modality and specialty type on emotion recognition intensity.

Interaction of presentation modality and emotional valence on emotion recognition intensity.
Descriptive Statistics of Emotion Recognition Intensity of Presentation Modality, Emotional Valence, and Professional Type.
Three-Factor ANOVAs for Presentation Modality × Emotional Valence × Professional Type.
Note. *p < .05; **p < .01; ***p < .001.
Emotional Experience Accuracy Rate
The main effect of dance training experience was significant, F(1,71) = 3.01, p = .087. The emotional experience accuracy rate in the non-dance-major group was slightly higher than in the dance-major group; however, the difference had no effect (M: 0.96 < 0.98). The main effect of presentation modality was significant, F(1,71) = 3.99, p = .050, η² = .053, with the emotional experience accuracy rate in audiovisual bimodal significantly higher than in visual modality only (M: 0.98 > 0.96). The main effect of emotional valence had no effect, F(1,71) = 1.53, p = .220. The emotional experience accuracy rate for sadness was slightly higher than that for happiness; however, the difference had no effect (M: 0.98 > 0.97). The interaction between presentation modality and dance training experience had no effect, F(1,71) = 2.64, p = .109. The interaction between emotional valence and dance training experience had no effect, F(1,71) = 0.09, p = .772. The interaction between presentation modality and emotional valence had no effect, F(1,71) = 3.60, p > .05. The interaction between presentation modality, emotional valence, and dance training experience had no effect, F(1,71) = 0.12, p = .731. Table 5 and Table 6 illustrate the data on emotional experience accuracy.
Descriptive Statistics of Presentation Modality, Emotional Valence, and Professional Type on the Correct Rate of Dance Emotional Experience.
Three-Factor ANOVAs for Presentation Modality × Emotional Valence × Professional Type.
Note. *p < .05; **p < .01; ***p < .001.
Emotional Experience Intensity
There was no effect of variable dance training experience, F(1,71) = 0.10, p = .753. Emotional experience intensity in the non-dance-major group was slightly lower than that in the dance-major group; however, the difference had no effect (M: 3.95 < 3.98). The main effect of presentation modality was significant, F(1,71) = 30.53, p < .001***, η² = .301, with emotional experience intensity in audiovisual bimodal significantly higher than that in visual modality only (M: 4.13 > 3.81). There was no effect of variable emotional valence, F(1,71) = 2.02, p = .159. Emotional experience intensity for sadness was slightly higher than for happiness; however, the difference had no effect (M: 4.00 > 3.94). The interaction between presentation modality and dance training experience was significant, F(1,71) = 14.66, p < .001***, η² = .665 (Figure 5). A simple effect analysis showed that, in the non-dance-major group, emotional experience intensity in audiovisual bimodal was significantly higher than that in visual modality only (M: 4.46 > 3.44), F(1,71) = 138.09, p < .001***, η² = .660. In the dance-major group, emotional experience intensity in audiovisual bimodal was significantly lower than that in visual modality only (M: 3.80 < 4.17), F(1,71) = 22.27, p < .001***, η² = .239. In audiovisual bimodal, emotional experience intensity in the non-dance-major group was significantly higher than that in the dance-major group (M: 4.46 > 3.80, F(1,71) = 30.52, p < .001***, η² = .301. In visual modality only, emotional experience intensity in the non-dance-major group was significantly lower than that in the dance-major group (M: 3.44 < 4.17), F(1,71) = 47.31, p < .001***, η² = .400. The interaction between emotional valence and dance training experience had no effect, F(1,71) = 0.00, p = .990. The interaction between presentation modality and emotional valence had no effect, F(1,71) = 0.37, p = .543. The interaction between presentation modality, emotional valence, and dance training experience had no effect, F(1,71) = 0.90, p = .345. Table 7 and Table 8 illustrate the data on emotional experience intensity.

Interaction of presentation modality and specialty type on emotional experience intensity.
Descriptive Statistics for the Intensity of Dance Emotional Experience by Presentation Modality, Emotional Valence, and Professional Type.
Three-Factor ANOVA for Presentation Modality × Emotional Valence × Professional Type.
Note. *p < .05; **p < .01; ***p < .001.
Discussion
The effects of dance training experience, emotional valence, and presentation modality on the recognition and experience (both measured by intensity and accuracy rate) of emotion in dance were experimentally investigated. The results showed that the main effect of presentation modality on emotion recognition had no effect, whereas that on emotional experience was significant. The main effect of emotional valence on emotion recognition and experience had no effect. The main effect of dance training experience on emotion recognition intensity and emotional experience accuracy rate was significant. The interaction between dance training experience and presentation modality had a significant effect on emotion recognition and experience intensities. The interaction between presentation modality and emotional valence had a significant effect only on emotion recognition intensity.
Visual Modality Only Has the Advantage of Emotion Recognition
The results showed that presentation modality had a nonsignificant main effect on emotion recognition, indicating that music, or audiovisual dual modality emotion processing, had an insignificant advantage in the recognition of emotion in dance. This finding was inconsistent with the author's expectations. Based on findings in the psychology of emotion (Fu, 2016), humans obtain most information through visual stimuli. In emotion recognition, an individual recognizes emotion in dance through facial, postural, and other external manifestations. Several studies have shown that the visual channel dominates, emphasizing the dominance of facial expressions and bodily gestures (Donohue et al., 2013; Gao et al., 2018; Hirst et al., 2018; Jessen & Kotz, 2011; Kret et al., 2013; Strelnikov et al., 2015; Tatz et al., 2020). For all the dance video clips, participants could observe the dancers’ facial expressions, gestures, and movements as well as use only the visual modality to correctly recognize the emotional expression intended by the dance works without musical rendering (i.e., with the auditory mode off). Several dancers (Lv, 2003; Wigman, 1966) have also argued that music enters dance as a temporal factor and mainly manifests in the tempo of dance movements, indicating that dance music is not a key factor in dance emotion recognition. The key factor that determines emotion recognition in dance is still the body movements in dance. Dance is a visual art. We find that in different dance segments, dancers have a common feature; that is, in the performance, dancers will show different emotional expressions through the “speed” and “force” of their movements. For example, when expressing sadness, dancers usually use slow and strong movements, such as “beating their chest and feet,” to express their sadness. When expressing happiness, dancers usually use quick, light movements to express their happiness. This also proves that in dance works, dance movement is the “soul” of dance. Therefore, in terms of emotion recognition, the audiovisual dual modal presentation has no significant difference in emotion recognition. Therefore, how to use body language to accurately convey the emotions to be expressed in the dance work needs the choreographer to constantly explore and think.
Audiovisual Dual Modality Emotion Processing Has Advantages in Emotional Experience Only
In contrast, presentation modality was found in this study to have a significant main effect on emotional experience, indicating that spectators relied more on audiovisual dual modality cues to experience emotion through viewing dance. This is basically consistent with findings from some studies (Hanser et al., 2015; Welch et al., 1986), which have suggested that bimodal (auditory and visual) presentation of affective information leads to better emotional experiences than unimodal (visual) presentation. This phenomenon happens because the individual's emotional reaction is intensified and emotional processing efficiency is improved when emotional input from several sensory channels is coordinated to express the same emotion. This phenomenon is known as the emotional congruence effect (Chen et al., 2016). This explains why participants reported better emotional experiences through audiovisual dual modality perception. In contrast, the audience did not fully experience the emotional sensations that were part of the performance when individuals were exposed to dance in a unimodal visual setting. Due to the visual channel's limitations, spectators could only deduce the feelings by observing the dancers’ facial expressions and body language. Since dance movements are typically refined, processed, or even exaggerated body movements, the audience's perception of the emotional information is reduced when there is no auditory mode. This is due to the complexity and diversity of dance art, even though participants can generally understand the emotions to be conveyed by dance movements. Therefore, when the emotional information presented through a channel is unclear, spectators integrate other modal information to obtain more cues and have a more accurate emotional experience (Welch et al., 1986). Thus, emotional experience in the absence of music was poorer than in the presence of music indicating that audiovisual dual modality affective perception has advantages for the experience of emotion in dance.
These results also indicate that adding music (auditory) to the dance performance that matches the mood of the movement can lead spectators to better experience dance works. In addition, in the dance music composition, whether the dance music composition fits the dance work will affect the overall emotional experience effect of the dance work. We found that in some dance works if the composer could not adequately understand the connotation of the dance or the mood of the dance expressed by the choreographer, even though the dancers’ body movements expressed very strong emotions, the feedback of the subjects on the emotional experience was not ideal. There are only a handful of composers who specialize in dance composition. Most composers have no dance experience, making it necessary to train specialized dance composers or a series of dance professional knowledge-related training for dance music composers. This can help them fully understand different types of dance movements and the emotional connotations expressed by the dance movements, to make dance music suitable for the dance works according to the dance movements themselves, and to achieve the unity of visual and auditory effects.
Audiovisual Dual Modality Emotion Processing Only Has Advantages for Non-Professional Spectators
Notably, the advantages of audiovisual dual modality perception in emotional experience were observed only in non-dance-major participants. This was possibly because of the differences between individual participants. Individual differences in the capacity to perceive emotion-inducing events also determine the final differences in emotional experiences. Individual differences that affect emotion recognition and emotional experiences mainly include differences in age, gender, education, and experience (Fu, 2016). Age and gender were not included in the scope of this research and are not presently considered. In addition, differences in education and experience led to the differences in the experience of emotion in dance.
Studies of emotion in music have shown that musical training experience affects individuals’ musical emotion processing. In particular, musical professionals are capable of more accurately processing musical information (Battcock & Schutz, 2022; Chen & Wang, 2019; Kantor-Martynuska & Horabik, 2015; Sun et al., 2011; Zhu, 2017) and more strongly perceiving emotions in music (Akkermans et al., 2019; Alluri et al., 2015; Chaddock-Heyman et al., 2021; Di Mauro et al., 2018; Ma et al., 2013; Spitzer & Coutinho, 2014; Wang & Liu, 2013). Zhou et al. (2020) found that the musical training experience affected individuals’ emotional experience because those who were not musical professionals lacked relevant musical expertise. Thus, a single modality was inadequate, and assistance from other channels was required. Pollak and Sinha (2002) found that physically abused children exhibited more sensitive recognition of angry expressions, due to their previous experiences with physical punishment.
Similarly, after long-term learning and training in dance, dance students have mastered the well-established dance language system and are more sensitive to the connotations expressed by dance movements (i.e., non-propositional bodily movements), especially stylized movements resulting from long-term evolution that have become the equivalent of “words” commonly used by dancers (Lv, 2003). The connotations of this body language cannot be understood by non-dance-major participants. Thus, with their proficiency and understanding of this body language, dance-major participants could respond accurately to emotion and accurately understand and experience the intended meaning using only information presented visually. This finding has not been reported in previous studies on emotion in dance.
In contrast, their lack of dance training experience prevented non-dance-major participants from fully understanding the internal effects of dance movements. Thus, they needed to resort to dual modality information for assistance. Music provides the most direct means of emotion perception besides dance movements and can effectively render and color dance. The affective and behavioral experience sharing model suggests that, when appreciating a live performance, a spectator imitates the performers’ facial and bodily expressions to acquire an empathetic emotional experience, and the degree of emotion activation is highly related to the degree of synchronization (Ammirante & Thompson, 2012; Livingstone & Thompson, 2009; Molnar-Szakacs & Overy, 2006; Thompson et al., 2008; Wöllner, 2012).
Therefore, audiovisual bimodal presentation of dance performances allows non-dance-major spectators to acquire richer affective information and thus have a more intense emotional experience than would occur if only the visual modality were presented. Therefore, non-dance-major participants better experienced the dance works with accompanying music. However, dance practitioners account for only a small part of the population. The vast majority of dance spectators have received little to no dance training. For these spectators, dance works need the rendering and coloring of music to augment the affective expressions. Hong (2003) and Wang (2006) have argued that dance movements and music constitute an integrated, unified relationship, with dance expressing the music's intentions through bodily movements and music augmenting the dance movements’ tempo and rhythm.
In the past emotional dance therapy, in most cases, patients were made to move their bodies by playing different music so as to achieve emotional regulation (Dieterich-Hartwell, 2019; Punkanen et al., 2014). However, previous studies have overlooked patients who are physically challenged and in dire need of emotional adjustment. This study found that dance videos with music can also play a role in emotional regulation, which could be applied to the mental health treatment or mental health counseling of college students in the future.
Audiovisual Dual Modality Emotion Processing Only Has Advantages in Recognition of Happiness
Notably, the intensity of recognition of emotion in dance was affected by the interaction between emotional valence and presentation modality. More specifically, the intensity of recognition of happiness in dance in the presence of music was significantly higher than in the absence of music. In contrast, the intensity of recognition of sadness in dance in the absence of music was higher than that in the presence of music, although the difference was non-significant. Moreover, in the absence of music, the intensity of recognition of sadness in dance was significantly higher than that of happiness. However, previous studies showed that sad or happy background music led to participants providing higher ratings of sadness or happiness (Logeswaran & Bhattacharya, 2009) or that sad music led to sadder ratings of sad movements, but happy music did not lead to happier ratings of happy movements (Christensen et al., 2014). The findings from this study were inconsistent with the findings of these two previous studies. This inconsistency might be because of two reasons.
The first reason is possibly related to the selection of dance works. The sad dance works were mainly narrative (with a storyline), whereas the happy dance works were mainly expressive (without a storyline). The narrative description of plots in sad dance works presented characters with distinctive characteristics. Therefore, the dance movements were not only pure bodily movements but also had been transformed into narrative body language that more readily elicited empathy from the spectator (Lv, 2003). Thus, presentation modality had an insignificant impact on participants’ emotion recognition. In contrast, the happy dance works mainly presented purely expressive arrangements of bodily movements, as they were intended to express the beauty of bodily movements. Expressive dance movements were not meant to imitate life but rather to vent emotions; thus, they were intended to express deep emotion and did not function as narratives. As a result, expressive dance is less effective in eliciting empathy in spectators than is narrative dance. Therefore, happy dance works need to be accompanied by music to facilitate the elicitation of empathy. This explains why, in the absence of music, the intensity of emotion recognition was significantly higher for sadness than it was for happiness.
Another possible reason could have been the force of the dance movements. The sad dance works selected for this experiment consisted of more forceful bodily movements than the happy ones. For example, the excerpt from Kong Yiji used in the experiment, which comes from the second half of the work, describes the experience and inner feeling of Kong Yiji, the protagonist, after his legs are broken. To present the protagonist's sadness to the audience, the dancer increased the force of his bodily movements and presented matching facial expressions. The Music and Dance volume of the Encyclopaedia of China (1985) defines dance as an art to express internal affection through bodily movements (speed, force, and other factors). Adding music to dance weakens the spectator's ability to capture the emotion intended by the dance movements alone, thereby decreasing the intensity of emotion recognition. Therefore, spectators could prefer it if sad dance works did not have accompanying music. This indicates that, for the recognition of bodily emotional expression, processing bias is not simply negative or positive; rather, it depends on how eye-catching the medium conveying the information is behind the emotional valence (Liang et al., 2019).
Research Limitations and Outlook
In this study, the effects of emotional valence and dance training experience on the recognition and experience of emotion in dance under audiovisual dual modality versus unimodal presentation were investigated. The results could be used to guide dance teaching and learning, choreography, and appreciation. In addition, these results support the use of dance as an operable means of adjuvant emotional therapy for improving psychological health.
However, this study has limitations. First, considering that the judgment of the intended emotion in dance might be affected by the type of dance, video clips of a single style of dance (traditional Chinese dance) were used in the experiment. In the future, more dance styles can be used to validate this study's findings. Second, the participant sample used in this study did not adequately represent the larger population. The participants recruited in this study were college students aged 16–28 years old. The dance-major participants were generally younger than the non-dance-major participants, which might have biased the results. In addition, the sample size was small; thus, whether the findings are applicable to other population groups (e.g., different ages and cultures) or if gender differences exist in the recognition and experience of emotion in dance require further examination. Third, researchers have suggested that the perception of dance languages is not on a single level; some dance languages cannot be expressed by bodily movements but usually can be expressed by facial movements (Lv, 2003). Therefore, further research should investigate the effects of the different parts of the body and facial expressions used by the dancer on emotional perception in dance using only the visual modality. Fourth, we discovered that the color of the dancers’ clothing in the dance videos might have an impact on how emotionally intense some participants felt their emotions. Therefore, we will ask the dancers to wear clothing that is the same color and style in the subsequent experiment. In the end, potential confounding effects of listening to traditional Chinese music were not thoroughly investigated. It is still unknown if cultural familiarity with the music or dancing training influenced the replies.
Conclusions
In this study, the effects of three variables—emotional valence, presentation modality, and dance training experience—on the recognition and experience of emotion in dance were investigated. The results show that audiovisual dual modality does not have advantages in recognition of emotion in dance but does have advantages in the experience of emotion in dance. For an individual's emotional experience, audiovisual dual modality affective perception only has advantages when experiencing positive emotion in dance. Dance training experience leads to better recognition and experience of emotion in dance. Non-dance-major college students rely more on audiovisual dual modality affective perception, whereas dance-major college students rely more on their specialized training experience. These results indicate that adding auditory cues to dance performances contributes to a better experience when viewing dance works.
Supplemental Material
sj-docx-1-art-10.1177_02762374251320531 - Supplemental material for Effects of Emotional Valence, Training Experience, and Audiovisual Dual Modality on Emotion in Dance
Supplemental material, sj-docx-1-art-10.1177_02762374251320531 for Effects of Emotional Valence, Training Experience, and Audiovisual Dual Modality on Emotion in Dance by Yi Wang and Maoping Zheng in Empirical Studies of the Arts
Footnotes
Authors Note
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. Furthermore, the authors disclose no potential conflicts of interest concerning the research, authorship, or publication of this article.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the The Science and Technology Research Program of Chongqing Municipal Education Commission, the Humanities and Social Sciences program of Chongqing Municipal Education Commission, (grant number Grant No. KJZD-K202002301, Grant No. SKGH295).
Ethics Statement
The studies involving human participants were reviewed and approved by the Research Project Ethical Review Application Form, Faculty of Psychology, Southwest University (approval number: H22081).
Supplemental Material
Supplemental material for this article is available online.
Author Biographies
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
