Limited Emotional Value Added by Immersive 3D Audio: A Reanalysis

Abstract

Music in three-dimensional (3D) audio formats is becoming increasingly important in many areas of the entertainment industry. However, little research has been done on the effects of various playback formats on the emotional listening experience. A study by Hahn made an important contribution to this topic. Based on a repeated measures design and using the Geneva Emotional Music Scale (GEMS), he conducted a listening experiment comparing music experiences resulting from presentations in stereo, 5.1 surround sound, and Auro-3D 9.1 (a 3D audio format containing a 5.1 surround sound layer and four height channels) reproduced by loudspeakers. Data were made available for a reanalysis. The main aims of this study were (1) analyzing listening differences between formats as measured by the original GEMS factors, (2) calculating effect sizes for a better estimation of sample sizes for future studies, and (3) making the data set available to the public. For the reanalysis, the ratings of participants were aggregated (mean values for the nine GEMS factors per audio format). There were significant differences between the formats as shown by a nonparametric MANOVA (N = 52) with the GEMS factors as dependent variables and the three audio formats as a repeated measures factor. For the GEMS factor Transcendence, an ANOVA (N = 52) revealed a large omnibus effect (η_p² = .206) for the three formats. Pairwise contrasts showed a significant increase (small to medium effect size) in emotional experiences for the Transcendence factor from stereo to surround sound (Cohen's d_Z = 0.31), surround sound to 3D audio (Cohen's d_Z = 0.45), and stereo to 3D audio (Cohen's d_Z= 0.64).

Keywords

3D audio emotional music experience envelopment immersion surround sound

Since the release of the Blu-ray Disc in 2006, the three-dimensional reproduction of sound (3D audio) is becoming increasingly important for the domains of film, virtual reality, video entertainment, and computer games. The increased storage capacity of the Blu-ray Disc enabled the use of multi-channel immersive audio based on the new standards of Auro-3D (released in 2006), Dolby Atmos (released in 2012), and DTS:X (released in 2015). Over recent years, 3D audio formats (mostly reproduced by headphone binauralizations) have also become increasingly popular in the music industry and are often promoted for their assumed greater “emotional depth” compared to standard stereo reproduction (Strauß, 2020, p. 18). Although “emotional depth” is neither an established concept nor a construct, it is used to postulate that emotions felt when listening to music in 3D audio formats are more intense than those when listening to the same music in stereo format. The main difference between the aforementioned playback formats is the number of playback channels (mono < stereo < 5.1 surround sound < 3D audio), and the spatial impression of sound can be enhanced by a higher number of channels: While stereo loudspeakers can create virtual sound sources on the line between them (Geluso, 2018), a 5.1 surround sound layout is able to extend the virtual sound sources to the horizontal plane around the listener (Kim, 2018). To further enhance the spatial impression to three dimensions—by elevating sound sources—three or more height channel speakers are required (Kim, 2018). In this context, an important question is whether this increasing spatiality of sounds has an objective influence on the emotional responses of listeners. Previous studies dealt with the evaluation of multi-channel stimuli, sound systems, and spatial audio in general (Francombe, Brookes & Mason, 2017; Francombe, Brookes, Mason, & Woodcock, 2017; Rumsey, 1998; Zacharov & Pedersen, 2015). However, these studies focused mainly on the quality of sound reproduction, subjective attribution of properties, and listener preferences. No conclusions on emotional effects can be drawn from these findings. As a precondition for an evaluation of the listeners’ emotional experiences, a valid psychometric inventory for the measurement of immersive music experience is crucial. The Geneva Emotional Music Scale (GEMS; Zentner et al., 2008) is designed to measure how music makes a participant feel. To do so, participants use a five-point Likert scale to rate the extent to which the emotions they feel match an adjective. This is done using a variety of adjectives, which constitute the GEMS items (for example, “sad” or “happy”). Using factor analytical techniques, Zentner et al. (2008) derived a model with nine first-order factors, each comprising some adjective items without overlap. Because of intercorrelations between first-order factors, the model also includes three second-order factors. To the best of our knowledge, Hahn (2018) conducted the first controlled study measuring the emotional experiences of listeners in reaction to different playback formats. Using a German 27-item version of GEMS (see Appendix Table A1), Hahn (2018) measured the emotions evoked by loudspeaker playback in stereo, 5.1 surround sound, and Auro-3D 9.1—a 3D audio channel-based format consisting of a 5.1 surround sound layer and four height channels. In a complete repeated measures (RM) design, 53 participants—all in individual sessions—listened to two excerpts from Schönberg's string sextet “Verklärte Nacht” (Transfigured Night) Op. 4 in each of the three audio formats via loudspeaker and indicated the intensity of emotions felt on five-point Likert scales for each stimulus. The stimuli had durations of 74 s and 97 s and were presented in random order. The length of these stimuli exceeded the average time of 8.31 s required for an emotional judgement as reported by Bachorik et al. (2009). Using the GEMS single items as a basis for data analysis, Hahn (2018) only found slight differences in emotional experience between the three audio formats, which did not reach significance. Furthermore, he extracted three principal components from the GEMS items, which are similar to the second-order GEMS factors. Based on these three components, he reported no significant differences between the three audio formats. However, Hahn did not analyze the first-order factors of GEMS.

Our main aims for the data reanalysis were:

maintenance of the original first-order GEMS factor structure for the reanalysis (in contrast to Hahn's [2018] analyses of the individual GEMS items and three principal components extracted from the items, which target the second-order GEMS factors);

calculation of effect sizes for a better estimation of the influence of playback formats on the emotional experience. The resulting effect sizes will be of particular interest for power calculations in future studies;

the sustainable use of the data set by making it available to the public.

Method

For the original study by Hahn (2018), participants provided informed written consent for inclusion, collection, use, and publication of data.

Filtering the Data Set

Hahn's (2018) data set consists of N = 53 participants. Because participants were allowed to omit responses, there are missing values on the GEMS items in the data set. Since some of the scheduled analyses “must not contain missing values” (Friedrich et al., 2022, multRM documentation), we had to decide how to handle missing values when calculating the scores from the items for the GEMS factors. As a compromise between keeping the majority of the original sample and aggregated scores based on a reasonable number of individual values, the following exclusion criterion was defined: For our analyses, a participant was considered a valid case when a response was given on at least two items for each of the GEMS factors per audio excerpt. In other words, a participant was excluded if there was at least one excerpt where they left out two items of one factor. In line with this criterion, one participant had to be excluded. The remaining N = 52 participants had an age range between 16 and 69 years (M = 30.13, SD = 12.46). Seventeen participants (32.7%) indicated their gender as female, and 35 as male (67.3%). Thirty-eight out of 52 (73.1%) reported a music-related profession or course of study (predominantly sound engineers [Tonmeister] and musicians). Most of the participants stated that they listened to classical music regularly (n = 30, 57.7%) or occasionally (n = 13, 25.0%). Thirty-three participants (63.5%) were familiar with the composition used as stimulus material and 7 (3.6%) of these reported knowing it very well. Regarding 3D audio, 19 participants (36.5%) indicated that they had heard or read about it, and an additional 25 (48.1%) that they had already listened to music in a 3D audio format.

Scores for GEMS Factors

In contrast to Hahn's (2018) excerpt-based data analysis, ratings were aggregated over the two musical excerpts for the reanalysis since there was no hypothesis on a difference between the emotional impact of the two excerpts. As a result, each participant should have one score for each of the first-order GEMS factors in all three audio formats. In the first step, item responses were averaged across the two excerpts for each format. In the second step, the factor score per format was calculated as the mean of the corresponding items. Because of the filter criterion, each factor score was based on at least four items.

Data Analysis

Due to the experimental design, an RM multivariate analysis of variance (MANOVA) was applied with the three audio formats as a within-subjects factor and the GEMS factors as dependent variables. Classical MANOVA and related procedures are based on assumptions that are often not met in real data (Bathke et al., 2018; Friedrich et al., 2019). Distribution-related assumptions include multivariate normality and the absence of multivariate outliers. In the filtered data set, multivariate normality was not present for the stereo and 3D audio conditions as indicated by Mardia's test (Mardia, 1970) implemented in the MVN R package (Korkmaz et al., 2014, 2021). Furthermore, 29 participants were multivariate outliers in at least one condition based on robust Mahalanobis distances (Korkmaz et al., 2014, 2021). However, since a non-normal distribution of the responses could validly represent their underlying mechanisms and there were no distributional assumptions based on any hypothesis, exclusion of these outliers was not considered. In addition, the exclusion would have significantly reduced the data set. To circumvent issues resulting from violated theoretical preconditions (e.g., inflated type-I-errors), we used a nonparametric method with minimal assumptions regarding the data (R package MANOVA.RM, see Friedrich et al., 2019, 2022). This approach offers a Wald-type statistic and a modified ANOVA-type statistic (MATS) with different bootstrap methods for testing MANOVA models. Based on a large simulation study, Friedrich and Pauly (2018) recommend MATS in combination with parametric bootstrapping. Its wild [sic] bootstrap approach is very liberal, and its nonparametric bootstrap tends to be more conservative than the parametric version. MATS also appears to be more robust than the Wald-type statistic. Therefore, we used MATS in combination with parametric bootstrapping.

A common, but also debatable, approach following a significant MANOVA result would be to conduct individual univariate ANOVAs on each of the dependent variables (Denis, 2015, Chapter 12; Field et al., 2012, Chapter 16.5.3; Rencher & Christensen, 2012, Chapter 6). Some authors argue that a significant MANOVA protects against alpha inflation when conducting the individual ANOVAs. Others argue that this is only partially the case and therefore suggest correction of the alpha level for the univariate tests (Field et al., 2012). However, since we are interested in the largest possible effect on emotions felt that the respective audio formats can have, we will focus on only one ANOVA. The GEMS factor that showed the largest differences in emotional experiences between the audio formats was Transcendence, and was therefore considered for analysis. The ANOVA model that predicts the Transcendence score from the audio format taking into account the RM structure of the data can be formulated using common R syntax as follows:

Transcendence \sim Format + Error (Participant / Format)

(1)

Pairwise contrasts were applied to check whether the emotional increase was in the hypothesized direction (Stereo < Surround < 3D audio). For these contrasts, a Common Language Effect Size (CLE; Lakens, 2013; McGraw & Wong, 1992) was calculated. The CLE expresses “the probability that an individual has a higher value on one measurement than the other” (Lakens, 2013, p. 4). In addition to the CLE, the differences between the three conditions were assessed by means of the common effect size Cohen's d_Z (Lakens, 2013). For better comparability with effect sizes from between-subjects designs, Cohen's d_rm and d_av as well as their less biased variants, Hedges g_rm and g_av, were calculated (for details see Lakens, 2013). The bias-corrected versions were considered because the estimation of population effect sizes from a sample is characterized by a tendency to overestimate the true effect. However, g_rm and g_av may also be biased but are less biased than d_rm and d_av and therefore show a better approximation to the true effect.

Finally, correlations—as an additional effect size—between the audio conditions were calculated for the GEMS factor Transcendence. A mean correlation was obtained by averaging the individual Fisher z-transformed correlations and back-transforming the result to r_z, as this value is less biased compared to the mean of untransformed correlations for this sample size (Corey et al., 1998).

Results

The RM MANOVA revealed significant overall differences between the formats, MATS = 17.90, p = .004 (based on parametric bootstrapping). As can be seen in Figure 1 and Table 1, the differences between the formats are largest for the Transcendence factor. Since the data for this factor fulfilled the theoretical assumptions (no extreme outliers, normality, and sphericity), a standard RM ANOVA was used, resulting in a large omnibus effect of η_p² = .206, F(2, 102) = 13.209, p < .001, η_G² = .044 (generalized). As can be surmised from Figure 2, pairwise contrasts for Stereo < Surround, Stereo < 3D audio, and Surround < 3D audio also became significant (all p < .05, for details, see Table 2). The respective effect sizes ranged from d_Z = 0.31 to 0.64 and bias-adjusted from g = 0.24 to 0.50. CLE ranged from 62.22% to 74.05%. Besides pairwise comparisons, McGraw and Wong (1992) suggest a formula to estimate the CLE for one condition compared to several other conditions. According to their approach, the probability that a participant scored higher in 3D audio compared to both stereo and surround sound is 54.99%.

Figure 1.

Means and confidence intervals of the nine GEMS factors for the three audio formats. Emotions felt are reported on a five-point Likert scale from 1 (Not at all) to 5 (Very much). Error bars represent 95% confidence intervals for within-subjects designs according to Cousineau and O’Brien (2014).

Figure 2.

Error plot for the GEMS factor Transcendence. Transcendence is reported on a five-point Likert scale from 1 (Not at all) to 5 (Very much). Error bars represent 95% confidence intervals for within-subjects designs according to Cousineau and O’Brien (2014).

Table 1.

Means and standard deviations of all nine GEMS factors for the three audio formats.

GEMS factor	Stereo	Surround	Auro-3D
Joyful Activation	2.19 (0.56)	2.29 (0.60)	2.34 (0.61)
Nostalgia	2.33 (0.60)	2.41 (0.65)	2.49 (0.65)
Peacefulness	2.43 (0.61)	2.44 (0.67)	2.48 (0.59)
Power	2.27 (0.77)	2.38 (0.73)	2.48 (0.87)
Sadness	1.90 (0.69)	1.89 (0.66)	2.00 (0.76)
Tenderness	2.28 (0.56)	2.34 (0.67)	2.38 (0.60)
Tension	2.25 (0.64)	2.34 (0.68)	2.31 (0.62)
Transcendence	2.59 (0.76)	2.77 (0.76)	3.00 (0.88)
Wonder	2.69 (0.70)	2.82 (0.69)	3.00 (0.86)

Note. Means across N = 52 participants. Standard deviations are presented in parentheses.

Table 2.

Pairwise contrasts for the RM ANOVA on the GEMS factor Transcendence.

Contrasts	Mean difference	t	p	CLE	d _Z	d _rm	d _av	g _rm	g _av
Stereo < Surround	0.183	2.27	.026	.6222	0.31	0.24	0.24	0.24	0.24
Stereo < Auro-3D	0.413	5.13	<.001	.7405	0.64	0.50	0.51	0.49	0.50
Surround < Auro-3D	0.231	2.86	.010	.6751	0.45	0.28	0.28	0.27	0.28

Note. SE = 0.081 and df = 102 taken from the ANOVA model for all contrasts; p values are Holm-adjusted; for details on the calculation of d and g with different subscripts, see Lakens (2013).

The scores for Transcendence in the three audio formats were positively correlated (for details see Table 3). Correlations ranged from r = .70 to .82. The averaged value resulting from the individual Fisher z-transformed correlations was 0.96. Back-transformation resulted in a value of r_z = .75 for the mean correlation for the scores for Transcendence between the three audio formats.

Table 3.

Correlations between the GEMS Transcendence scores in the three audio formats.

				95% CI
Audio formats	r	z_r	p	LL	UL
Stereo – Surround	.701	0.869	<.001	0.561	1.000
Stereo – Auro-3D	.702	0.871	<.001	0.562	1.000
Surround – Auro-3D	.816	1.146	<.001	0.721	1.000
Average Correlation	.745^a	0.962

Note. N = 52. Alternative hypothesis is a positive correlation. z_r = Fisher z-transform.

r_z (back-transformed average z_r value).

Looking at the individual ANOVAs for the remaining GEMS factors, Wonder, Nostalgia, Joyful Activation, and Power showed significant differences for the uncorrected significance level of α = .05 (for details see Appendix Table A2). With a Bonferroni correction resulting in α_corrected = α / 9 = .0056, the only factor showing significant differences besides Transcendence was Wonder. Its effect size is η_p² = .102, while the effect sizes for the seven remaining GEMS factors are below η_p² = .063.

Discussion

The MANOVA revealed that the audio formats had an effect on the emotions felt by the participants as measured by GEMS. For the factor of Transcendence, the ANOVA and its contrast analyses confirmed that the direction of emotional increase for the formats was as hypothesized (Stereo < Surround < 3D audio, see Figure 2 and Table 2). CLEs indicated that in a pairwise comparison the audio format with the technical possibility of higher spatiality is rated higher with a probability greater than 50%. This also holds for the comparison of 3D audio against both stereo and surround sound. Following common effect size benchmarks (Ellis, 2010, p. 41), the differences between the formats ranged from small effects (d or g ≥ 0.2) up to a medium effect (d or g ≥ 0.5). The large omnibus effect of η_p² = 0.206 (which corresponds to Cohen's f = 0.509) or the generalized effect η_G² = 0.044 (which corresponds to Cohen's f = 0.215) along with the average correlation of r_z = .75 between the emotional ratings for the three audio formats can be used as first estimates in a priori power analyses for future research designs.

These calculated effect sizes have their limitations and should be used with caution. One limitation is that they are based on a stimulus set that is limited in at least two ways. First, participants only listened to two excerpts from a single piece of classical music. The results may differ for other pieces and other genres. In addition to the musical content itself, the recording and production techniques may also affect the differences between audio formats. Auro-3D 9.1 is just one of many 3D audio formats, including, for example, higher-order ambisonics or object-based formats such as Dolby Atmos. A second limitation is that effect sizes were calculated for only one GEMS factor. GEMS aims to measure the emotions felt by listeners. Since these induced emotions are thought to be not only the result of the music but of a complex interaction between the music, the listener, and situational factors (Gabrielsson, 2001), the largest difference between the audio formats might not always apply to the factor of Transcendence. Since immersion is “characterized by […] increasing emotional involvement” (Grau, 2003, p. 13), research on immersion could be an indicator of the emotional effect of different audio formats. Against the background of more recent findings by Agrawal et al. (2022) from the audio-visual domain (the authors used excerpts from movies), there might be no significant difference between 3D audio and surround sound, or even no difference at all in the psychological experience of immersion and thus in the emotions felt.

Concerning the validity of our results, there is an overlap between the latent variables of Transcendence and Immersion, as measured by the Immersive Music Experience Inventory (IMEI; Wycisk et al., 2022), mediated through the common item “overwhelmed” in both inventories. Based on the outlier-adjusted data from Wycisk et al. (2022) considering evaluations from 190 participants to mono, stereo, and binaural 3D versions of audio excerpts from different pieces, a correlation analysis of the aggregated IMEI scores between audio formats resulted in a similar average correlation of r_z = .79 (for details see Appendix Table A3). In addition, the CLE for 3D versus stereo and mono was 50.7%, which is close to the CLE for 3D audio from Hahn's (2018) data. Therefore, the effect sizes might be applicable in a broader context than the limitations imply at first sight.

Finally, the majority of Hahn's sample had a professional musical background. Thus, it might be assumed that findings could differ from the more general population. However, there is currently no evidence that at least strong emotional experiences of music (in terms of physiological reactions such as chills) differ between musicians and non-musicians. For example, Grewe et al. (2009) showed that the number of chills perceived is not linked to the level of music education, age, or gender. We cannot exclude a more general effect of musical sophistication on the psychological rating of emotional experiences, but this should be based on a more differentiated approach to musical skills as offered by the Goldsmiths Musical Sophistication Index (Müllensiefen et al., 2014).

To summarize, our data reanalysis not only offers a reliable overall effect size for future power calculations for the planning of perceptual studies on immersive listening experiences but also allows specified effect sizes for pairwise comparisons of audio formats. Future work on the emotional effect of audio formats should investigate a wider range of stimulus material, both in terms of the musical content and various 3D audio formats, with respect to a more general audience.

Footnotes

Acknowledgment

We are indebted to Ephraim Hahn for making the original data available to us and giving permission for the publication of the data set.

Author Note

Portions of these findings were presented in a preliminary version as a poster at the 2022 Jahrestagung der Deutschen Gesellschaft für Musikpsychologie [Annual Conference of the German Society for Music Psychology], Würzburg, Germany.

Action Editor

Markus Neuwirth, Anton Bruckner Privatuniversität für Musik, Schauspiel und Tanz, Institut für Theorie und Geschichte, Linz, Austria.

Peer Review

Sarvesh Rajesh Agrawal, Bang and Olufsen, Research

One anonymous reviewer

Contributorship

KS and YW researched the literature. RK obtained the data and permission for reanalysis and publication. KS and RK were involved in data analysis. KS and YW wrote the first draft of the manuscript. All authors reviewed and edited the manuscript and approved the final version of the manuscript.

Data Availability

Data and analysis code are available on GitHub: . The data sets are licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Ethical Approval

The original study by Hahn (2018) was conducted in Germany, where external ethical approval in psychological research is not mandatory and only required in specific cases. Such cases include (a) the expectation that participants take risks, (b) when deliberately not informing participants about the study procedure, or (c) when stimulating participants physically (Deutsche Forschungsgemeinschaft [DFG], 2023).

The presented reanalysis of the data set from the original study did not require ethics committee or IRB approval. The reanalysis did not involve the use of personal data, fieldwork, or experiments involving human or animal participants, or work with children, vulnerable individuals, or clinical populations.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by a research grant from “Niedersächsisches Vorab,” a joint program funded by the Volkswagen Foundation in conjunction with the Lower Saxony Ministry for Science and Culture (funding reference: ZN3497) awarded to the third author.

ORCID iDs

Kilian Sander

Yves Wycisk

Reinhard Kopiez

Appendix

Table A3.

Correlations between the IMEI scores in mono, stereo, and binaural 3D audio.

				95% CI
Formats	r	z_r	p	LL	UL
Mono – Stereo	.789	1.068	< .001	0.739	1.000
Mono – 3D	.675	0.820	< .001	0.605	1.000
Stereo – 3D	.873	1.346	< .001	0.841	1.000
Average	.779^a	1.078

Note. N = 190. Alternative hypothesis is a positive correlation. z_r = Fisher z-transform.

r_z (back-transformed average z_r value).

References

Agrawal

Bech

De Moor

Forchhammer

(2022). Influence of changes in audio spatialization on immersion in audiovisual experiences. Journal of the Audio Engineering Society, 70(10), 810–823. https://doi.org/10.17743/jaes.2022.0034

Bachorik

J. P.

Bangert

Loui

Larke

Berger

Rowe

Schlaug

(2009). Emotion in motion: Investigating the time-course of emotional judgments of musical stimuli. Music Perception, 26(4), 355–364. https://doi.org/10.1525/mp.2009.26.4.355

Bathke

A. C.

Friedrich

Pauly

Konietschke

Staffen

Strobl

Höller

(2018). Testing mean differences among groups: Multivariate and repeated measures analysis with minimal assumptions. Multivariate Behavioral Research, 53(3), 348–359. https://doi.org/10.1080/00273171.2018.1446320

Corey

D. M.

Dunlap

W. P.

Burke

M. J.

(1998). Averaging correlations: Expected values and bias in combined Pearson rs and Fisher’s z transformations. The Journal of General Psychology, 125(3), 245–261. https://doi.org/10.1080/00221309809595548

Cousineau

O’Brien

(2014). Error bars in within-subject designs: A comment on Baguley (2012). Behavior Research Methods, 46(4), 1149–1151. https://doi.org/10.3758/s13428-013-0441-z

Denis

D. J.

(2015). Applied univariate, bivariate, and multivariate statistics (1st ed.). Wiley.

Deutsche Forschungsgemeinschaft. (2023). FAQ: Humanities and social sciences: Statement by an ethics committee . https://www.dfg.de/en/research_funding/faq/faq_humanities_social_science/index.html

Ellis

P. D.

(2010). The essential guide to effect sizes: Statistical power, meta-analysis, and the interpretation of research results. Cambridge University Press.

Field

A. P.

Miles

Field

(2012). Discovering statistics using R. Sage.

10.

Francombe

Brookes

Mason

(2017). Evaluation of spatial audio reproduction methods (Part 1): Elicitation of perceptual differences. Journal of the Audio Engineering Society, 65(3), 198–211. https://doi.org/10.17743/jaes.2016.0070

11.

Francombe

Brookes

Mason

Woodcock

(2017). Evaluation of spatial audio reproduction methods (Part 2): Analysis of listener preference. Journal of the Audio Engineering Society, 65(3), 212–225. https://doi.org/10.17743/jaes.2016.0071

12.

Friedrich

Konietschke

Pauly

(2019). Resampling-based analysis of multivariate data and repeated measures designs with the R package MANOVA.RM. The R Journal, 11(2), 380–400. https://doi.org/10.32614/RJ-2019-051

13.

Friedrich

Konietschke

Pauly

(2022). MANOVA.RM: Resampling-based analysis of multivariate data and repeated measures designs (Version 0.5.3) [R package]. https://CRAN.R-project.org/package=MANOVA.RM

14.

Friedrich

Pauly

(2018). MATS: Inference for potentially singular and heteroscedastic MANOVA. Journal of Multivariate Analysis, 165, 166–179. https://doi.org/10.1016/j.jmva.2017.12.008

15.

Gabrielsson

(2001). Emotion perceived and emotion felt: Same or different? Musicae Scientiae, Special issue 2001-2002, 123–147. https://doi.org/10.1177/10298649020050S105

16.

Geluso

(2018). Stereo. In Roginska

Geluso

(Eds.), Immersive sound: The art and science of binaural and multi-channel audio (pp. 63–87). Routledge Taylor & Francis Group.

17.

Grau

(2003). Virtual art: From illusion to immersion ( Custance

, Trans.; rev. and expanded ed.). MIT. (Original work published 2001).

18.

Grewe

Kopiez

Altenmüller

(2009). The chill parameter: Goose bumps and shivers as promising measures in emotion research. Music Perception, 27(1), 61–74. https://doi.org/10.1525/mp.2009.27.1.61

19.

Hahn

(2018, August 6–9). Musical emotions evoked by 3D audio [Conference paper]. AES Conference on Spatial Reproduction, Tokyo, Japan. http://www.aes.org/e-lib/browse.cfm?elib=19640

20.

Kim

(2018). Height channels. In Roginska

Geluso

(Eds.), Immersive sound: The art and science of binaural and multi-channel audio (pp. 221–243). Routledge Taylor & Francis Group.

21.

Korkmaz

Goksuluk

Zararsiz

(2014). MVN: An R package for assessing multivariate normality. The R Journal, 6(2), 151–162. https://doi.org/10.32614/RJ-2014-031

22.

Korkmaz

Goksuluk

Zararsiz

(2021). MVN: Multivariate normality tests (Version 5.9) [R package]. https://CRAN.R-project.org/package=MVN

23.

Lakens

(2013). Calculating and reporting effect sizes to facilitate cumulative science: A practical primer for t-tests and ANOVAs. Frontiers in Psychology, 4, Article 863. https://doi.org/10.3389/fpsyg.2013.00863

24.

Mardia

K. V.

(1970). Measures of multivariate skewness and kurtosis with applications. Biometrika, 57(3), 519–530. https://doi.org/10.1093/biomet/57.3.519

25.

McGraw

K. O.

Wong

S. P.

(1992). A common language effect size statistic. Psychological Bulletin, 111(2), 361–365. https://doi.org/10.1037/0033-2909.111.2.361

26.

Müllensiefen

Gingras

Musil

Stewart

(2014). The musicality of non-musicians: An index for assessing musical sophistication in the general population. PLoS ONE, 9(2), Article e89642. https://doi.org/10.1371/journal.pone.0089642

27.

Rencher

A. C.

Christensen

W. F.

(2012). Methods of multivariate analysis. Wiley. 10.1002/9781118391686

28.

Rumsey

(1998, October 31–November 2). Subjective assessment of the spatial attributes of reproduced sound [Conference paper]. AES Conference on Audio, Acoustics & Small Spaces, Copenhagen, Denmark. http://www.aes.org/e-lib/browse.cfm?elib=8096

29.

Strauß

(2020). Interview: MSM Studio Group München: Immersive audio: Emotionalität, dreidimensional. KEYS, 07/2020, 16–19.

30.

Wycisk

Sander

Kopiez

Platz

Preihs

Peissig

(2022). Wrapped into sound: Development of the immersive music experience inventory (IMEI). Frontiers in Psychology, 13, Article 951161. https://doi.org/10.3389/fpsyg.2022.951161

31.

Zacharov

Pedersen

T. H.

(2015, October 29–November 1). Spatial sound attributes—Development of a common lexicon [Convention paper]. 139th AES Convention, New York, USA. http://www.aes.org/e-lib/browse.cfm?elib=17992

32.

Zentner

Grandjean

Scherer

K. R.

(2008). Emotions evoked by the sound of music: Characterization, classification, and measurement. Emotion, 8(4), 494–521. 10.1037/1528-3542.8.4.494