Abstract
Involuntary musical imagery, colloquially known as “earworms”, is a phenomenon hypothesized to reflect involuntary rehearsal of long-term memory representations. Here we investigated musical earworms with a questionnaire adapted from Halpern and Bartlett (2011, Music Perception, 28(4), 425–432), both in typical individuals and in participants with congenital amusia. Congenital amusics have impaired short- and long-term musical memory, yet with evidence for preserved implicit processing of music. Almost all participants in both groups reported experiencing musical earworms, however less frequently so in amusics than in controls. In both groups, musical earworms were reported being mostly familiar music with lyrics, and consisted of music liked by the participants. Some features distinguished earworms in amusics and controls, with more limited familiarity effects in amusics. Moreover, amusics were deploying less voluntary strategies to stop the earworms, in keeping with less stable music memories in this group. In addition, we investigated verbal earworms in the same participants. Verbal earworms occurred less frequently than musical earworms, and were more frequent in amusics than in controls. However, the two types of earworms showed similar features and their frequencies of occurrence were correlated, suggesting they rely in part on domain-general processes. Implications for the understanding of involuntary auditory imagery and congenital amusia are discussed.
Introduction
Musical “earworms” refer to the phenomenon of having songs “stuck” in one's head (e.g., Williams, 2015). It is an involuntary musical imagery (INMI) that has the specificity of being repetitive. It can be considered as a form of musical memory, consisting of mentally hearing and repeating (familiar) musical excerpts, often of more recently heard music. It also occurs more often and lasts longer for individuals with enhanced music listening or practice (e.g., singing; Müllensiefen et al., 2014). This is in line with features reported for memory in general, that is, better memory for more recent experiences and after repeated exposure. The scientific investigation of “musical imagery” in general (i.e., also including voluntary imagery) has attracted interest as one means to further our understanding of perception and cognitive functions, such as memory, attention, and consciousness. Interestingly, brain activation during voluntarily imagined music was reported as similar to brain activation during perceived music (e.g., Halpern, 1999; Zatorre & Halpern, 2005), in particular involving the activation of frontal and superior temporal regions as well as the supplementary motor areas. For INMI more specifically, Farrugia et al. (2015) revealed a negative relationship between frequency of INMI and cortical thickness of various brain areas, including auditory areas and inferior frontal regions. Their VBM results further revealed a relation between the extent to which participants wished to suppress INMI or considered them helpful, and increased gray matter volume in right temporopolar and parahippocampal cortices, respectively.
Various research methods have been used with the aim to further understand the personal and subjective experience of “earworms” or INMI. Most methods are based on self-reports asking for subjective experiences, such as via interviews, questionnaires, diaries, or experience sampling (contacting participants on mobile phones at random times of the day in everyday lives, e.g., Beaty et al., 2013; Halpern & Bartlett, 2011). Data mining with analyses of reports in social media (e.g., Twitter) are part of this approach (e.g., Liikkanen et al., 2015). Some lab-based experiments have also been conducted, notably, for example, by asking participants to tap the tempo of the earworms (Williamson et al., 2014) or presenting catchy tunes in the lab and observing the potential subsequent occurrence of earworms (e.g., Byron & Fowles, 2015). Some of these findings also revealed that repetition and familiarity increase the likelihood of a song to become an earworm.
Studying a wide corpus of tunes reported as “earworms” has revealed specific features of potential INMI tunes on both time and pitch dimension (Jakubowski et al., 2017). INMI tunes have faster average tempi than non-INMI tunes and have specific melodic contour features (with respect to a reference corpus), such as for example either very common global melodic contours (i.e., patterns of rising and falling of pitches), or highly unusual contour patterns. Interestingly, this analysis also revealed that a given song's popularity and recency (e.g., measured with chart entry date and persistence) is related to the frequency of the song to be reported as INMI, thus confirming links with listeners’ long-term memory (e.g., recency, familiarity, see also Byron & Fowles, 2015).
Based on these various characteristics of the phenomenon of INMI and the features of INMI-songs, we were interested in investigating the potential existence of musical earworms in a population for which deficits or alterations along these perceptual and cognitive dimensions as well as its involved neural correlates have been reported. This population is referred to as being affected by congenital amusia, which has been described as a lifelong disorder of music perception and production (for reviews, see Peretz, 2016; Tillmann et al., 2015). This deficit cannot be explained by peripheral hearing loss, brain lesions, or general cognitive or social impairments (Ayotte et al., 2002), and short-term memory for verbal material is intact (including classical digit spans; Albouy, Schulze, et al., 2013; Tillmann et al., 2009; Williamson & Stewart, 2010). The main hypotheses for impairments underlying this condition are impaired fine-grained pitch perception (Ayotte et al., 2002; Foxton et al., 2004; Hyde & Peretz, 2004) and impaired short-term memory for pitch, which is observed even in the absence of elevated pitch discrimination thresholds or when the to-be-processed pitch changes exceed amusics’ individual pitch discrimination threshold (Albouy, Mattout, et al., 2013; Gosselin et al., 2009; Tillmann et al., 2009; Williamson et al., 2010; Williamson & Stewart, 2010; review in Tillmann, Lévêque, et al., 2016). The main dimension affected by this disorder is the pitch dimension, as revealed by various tasks requiring pitch processing and memory (Liu et al., 2010), but the time dimension can also be affected (e.g., Foxton et al., 2006; Pfeuty & Peretz, 2010; Van Vugt et al., in progress). Subjective reports as well as questionnaire and experience sampling approaches have revealed at least for a subset of amusics an altered music engagement and appreciation (e.g., McDonald & Stewart, 2008; Omigie et al., 2012). Amusic individuals seem to incorporate music to a lesser degree into everyday activities than do controls. They report to listen less often to music and to sing more rarely than do controls, both in private and in public (e.g., Peretz et al., 2008). Interestingly, not all amusic individuals reflect this limited engagement, some show profiles similar to controls (e.g., McDonald & Stewart, 2008; Omigie et al., 2012). The subjective comments also reflect their memory impairments, both for long- and short-term memory. Regarding long-term memory, amusic participants report their difficulties in recognizing a song without lyrics (e.g., Ayotte et al., 2002), an impairment that is also observed in explicit familiar tune recognition tests (e.g., Omigie et al., 2012; Peretz et al., 2002).When performing short-term memory tasks, amusic participants indicate, for example, that it is difficult to hold on a melody in their mind and that information just disappears (e.g., Albouy, Mattout, et al., 2013). This is clearly reflected in one congenital amusic's comment describing their attempts to keep a short musical phrase in memory, reported by Stewart (2008): “When the music finished the sound was always gone - as though it had never happened. And this bewildered me with a sense of failure, of failure to hold on to what I had just heard. Others told me that if I tried to remember I would. But I never did.” (p. 128).
Neurophysiological studies investigating the cerebral underpinnings of congenital amusia have revealed functional and anatomical abnormalities in the auditory cortex, the inferior frontal cortex, in particular in the right hemisphere (Albouy et al., 2019; Albouy, Caclin et al., 2019; Albouy, Mattout, et al., 2013; Hyde et al., 2006, 2007, 2011), together with an abnormal connectivity between these structures (Albouy et al., 2015, 2019; Albouy, Mattout, et al., 2013; Hyde et al., 2011; Lévêque et al., 2016; Loui et al., 2009). Some of these cortical alterations thus occur in areas also associated with the individual differences related to INMI (Farrugia et al., 2015).
In light of this pattern of impairments and anomalies, one might thus wonder whether individuals with congenital amusia could experience the phenomenon of musical earworms, and leads to the hypothesis of missing or decreased INMI in this condition. This hypothesis would fit with the comment of the congenital amusic reported by Stewart (2008): “I have no idea what people mean when they say: ‘I have a tune going round in my head.’ I have never had a tune tell out its music in my head let alone repeat itself!” (p. 128).
However, alternatively, one could make the hypothesis that INMI might benefit from implicit processing capacities as well as default-network involvement (Farrugia et al., 2015; Lévêque et al., 2016), which would allow also amusic individuals to experience this phenomenon. Indeed, an increasing number of reports have revealed that not all musical processing aspects are impaired, but that some implicit processing seems to remain intact, such as implicit detection of pitch changes (e.g., Peretz et al., 2009; Zendel et al., 2015), implicit processing of tonal structures (e.g., Albouy, Schulze, et al., 2013; Lévêque et al., 2022; Tillmann et al., 2012; Tillmann, Lalitte, et al., 2016), as well as some long-term memory for music, allowing for feeling of familiarity judgments as do controls (even though with somewhat slower access; Tillmann et al., 2014) and the influence of familiarity and liking on memory and pitch deviance detection (e.g., Ayotte et al., 2002; Quiroga-Martinez et al., 2020; Lévêque et al., 2023). To the best of our knowledge, the phenomenon of earworms has not been studied in congenital amusia. Note that when writing this report, a study on voluntary musical imagery in congenital amusia was published, also assessing earworms as part of a larger investigation (Loutrari et al., 2022). In contrast to our study, it used a single measure derived from an earworm questionnaire and observed no significant difference on this single aggregated measure between amusic and control participants.
We had set out to investigate INMI in individuals with congenital amusia and their matched control participants, who are all rather non-musicians. We started this project in 2011 by translating and adapting the questionnaire introduced by Halpern and Bartlett (2011) into French. As congenital amusia is a rather rare phenomenon (estimated to occur for 1–2% of the general population; Cuddy, 2005; Peretz & Vuvan, 2017; Sloboda et al., 2005), we accumulated the data presented here since then. For consistency with the start of the project, we kept the initial questionnaire without changing to more refined questionnaires or investigation tools of INMI that have been proposed later on (e.g., Floridou et al., 2015). The data reported below are encouraging to further investigate this phenomenon now with more refined tools.
The questionnaire by Halpern and Bartlett (2011) also included questions for potential “verbal earworms”. This addition seemed particularly interesting for the population of congenital amusics for which no deficits for verbal processing (e.g., verbal short-term memory, memory span) have been reported, except when pitch information is the only available cue for the required task (e.g., pitch change detection in syllables, Tillmann et al., 2011; or controlled tone language material, e.g., Nan et al., 2010; Nguyen et al., 2009).
Material and Methods
Participants
Starting in 2011, we used a paper version of the questionnaire, and collected data from a first set of participants. In 2017, we implemented an online version of the questionnaire, which allowed us to collect a larger set of data. Data were collected in total from 99 control participants (paper version, n = 11; online version, n = 88) and 41 amusic participants (paper version, n = 6; online version, n = 35). Participating in this questionnaire study was proposed to volunteers that came to the lab to participate in behavioral or neuroimaging studies investigating congenital amusia, and thus underwent a full assessment allowing diagnosis of congenital amusia (audiometry, MBEA, PDT, see below for details). To match amusic participants, non-musician control participants were recruited (i.e., participants with mostly no music training outside compulsory courses at school, with some having received limited musical training, as for the amusics, see Table 1). All participants reported no history of neurological or psychiatric disease. Demographic data for the two groups of participants are detailed in Table 1.
Demographic characteristics of control and amusic participants.
Note: The characteristics of the two groups were compared with Student t-tests, except for Sex and Handedness tested with a Fisher Exact test. Bold indicates significant differences.
Our standard screening session for congenital amusia included: a standard audiometry to exclude moderate to severe hearing loss; the Montreal Battery for the Evaluation of Amusia (MBEA, Peretz et al., 2003); and a Pitch Discrimination Threshold task (Tillmann et al., 2009). To be considered as amusic, participants had to obtain an average score two standard deviations below the average of the normal population on the MBEA. Amusics obtained scores below the cut-off score on the overall MBEA battery (23.4 on average across the six tasks, maximum score = 30) or the three pitch-related subtests (scale, interval, contour, cut-off = 21.7 on average across the three tasks, maximum score = 30; Liu et al., 2010). Pitch discrimination thresholds (PDTs) were determined using a two-alternative forced-choice task with an adaptive tracking, two-down/one-up staircase procedure (see Tillmann et al., 2009, for task and details). MBEA scores and PDT for the two groups, as well as between-group comparisons, are reported in Table 1.
All participants signed a written informed consent prior to behavioral tests, and data collection with the questionnaire followed the appropriate regulation on personal data protection at the time of data collection.
Material
We adapted the questionnaire developed by Halpern & Bartlett (2011), added some extensions and translated it into French. Our French questionnaire (and its English translation) is presented in the Appendix. Various aspects of musical earworms were assessed: frequency of occurrence, features of earworms (musical style, familiarity, liking), subjective experience of musical earworms (pleasantness, ending of the phenomenon, strategies). The same type of questions were then proposed to investigate the phenomenon of verbal earworms.
Statistical Analysis
For each question, we compared between groups the proportion of participants giving the different possible answers using a Fisher's Exact test. We then investigated the potential relation between the occurrence of musical earworms and other relevant variables (occurrence of verbal earworms, age, music memory, musical habits) with Spearman's rank correlations, across all participants and separately in each group.
Results
Results are reported following the order of appearance of the questions in the questionnaire. Subheadings are added for clarity of presentation, but were not included in the questionnaire (see Appendix). Unless otherwise specified, the results were for all participants of both groups (n = 140, see Methods).
Occurrence of Musical Earworms
The first question asked participants whether they knew the expression “avoir une musique dans la tête” [Do you know the expression “having music/song in your head”?], which is the French way of referring to musical earworms. All participants, except for one amusic participant, were familiar with this expression (i.e., responding yes). Only four participants (1 control out of 99 and 3 amusics out of 41) reported not experiencing musical earworms (Figure 1A). The proportion of participants not reporting musical earworms was not significantly different between groups (Fisher's Exact test, p = 0.08). Among participants who reported musical earworms (98 controls and 38 amusics), the frequencies of occurrence most often reported were “almost daily” and “a few times/week” for controls, whereas they were “a few times/week” and “a few times/month” for amusics (Figure 1B). The frequency of occurrence of musical earworms differed significantly between groups (Fisher's Exact test, p = 0.02). Amusics hence reported less frequent musical earworms than did controls.

Occurrence of musical and verbal earworms in control (n = 99) and amusic (n = 41) participants. A. Percentages of participants in each group that report experiencing musical earworms. B. For participants who experience musical earworms (98 controls and 38 amusics), percentages of participants in each group that report a given frequency of occurrence of musical earworms. C. Percentages of participants in each group that report experiencing verbal earworms (missing data from one amusic participant). D. For participants who experience verbal earworms (31 controls and 22 amusics), percentages of participants in each group that report a given frequency of occurrence of musical earworms (missing data from one additional amusic participant). Blue bars on the left for controls, red bars on the right for amusics.
Features of Musical Earworms
Details about whether musical earworms were always of the same musical style (or not) or consisted even of always the same pieces are reported in Table 2. In both groups, the most frequent answer was that musical earworms were of “various musical styles”, followed by “various pieces of the same musical style”, hence revealing that participants mostly experience a variety of different musical earworms. The distribution across the various options regarding musical style for the musical earworms did not differ significantly between groups (Fisher's Exact test, p = 0.39).
Nature of musical and verbal earworms.
Note. The values reported are the number of participants that choose this response option followed by the percentage of participants per group in brackets. Musical earworms: data from 98 control participants and 38 amusics who reported experiencing musical earworms. Verbal earworms: Data from 30 controls and 21 amusics who reported experiencing verbal earworms (missing data from one participant per group).
For the subsequent questions aiming to further characterize earworms, data were missing for 10 controls and 5 amusics (see Appendix for details about questionnaire implementation and whether or not presenting some questions should be conditioned on answers given to previous questions).
Regarding familiarity (Figure 2A), most control participants reported that earworms consisted of familiar music (n = 85 [97%]) rather than unfamiliar music (n = 3 [3%]). This was also the case for amusic participants, but to a lesser extent (n = 28 [85%] for familiar music, vs. n = 5 [15%] for unfamiliar music). The proportion of familiar/unfamiliar responses differed across groups (Fisher's Exact test, p = 0.03). Most participants reported that earworms were music with lyrics (Figure 2B), both in the control group (n = 78 [89%] for music with lyrics, vs. n = 10 [11%] for music without lyrics), and in the amusic group (n = 30 [91%] for music with lyrics, vs. n = 3 [9%] for music without lyrics). The proportion of the two response options did not differ significantly between groups (Fisher's Exact test, p = 0.44). Finally, most participants reported that earworms consisted of music they appreciated (Figure 2C), both in the control group (n = 67 [77%] for music they like, n = 15 [17%] for neutral music, and n = 5 [6%] for music they dislike, missing data for an additional control participant) and in the amusic group (n = 24 [73%] for music they like, n = 7 [21%] for neutral music, and n = 2 [6%] for music they dislike; the proportion of the three responses did not differ significantly between groups, Fisher's Exact test, p = 0.62).

Features of musical and verbal earworms in control and amusic participants. A. Percentages of participants in each group who reported that musical earworms consisted mostly of familiar or unfamiliar music. B. Percentages of participants in each group who reported that musical earworms consisted mostly of music with or without lyrics. C. Percentages of participants in each group who reported that musical earworms consisted mostly of music they liked, neutral music, or music they disliked. D. Percentages of participants in each group who reported that verbal earworms consisted mostly of familiar or unfamiliar content. E. Percentages of participants in each group that report that verbal earworms consisted mostly of liked, neutral, or disliked content.
The musical styles of the pieces that participants indicated hearing as earworms are presented for both groups in Figure 3A, with pop, rock, and summer hit being the most frequent answers in both groups (note that participants could give several answers).

Types of musical and verbal earworms in control and amusic participants. Participants could select one or several items describing the types of musical (A) or verbal (B) earworms they are experiencing. The percentage of participants in each group that selected a given type is depicted. Blue bars on the left for controls, red bars on the right for amusics.
Subjective Experience of Musical Earworms
Hearing musical earworms can be a positive, a negative, or a neutral experience. In controls, the most frequent answer was that it is a negative experience (n = 39 [40%] for negative, n = 34 [35%] for positive, and n = 25 [25%] for neutral). In amusics, the most frequent answer was that it is a neutral experience (n = 15 [41%] for neutral, n = 11 [31%] for positive, and n = 10 [28%] for negative, missing data for 2 participants). The proportion of these three responses differed significantly between groups (Fisher's Exact test, p = 0.009).
A majority of participants reported that it was difficult to get rid of musical earworms, both for controls (n = 68 [69%] having difficulties, vs. n = 30 [31%] having no such difficulties), and for amusics (n = 24 [63%] having difficulties, vs. n = 14 [37%] having no such difficulties; the proportion of the two responses did not differ significantly between groups, Fisher's Exact test, p = 0.09). When asked how the earworm terminates, control participants mostly reported that it goes away by itself (n = 76 [78%]), whereas some deploy voluntary strategies to stop it (n = 19 [19%]), and a few reported that it is almost impossible to stop it (n = 3 [3%]). Almost all amusics reported the earworm goes away by itself (n = 31 [86%]), with only few amusic participants reporting to deploy a voluntary strategy to stop it (n = 3 [8%]) or that it is almost impossible to stop it (n = 2 [6%], missing data for 2 participants, the proportion of these three responses differed significantly between groups, Fisher's Exact test, p = 0.01). Some participants provided descriptions of the strategies to get rid of the earworms, which we could sort into three categories: listening to or singing the original musical piece (n = 11 controls for listening, n = 2 controls for singing; n = 1 amusic for listening); listening to other musical pieces (n = 4 controls, n = 1 amusic); switching to a different activity (n = 1 control, n = 1 amusic).
Occurrence of Verbal Earworms
The second part of the questionnaire investigated verbal earworms, defined as verbal information (e.g., phrases, sentences) repeating over and over in one's head (one amusic participant did not respond to this part of the questionnaire). Only one third of the control participants reported experiencing verbal earworms, whereas more than half of the amusic participants did so (Figure 1C, the proportion of participants experiencing verbal earworms differed significantly between groups, Fisher's Exact test, p = 0.006). Amongst the participants who reported verbal earworms (31 controls out of 99 and 22 amusics out of 40, missing data for the verbal part of the questionnaire for one amusic participant), the frequency of occurrence most often reported was “a few times/month” (Figure 1D, the frequency of occurrence did not differ significantly between groups, Fisher's Exact test, p = 0.12). Hence, amusics reported having verbal earworms more than did controls, but the frequency of occurrence of these earworms (in participants who experienced verbal earworms) was similar in the two groups.
Features of Verbal Earworms
Details about the nature of these verbal earworms are reported in Table 2. In the two groups, verbal earworms were most often described as being of different types (e.g., poetry, famous speech, advertisement), or as being always of the same type, but made of different sentences. The proportion of the four response choices (see Table 2) did not differ significantly between the groups (Fisher's Exact test, p = 0.17).
Further questions aimed at characterizing verbal earworms (missing data for 4 controls and 5 amusics, see Appendix for details). For familiarity (Figure 2D), participants in both groups reported that verbal earworms were mostly familiar: for controls, n = 23 [85%] for familiar, vs. n = 4 [15%] for unfamiliar; for amusics, n = 15 [88%] for familiar, vs. n = 2 [12%] for unfamiliar; the proportion of the two responses did not differ significantly between groups (Fisher's Exact test, p = 0.22). For liking (Figure 2E), participants in both groups reported that verbal earworms were mostly neutral material: for controls, n = 20 [74%] for neutral, n = 5 [19%] for dislike, and n = 2 [7%] for like; for amusics, n = 8 [47%] for neutral, n = 7 [41%] for dislike, and n = 2 [12%] for like; the proportion of the three responses did not differ significantly between the groups (Fisher's Exact test, p = 0.08).
The type of sentences heard as verbal earworms are presented for both groups in Figure 3B, with “Part of a conversation you had” and “Advertising” being the most frequent answers in both groups (note that participants could give several answers).
Subjective Experience of Verbal Earworms
Control participants who reported having verbal earworms reported these earworms as a neutral (n = 14 [47%]) or a negative (n = 13 [43%]) experience, and only rarely a positive experience (n = 3 [10%], missing data for one participant). Amusic participants reported equally verbal earworms as a negative (n = 8 [36%]), a neutral (n = 7 [32%]), or a positive (n = 7 [32%]) experience (the proportion of the three responses differed significantly between groups, Fisher's Exact test, p = 0.01).
When asked how the verbal earworm terminates, control participants mostly reported that it goes away by itself (n = 23 [77%]), yet a few deploy a voluntary strategy to terminate it (n = 4 [13%]) or report that it is almost impossible to get rid of it (n = 3 [10%], missing data for one participant). Almost all amusic participants reported that it goes away by itself (n = 19 [90%]), a few deploy a voluntary strategy to terminate it (n = 2 [10%]), none reported that it is almost impossible to get rid of it (missing data for one participant). The proportion of the three responses differed significantly between groups, Fisher's Exact test, p = 0.04. Among the six participants who reported using a voluntary strategy to get rid of the verbal earworm, two controls reported changing activity (e.g., switching on the TV), two amusics to think about something else (e.g., to-do list), and two controls to reflect on the repeating words (described as ruminations).
Correlations Between Musical and Verbal Earworms
To assess the link between musical and verbal earworms, we created a score for each material (musical or verbal), with 0 for participants who reported not experiencing earworms, and 1 to 4 for participants who reported experiencing earworms, coding their frequency of occurrence with 1 meaning “a few times/year” and 4 meaning “almost daily” (taken directly from the second question of the questionnaire, see Appendix). The correlation between these two “frequency report scores” for musical and verbal earworms was assessed across all participants (n = 99 controls and n = 40 amusic, missing data for the verbal part of the questionnaire for one amusic participant), and separately in each group, using Spearman's rank correlations. The correlation between musical and verbal earworms was significant across all participants (rs(137) = 0.320; p < .0001), for controls (rs(97) = 0.297; p = .003), and for amusics (rs(38) = .591, p < .0001), see Table 3.
Correlations (Spearman's r) between musical earworms and verbal earworms, age, MBEA pitch Score, MBEA incidental memory score, and music listening habits (with (a) how often they listen to music intentionally, (b) how often they listen to music alone, (c) how often they listen to music with company, (d) how often they sing when they are alone, and (e) how often they sing in public).
Note: Correlations are reported for all participants across the two groups as well as for control participants and amusic participants separately, with degrees of freedom (df) and p-values (p). Uncorrected significant correlations are indicated with *. Correlations that remained significant after Bonferroni correction are indicated in bold. Corrections were made for nine tested correlations, separately for each concerned participant pool (i.e., threshold was corrected to p = .006).
Correlations Between Musical Earworms and age, Music Memory, or Music Listening
Using the “frequency report scores” for musical earworms (i.e., frequency of occurrence of musical earworms: from 0: never, to 4: almost daily), we assessed the link between musical earworms and age, as well as music-related variables, across all participants and separately in each group, using Spearman's rank correlations (Table 3).
Age correlated negatively with earworms, across all participants (rs(138) = -0.325, p < .0001), for controls (rs(97) = -0.320, p = .001), and for amusics (rs(39) = -.398, p < .001), suggesting that the older the participants, the fewer earworms were experienced.
To assess the link between musical earworms and musical memory, we used the MBEA pitch score (average performance on the first three subtests of the battery, which are delayed-matching-to-sample tasks with pitch changes, Liu et al., 2010) as a measure of melodic (pitch) short-term memory ability, and the MBEA incidental memory score (performance on the last sub-test of the battery, which tests whether participants recognize the melodies used in the other 5 subtests) as a measure of melodic long-term memory. MBEA pitch score correlated with musical earworm occurrence across all participants (rs(138) = .217, p = .01), but not within groups (for controls, rs(97) = 0.032, p = .75; for amusics, rs(39) = .085, p = .60). MBEA incidental memory score correlated with musical earworm occurrence across all participants (rs(138) = 0.296, p = .0004), and separately for amusics (rs(39) = .381; p = .01), but failed to reach significance for controls (rs(97) = .178, p = .078).
Finally, to assess the link between musical earworms and music listening habits, we took advantage of a questionnaire about musical experience and participants’ relation to music (based on questionnaires of McDonald & Stewart, 2008; Peretz et al., 2009; Sloboda et al., 2005) that participants had filled out as part of our general screening procedure. Fifty-four participants (35 controls and 19 amusics) responded to this questionnaire. Among more than 90 questions, this questionnaire included questions that are relevant based on previous research investigating INMIs: Participants ranked on a 1-to-5 scale (1: never, 5: very often) how often they listen to music intentionally, how often they listen to music alone, how often they listen to music with company, how often they sing when they are alone, and how often they sing in public. Across all participants, the occurrence of musical earworms was correlated with the frequency of listening to music intentionally (rs(52) = 0.369, p = .006), with the frequency of listening to music alone (rs(52) = 0.418, p = .002), and with the frequency of singing alone (rs(52) = 0.275, p = .04; for other correlations p > .22). None of these correlations reached significance in controls (p > .24), but the first two were significant in amusics (listening to music intentionally, rs(17) = 0.741, p = 0.0003; listening to music alone, rs(17) = 0.649, p = 0.0003) and the third one was marginally significant in amusics (singing alone, rs(17) = 0.444, p = .06; for other correlations p > .17), see Table 3 for details.
Discussion
We investigated musical and verbal earworms in non-musician control participants and amusic participants with a questionnaire adapted from Halpern and Bartlett (2011), in a large-scale study with a total of 140 participants. In the following, we first discuss how the data from non-musician controls replicated and extended previous findings on musical earworms in populations with various musical backgrounds (e.g., Liikkanen et al., 2015; Müllensiefen et al., 2014; see Liikkanen & Jakubowski, 2020, for a review), and then we discuss the data of our amusic participants and show how musical earworms further also our understanding of the phenomenon of congenital amusia. Finally, we discuss the potential relationship between musical and verbal earworms.
Experience of Musical Earworms in the General Population
All control participants (n = 99) reported knowing the expression “having music/song in your head”, and all but one indicated having experienced earworms. This observation is in agreement with Halpern and Bartlett's report (2011) that all participants (n = 18) had already had the experience of music playing over and over in their mind. Our control participants reported experiencing this phenomenon “almost daily” or “a few times a week” for about 66%, which is comparable to Halpern and Bartlett (2011), indicating 72% of their participant pool to choose these two response options. Musical earworms were of various musical styles for most participants (71%), as reported by 66% of participants in Halpern and Bartlett (2011). Responses also indicated that earworms were mostly familiar music with lyrics, as in Halpern and Bartlett (2011). This observation also fits with other findings, for example that INMIs are three times more often familiar rather than unfamiliar songs (Hyman et al., 2015), and that more earworms were reported for music with lyrics than for instrumental music (e.g., Liikkanen & Jakubowski, 2020, Beaman, 2018). Beaman (2018) confirms this latter observation in an experimental setting, notably by using the same tunes as either lyrical or instrumental version. This finding suggests that the observation of tunes with lyrics as frequently experienced INMIs cannot be explained solely by some enhanced mere exposure for tunes with lyrics than instrumental music.
Most of our control participants reported that earworms consisted of music they did appreciate, rather than dislike or neutral. In contrast to Halpern and Bartlett (2011), where 16 out of 18 participants reported that the experience was typically pleasant or mixed/neutral, our participants referred to earworms almost as frequently as negative experience (40%) than as positive experience (35%). This variation integrates into the different patterns of experience previously reported in the literature, with Beaty et al. (2013) reporting it as mostly positive experience (N = 190), Williamson et al. (2014) as neutral or pleasant (N = 1046), while Liikkanen et al. (2015) as a more negative experience (exploiting over 80,000 tweets).
While most control participants reported that it was difficult to get rid of musical earworms, they also acknowledged that earworms also go away by themselves (78%), as in Halpern and Bartlett (2011; 67%). Interestingly, the strategies reported by both amusics and controls are similar to those that have been previously reported, such as listening to or singing the target piece or other pieces and/or switching activities (see for example Williamson et al., 2014, for Finnish and English participants).
Across all participants, the occurrence of musical earworms was correlated with the frequency of listening to music intentionally or alone and the frequency of singing alone (albeit not reaching significance after correction for multiple comparisons). However, none of these correlations reached significance in controls separately (but see below for amusics), thus only partially confirming Müllensiefen et al. (2014)'s observation that INMI are occurring more often and last longer for individuals with enhanced music listening or practice. However, our controls were mostly non-musicians, while Müllensiefen et al. (2014)'s participants of their online study included a wide spread of musical background, including musician participants, which might also include enhanced variability regarding music listening or singing behavior.
Regarding individual differences of INMI, Liikkanen and Jakubowski (2020) review that only a weak negative relationship was observed between age and INMI frequency, with data suggesting a mediating factor of musical practice. In our study, mostly non-musicians were tested, confirming a significant negative relationship (see also Liikkanen, 2012); the older they were, the weaker the reported frequency of occurrence of INMI.
We also assessed the link between INMI and long-term memory capacity, notably by taking advantage that all participants had been tested with the MBEA (to ascertain absence of amusia). In particular, we used the MBEA subcomponent testing for incidental memory to investigate the potential link between INMI and long-term memory capacity. Across all participants, the incidental memory score of the MBEA correlated significantly with musical earworm occurrence, further supporting the link between INMI and memory (e.g., Byron & Fowles, 2015). The correlation with the MBEA pitch score, which reflects short-term memory capacity, did not reach significance when corrected for multiple comparisons. To further address this difference, we took advantage that 16 participants of our present participant pool (7 amusics, 9 controls) had also participated in a previous study investigating both short-term memory and long-term memory (with a familiar melody identification task) (Graves et al., 2019). For the present purpose, we averaged performance across conditions for short-term and long-term memory tasks, respectively, and correlated these performance with the musical earworm occurrence. Congruently with the results observed here with the MBEA scores, performance correlated significantly with the long-term memory performance (r = .631, p = .009, df = 14), but not the short-term memory performance (r = .438, p = .090, df = 14). Because of the relatively small overlap in participants between the two studies, we combined control and amusic participants for the present additional analyses. Future research should now further investigate the differential link between INMI and short-term versus long-term memory capacity, respectively.
In sum, the present data provided converging evidence with previous reports regarding INMI, but here with a French (non-musician) population, which was rather understudied up to now, as most data sets stem from Australia, India, Finland, the UK, and USA (see Liikkanen & Jakubowski, 2020, for a review).
Experience of Musical Earworms in Congenital Amusia
While all but one amusic participants reported knowing the expression “having music/song in your head”, they almost all reported experiencing this phenomenon, yet significantly less frequently than controls (weekly or monthly rather than daily or weekly). As for controls, musical earworms were of various musical styles for most amusic participants (71% in both groups). Also as for controls, earworms were mostly familiar music with lyrics, even though the difference between familiar and unfamiliar music response options was less pronounced for amusics. This response pattern is in agreement with previous data showing that also amusic individuals might have a mental musical lexicon; they can experience feelings of familiarity and also somewhat recognize songs, even though the access might take more time, be related to a feeling of uncertainty and lead to weaker recognition scores (e.g., Graves et al., 2019; Tillmann et al., 2014). Interestingly, the distribution across the different musical style categories looked very similar for amusics and controls, with pop music being reported as the most frequent category, an observation keeping with Halpern and Bartlett (2011; Experiment 2).
Like for control participants, amusic participants reported that earworms consisted of music they did appreciate, rather than music that was judged neutral or music they disliked. This might be related to some preserved emotional processing for music in the amusic population. Indeed, Gosselin et al. (2015) reported normal recognition of emotions in musical clips for amusic individuals, albeit with a reduced use of cues related to change of mode in comparison to controls. And even though amusics showed a deficit to explicitly label the emotion expressed by music in a different study using real recordings of orchestral music, their emotional intensity judgments are similar to controls, thus pointing again to rather preserved implicit processing (Lévêque et al., 2018), which thus might contribute to the observed response pattern here too.
Some of our results showed differences between earworms of control participants and of amusic participants. These differences can be linked to characteristics observed in congenital amusia and to the involvement of memory in the phenomenon of INMI. Indeed, even though both populations judged similarly that it is difficult to get rid of musical earworms (63% of amusics and 69% of controls reported having such difficulties), amusics’ responses suggest that the INMI are less persistent than for controls: amusics reported to rely less on voluntary strategies than did controls (8% of amusics, 19% of controls) and that the phenomenon stops rather by itself (reported by 86% of amusics, but 78% of controls).
These subjective reports suggesting more volatile INMI in amusics than in controls can be linked to previous reports showing that musical memory traces are less well established in amusia (e.g., Albouy et al., 2016; Gosselin et al., 2009; Tillmann et al., 2014). One could argue that this difference in volatility might be related to the difference in reported subjective experience, notably with amusics reporting it mostly as neutral, followed by positive and then only negative, while the most frequent answer in controls was negative, followed by positive and then neutral (with, for example, the involuntary or uncontrollable aspect that might be experienced as negative). Future research will need to further investigate this potential link, also with more detailed questionnaires allowing for responses including more precise estimates of frequency of occurrence, for example.
A link with memory and musical engagement is also suggested by participants’ responses to our general musical questionnaire, namely listening to music intentionally or alone is correlated to the frequency of INMI in amusia. Furthermore, the MBEA incidental memory score, an indicator for individuals’ long-term memory capacity, correlated with musical earworm occurrence (even though not reaching significance in the amusic group when corrected for multiple testing), further revealing the link between INMI and memory. For amusics only, this correlation with musical earworm occurrence was not significant for the MBEA pitch score, which relies on short-term memory only, further suggesting a link with long-term memory and INMI. Note that, as for controls, age was negatively correlated with frequency of INMI.
In sum, our amusic participants’ data reveal that despite their condition, they experience INMI, even though less frequently and less persistently than controls. Amusics’ INMI experience reflect certain characteristics that can be linked to their condition, such as their decreased memory for musical material, for example, or less frequent intentional music listening or singing.
While surprising that even amusics might have INMI, this observation is in agreement with a very recent report that came out while writing our manuscript. Loutrari et al. (2022) reported no difference in a global earworm score between controls and amusics. The more detailed responses of our questionnaire (adapted and extended from Halpern & Bartlett, 2011) revealed some alterations in amusics’ INMI that can be connected to other characteristics of amusic individuals (e.g., memory impairments).
The observation that also amusics can experience INMI can be linked to other recent findings showing that amusic individuals have still some implicit capacity to process musical material (e.g., Tillmann et al., 2012; Tillmann, Lalitte, et al., 2016). Indeed, one might argue that musical earworms are less explicit than inner singing for rehearsal or finding a melody in one's mind. It could be referred to as “spontaneous recall”, thus not requiring explicit search or effort. INMI might thus benefit from implicit processing, which has also shown its power in the amusic population, notably for pitch processing or tonal structure processing (e.g., Omigie et al., 2013; Tillmann et al., 2012; Zendel et al., 2015). The amusic individuals might also benefit from other dimensions in the “earworm material”, notably they might benefit from the lyrics associated to it. Consequently, one might wonder whether amusics are able to report musical earworms because of the memory of verbal information, and even more so as amusic individuals also report verbal earworms (see below).
Regarding the neural correlates, INMI might benefit from the default-network involvement. Indeed, Farrugia et al. (2015) reported a negative correlation between the frequency of INMI episodes and cortical thickness in two regions of the DMN (in particular, AG and ACC), providing evidence of shared mechanisms with other forms of self-generated thoughts. For congenital amusia, resting-state brain activation analyses have revealed not only altered connectivity between auditory cortices and auditory and frontal networks, but also showed an overconnectivity between Heschl's gyri and the core areas of the DMN (notably, medial prefrontal cortex, posterior cingulate cortex, and temporo-parietal junctions; Lévêque et al., 2016), which might perhaps allow amusic individuals to experience this phenomenon of INMI. However, future research of the potentially involved brain correlates and their potential deficit are needed.
Musical and Verbal Earworms
In addition to musical earworms, the questionnaire we used here (adapted from Halpern & Bartlett, 2011) includes an inquiry about verbal earworms. Interestingly, both control and amusic participants reported verbal earworms, while the proportion of amusics who did so was higher than that of controls. In comparison to musical earworms, the frequency of verbal earworms is decreased, with “a few times a month” being the most chosen category, in keeping with Halpern and Bartlett (2011), and did not differ between the two groups. Participants who experienced verbal earworms reported them to be either of different types (e.g., poetry vs. famous speech, 33%) or of the same type but with different sentences/words (33%), somewhat in line with Halpern and Bartlett (2011), where verbal earworms were mostly reported being of different types (67%; note, however, that only 9 participants reported verbal earworms in this earlier study). For both groups of participants, the verbal material was reported being mostly familiar and emotionally neutral, often coming from a conversation.
Beyond the observation that the verbal material was mostly familiar, the responses for verbal earworms further revealed some link to memory and some group differences. While both groups reported that the verbal earworms go away by themselves, this was the case for almost all amusic participants, but only 77% of the controls. This suggests that not only for musical but also for verbal earworms, the memory traces seem to be more volatile for amusics. This observation differs from reports for short-term memory results, notably with amusics showing deficits only for musical, but not for verbal material (e.g., Albouy, Schulze, et al., 2013; Tillmann et al., 2009; Williamson et al., 2010; Williamson & Stewart, 2010) and needs to be further investigated. One might also want to further investigate how this pattern evolves for voluntary mental imagery of both verbal and musical materials.
The potential link with more general cognitive processes involved in mental imagery is further supported by the here observed correlation of the frequency of musical and verbal earworms. Across both groups as well as within each of the groups, the two imageries seem related: the more musical earworms one reports, the more verbal earworms are reported too. However, beyond these similarities between musical and verbal earworms, which might be related to the involvement of domain-general processes, amusics reported less frequent musical earworms than did controls, whereas the reverse was observed for verbal earworms. These findings suggest some specificities of musical versus verbal earworms and highlight the need for further research investigating involuntary imagery across domains.
Conclusion and Perspectives
The present questionnaire investigation confirms that musical earworms are a common experience in the general population, even in non-musicians, and extend this finding to the case of congenital amusia, even if musical earworms are less frequent in this population (see also Loutrari et al., 2022). Verbal earworms also occur, but seem to be a less frequent experience. We here replicated a number of earlier findings about musical earworms: INMI are of diverse musical styles and consist mostly of familiar and liked music with lyrics. Although INMI can be quite sticky in memory, they are generally not perceived as a negative experience. Our present findings support a link between musical earworms and long-term memory, not only because of familiarity effects that have been previously described, but because of the differences observed between amusics and controls, and the correlation with the MBEA incidental memory score as well as the link with our previous data (Graves et al., 2019). Our findings with amusic participants also suggest that musical earworms could be related to implicit musical knowledge and processes that are largely spared in congenital amusia.
The present results open new perspectives to benefit from the investigation of the phenomenon of congenital amusia to further our understanding of INMI in general and its links to memory, emotion, consciousness, and voluntary imagery. Our study is a starting point with a first, rather basic questionnaire; the investigation can now benefit from the various other methods that the research domain of INMI has developed, such as in particular more refined questionnaires (e.g., the Involuntary Musical Imagery Scale of Floridou et al., 2015) or lab-based earworm creation or evocation.
Footnotes
Acknowledgements
We thank Jackson Graves for his help in the supplementary analysis combining musical earworm occurrence with short-term and long-term memory performance in a subset of the participants.
Action Editor
Solange Glasser, University of Melbourne, Melbourne Conservatorium of Music.
Peer Review
Karen Wise, Guildhall School of Music and Drama, Department of Music Therapy
One anonymous reviewer
Contributorship
BT, AC, and PA conceived the study. BT, AC, and LF were involved in gaining ethical approval, participant recruitment and data analyses. AP and CH were involved in participant recruitment and behavioral testing for screening. LF implemented the online version of the questionnaire. BT and AC wrote the first draft of the manuscript. All authors reviewed and edited the manuscript and approved the final version of the manuscript.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Ethical Approval
Participants signed a written informed consent prior to behavioral tests, and data collection with the questionnaire followed the appropriate regulation on personal data protection at the time of data collection (questionnaire registered with the CIL at the CNRS #13019).
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was conducted in the framework of the LabEx CeLyA (“Centre Lyonnais d'Acoustique”, ANR-10-LABX-0060) and of the LabEx Cortex (“Construction, Function and Cognitive Function and Rehabilitation of the Cortex”, ANR-11-LABX-0042) of Université de Lyon, within the program “Investissements d'avenir” (ANR-11-IDEX-0007) operated by the French National Research Agency (ANR).
Correction (June 2023):
The author's name Agathe Pralus was listed as lastname firstname. This has been corrected.
