Abstract
Nonverbal vocalizations in dementia are important clinically since they generally have been regarded as disruptive behavior that is disturbing. The aim of the present study is to describe the interactional pattern, including the prosodic package, of nonverbal vocalizations in a participant in a late stage of dementia. The acoustic analysis shows that the vocalizations do not differ significantly from the verbal utterances regarding mean fundamental frequency or pitch range. The mean fundamental frequency, F0, of the utterances from Anna was significantly higher than the mean F0 from the other elderly participants. The analysis demonstrates that there is a singing-like type of vocalizations that does not resemble the previously described patterns of nonverbal vocalizations. This pattern of the nonverbal vocalization does not resemble the intonation of Anna’s verbal utterances. The other participants perceive Anna’s vocalizations as potentially meaningful turns. Nonverbal vocalizations in clinical settings should be taken as communicative contributions.
Introduction
Pragmatic abilities are known to be impaired in people with dementia. 1 In late stages of dementia, noise making or disruptive vocalizations are rather common. 2 Nonverbal vocalizations are important clinically since they generally have been regarded as an asocial, disruptive behavior that is disturbing to caregivers and patients and are often interpreted as a sign of discomfort from the people with dementia. 3,4 There are different theoretical frameworks explaining these behaviors. They may be interpreted as a direct consequence of the dementia itself, as an expression of unmet needs, as an expression of noncommunicable feelings and experiences, or as an increased vulnerability to environmental factors. 4,5
Given recent years research about the social abilities of persons with Alzheimer’s Disease ([AD]; compare ref 6), it is interesting to investigate nonverbal vocalizations not as isolated phenomena but as part of a specific interactional context and particularly in what ways—if any—the interactional context affects the nature of the nonverbal vocalizations. As a consequence of the attempt to introduce a social perspective on noise-making and disruptive vocalization, we will use the term verbal and nonverbal vocalizations (VNVV) as a more neutral and appropriate term. 7
Verbal and Nonverbal Vocalization in Dementia
Language impairment is a regular feature in dementia, while language abilities are quite well preserved in normal aging. 8 –10 Language problems vary, both regarding severity and characteristics, in relation to the type and degree of dementia. However, language impairment is associated with behavior problems in dementia, when severity of dementia is controlled for. 11
Language impairment in dementia is not as well defined as aphasia acquired from focal brain lesions but includes word-finding problems, syntactic problems, and finally also a disturbance of pragmatic functions. 12 The linguistic problems are related to cognitive impairments in especially the episodic and semantic memory systems together with declining executive functions (compare ref 13).
In late stages of dementia, patients may be mute or restricted to palilalia or echolalia. There are also cases of abnormal laughter-like vocalizations in progressive aphasia. 14 These vocalizations did not show contextual sensitivity but occurred as an automatic output replacing speech.
It has been found that among all elderly patients between 48% and 55% of those patients that engage in nonverbal vocalizations have Alzheimer-type dementia and 30% have vascular dementia; while between 12% and 30% of patients diagnosed with some kind of dementia scream (for a review of research, see ref 4). It has also been found that nonverbal vocalizations occurred mostly in the evening, when the patients were alone, restrained, in the bathroom, or receiving care. 4
In a study by Nagaratnam et al, 15 4 categories of vocalizations (noise making) were identified; persistent screaming, perseverative vocalization, continuous chattering, muttering, singing or humming and swearing, grunting, and bizarre noise making. This categorization was based on medical records from 12 patients with AD and vascular dementia. The persistent screaming was described as shrieking, crying, or screaming. The perseverative vocalization was described as stereotyped utterances, for example, continuous counting. The continuous chattering, muttering, singing, or humming was described as constant unintelligible muttering or repeatedly speaking of the same topic and the swearing, grunting, and bizarre noise making were described as continuously swearing and shouting obscenities. All 4 behaviors occurred in association with agitation. 15
Ryan and his colleagues 16 presented a typology of different kinds of what they termed “nonverbal and vocal disruptive behaviors”: (1) noise making that appears purposeless and preservative; (2) noise making which is a response to the environment; (3) noise making which elicits a response from the environment; (4) “chatterbox” noise making; (5) noise making due to deafness; and (6) other noise making. In contrast to this typology, Cohen-Mansfield—who together with her research group probably has spent most time and effort on this phenomenon—suggested a dimensional typology. 3 Their typology of vocalizations includes 4 dimensions: (1) type of sound, (2) purpose of the sound, (3) response to the environment, and (4) levels of disruptiveness.
Interestingly, very few of these researchers have suggested nonverbal vocalizations among people with dementia being part of social interaction or having a clear social value besides being disruptive. In contrast, researchers interested in the nonverbal vocalizations of infants argue that early noncry vocalizations of infants are clearly social signals. 17 Caregivers spontaneously respond to 30% to 50% of these vocalizations. 18 Prelinguistic babbling often seems remarkably speech like since it seems to have adult-like prosody. It has been demonstrated that infants produce adult-like stress patterns and that they used variation in pitch, intensity, and duration in an adult-like manner. 19 There are several studies on how caregivers respond to vocalizations of infants and the importance for social and interactional development is emphasized. 18,20,21
The study of the acoustic characteristics of nonverbal vocalizations in elderly patients has to date been very limited. A study by Cohen-Mansfield et al 22 is an important exception. They performed acoustic analysis of temporal aspects and frequency of vocalizations, corresponding to rhythm and melody (intonation) of vocalizations, from 26 nursing home residents. Their results demonstrated that the vocalizations of agitated nursing home residents were short in duration and in high fundamental frequency. They did not find any indicators that could differentiate between types of dementia. However, no analysis of the interactional consequences of the nonverbal vocalization was made in the study. 22
In the following, focus will be on the prosodic aspects of nonverbal vocalizations, especially in relation to the organization of the ongoing interaction. This approach is based on previous research focused on pragmatic ability in people with dementia and the pragmatic impairment is viewed as a major source of communicative disruptions. With an interactional approach, the co-construction of understanding becomes central and well-functioning sequences are identified. 1 The aim of the present study is to describe the interactional pattern, including the prosodic packages, of nonverbal vocalizations in a participant in a late-stage of dementia. Vocalization data have been analyzed by means of acoustic measurements alongside the sequential analysis, allowing for the acoustic features, mainly mean fundamental frequency and pitch contours, to be related to the sequential aspects of the analysis. In the previous literature, analyses are mainly made of either the person with dementia or the caregiver. Thus, relating them to each other may add knowledge on how participants in interaction are influenced and influences each other also when one of them communicates with nonverbal vocalizations.
Materials and Methods
In the following, the nonverbal vocalizations of 1 person in 2 different situations will be used for analysis. In this sense, it is a case study aiming at delineating certain basic aspects of vocalizations in relation to social interaction. The intention is not to generalize the findings to the whole population of persons with AD or a subset of these but rather to show the possibility of the suggested methodological and analytical framework and also to point to certain implications of the approach concerning the understanding of VNVV.
The examples used are part of a video-recorded material consisting of several hundred hours of recording from a nursing home for persons with AD. In the material, interaction between staff and inhabitants, between persons with dementia without any staff present, are found. In this material, a small number of instances involving nonverbal vocalizations was identified and used for further analysis (For further descriptions of this material see ref 23.). The chosen key portions of the material were the only parts where the nonverbal vocalizations were present and possible to analyze.
Participants
The main participant in the present study is a woman, here called Anna. She is about 85 years old, has had AD for many years, and is approaching the third and final stage of this disease. She has profound problems with speech and language use in general and also with her hearing. She is very slow in her movements and has problems coordinating them, for instance, when she is eating. Sometimes she says something in a weak voice. More often she has problems producing words and instead vocalizes sounds. Her communication with others is very limited, and she mainly communicates with nonverbal vocalizations. In the present data, 2 other residents, B and C, in the nursing home participate in the interaction. These 2 ladies are also diagnosed with AD, although in a less advanced stage compared with Anna. The 2 ladies mainly interact with each other, but they also orient to Anna’s contributions in the interaction. The fourth participant in the present study is the researcher, R, who contributes to the interaction on some occasions. The researcher stayed as an observer in the nursing home during the months the data were collected.
Data
In the following, 2 episodes will be used for analysis. Both episodes evolve around the dinner table at the nursing home. In the first episode, some of the participants together with a nurse are having lunch. In the second episode, recorded at couple of days later, the 3 guests described above are lingering around the table after breakfast.
The episodes are part of video recordings covering longer stretches of time and also of an extended ethnographic presence for almost 6 months. The ethnographic fieldwork makes it possible to understand some of problems persons with AD have as part of the specific activity ecology of the nursing home. Behaviors, utterances, and actions that may appear as classical symptoms of AD become at least somewhat more understandable if they are seen as part of a specific context (eg, ref 23, for further information about the material).
Acoustic Analysis
The acoustic analysis was made both of verbal and nonverbal material. To measure the mean fundamental frequency, F0, as a whole for each participant, the mean F0 for each utterance was measured using the Praat software. 24 The mean F0 was then calculated by adding the values from each utterance and dividing the sum with the total number of utterances. To obtain the pitch range, minimum was measured and subtracted from the measured maximum F0 in utterance by utterance for each participant separately. The mean pitch range was then calculated in the same way as the mean F0. In order to check for statistically significant differences, both between Anna and the other participants and between Anna’s VNVV, Mann-Whitney U tests performed.
Sequential Prosodic Analysis
In the present article, a detailed sequential analysis was carried out in order to assess the structural and the intonational features of the participants, with special focus on Anna. A sequential analysis of interaction can be made in different ways. In a few earlier studies, conversation analytic (CA) methodology has been applied to analyses of conversation involving people with dementia. 1,25 In CA, understanding and meaning are viewed as collaboratively constructed by participants in interaction and every utterance is analyzed in relation to the context in which it occurs. The present data are analyzed sequentially in a CA-informed way, with a special focus on prosody.
Results and Analysis
In the following, the acoustic properties of nonverbal vocalizations from 1 person are used for analysis. Moreover, interactional aspects of the vocalizations will be taken into account.
Acoustic analysis
The participant, Anna, produces verbal speech with a varying degree of intelligibility in addition to her nonverbal vocalizations. From an overall perspective, the nonverbal vocalizations do not differ significantly (P = .2) from the verbal utterances regarding mean fundamental frequency or pitch range (Figure 1 ).

Acoustic properties of Anna’s verbal utterances in relation to the acoustic properties of her nonverbal vocalizations. MF0 indicates mean fundamental frequency; Min, lowest fundamental frequency; Max, highest fundamental frequency; range, pitch range (from the lowest to the highest measured fundamental frequency).
However, the mean fundamental frequency of the verbal utterances from Anna was significantly lower than the mean F0 (P < .0001) from the other elderly participants (Figure 2 ). The mean pitch range of the other participants was significantly higher (P < .0001) than the pitch range of Anna. In conclusion, this means that Anna’s verbal utterances were produced in low pitch with rather flat intonational contour (Figure 3 ).

Anna’s mean fundamental frequency in relation to the mean fundamental frequency of the other elderly participants.

Pitch range of Anna’s verbal utterances in relation to the pitch range of the other elderly participants.
Detailed analysis of the nonverbal vocalizations reveals 3 things. (1) A stereotyped prosodic pattern distinguishing the nonverbal from the verbal utterances, demonstrated by the acoustic measurements described above. (2) Two seemingly different pitch patterns of nonverbal vocalizations also appear; 1 singing-like pattern and 1 flatter pattern as shown in the sequential analysis below. (3) There are also indications of a particular prosodic patterns in Anna’s verbal utterances signaling anger, as shown in example 9 below where Anna uses a very distinct prosodic pattern with lower mean fundamental frequency and larger pitch range than her previous verbal utterances.
Sequential Prosodic Analysis
The included recordings were transcribed in accordance with the demands of CA, which means that a great variety of detail was taken into account, including pauses, hesitations, and overlaps, and so on. In the sequential analysis, prosodic features are illustrated by F0 curves of each contribution to the interactions, in accordance with the pioneering work by Local and Wells. 26,27
From an overall perspective, the sequential prosodic analysis demonstrates that Anna’s nonverbal vocalizations do interactional jobs. The analysis also revealed that the other participants orient to the vocalizations in different ways. In almost all instances of nonverbal vocalizations, Anna looks at the other participants, indicating that her basic pragmatic competence is preserved.
Example 1
This first example illustrates the typical, most frequent pitch pattern of the nonverbal vocalizations from Anna. These vocalizations are produced repetitively, and they are perceived as a monotonous singing. The same intonational pattern is repeated very often.
A= Anna
In the above example, the typical wave-like pitch pattern is shown, where the pitch starts rather high, around 225 Hz, rises up to about 275 Hz and then goes down again. The stereotyped pitch-movement up and down resembles singing more than talking, and it is repeated every time the singing-like nonverbal vocalizations occurs, see also example 2. The resemblance to singing is also enhanced by the fact that most of these vocalizations are articulated with the syllable “la,” which is the most frequently used syllable for humming in Swedish.
Example 2
A=Anna, B=other resident
This stereotyped prosodic pattern being similar to singing is also observed and oriented to by another participant, B, which is illustrated in by the comment on line 2, where she indicates that the singing is rather bad. The following 3 examples also demonstrate that the other participants orientate to the pattern as singing like. Nevertheless, the vocalizations, which seem to be Anna’s spontaneous way of expressing herself, do an interactional job since they make the other participants address Anna in interaction.
Example 3
A=Anna, B=other resident
In line 2, B imitates in a slightly exaggerated way the prosodic package used by Anna for the nonverbal vocalizations. This imitation is not responded to by any of the participants. Participant B also makes several meta comments on the vocalizations from Anna, demonstrating that she orients to them as some kind of singing, however, not resembling to any known song, see examples 4 and 5.
Example 4
A=Anna, R=researcher, B=other resident, P=pause
In line 1, Anna keeps on her singing-like vocalization, which is followed by a request for clarification by the participating researcher in line 2. In line 3, B comments on Anna’s contribution by asking if it is a song, and when this is not answered by Anna, B continues with commenting that it was not a very good song. This turn closes the sequence and is followed by a pause. In example 5, B also asks a question about the song.
Example 5
A=Anna, B=other resident
Anna is making her singing-like vocalization in line 1 and in line 2, B asks about the name of the song. This question does not get any answer from Anna, and after a short pause, B makes an imitation of the intonational pattern of Anna’s vocalization. Anna orients to this imitation by continuing her vocalization in the same manner, line 3. This kind of stereotyped singing-like vocalization does not correspond to any previously described nonverbal vocalizations. 15,22
The other repeated pitch pattern of the nonverbal vocalizations is more monotonous than Anna’s pattern previously described above. Anna also uses this pattern less frequently, and sometimes, she articulates it with a less singing-like syllable.
Example 6
A=Anna, CG=caregiver
In example 6, the syllable “aj” is produced at a rather high pitch, about 240 Hz, with very little variation. This utterance/vocalization does not get any response from the other elderly participants, but after a while, someone working at the nursing home, CG, steps up and asks Anna whether she is feeling alright. There are also instances where the syllable “la” is used, and this pattern is commented on, but not with any reference to singing.
Example 7
A=Anna, B=other resident, C=second other resident, P=pause
In line 2, Anna makes a nonverbal vocalization ending with a prolonged “a:.” This vocalizations does not get any response. In line 4, the pattern is repeated. B comments the vocalization in line 5, however, not addressed directly toward Anna even though it gets a verbal response from Anna in line 6. Also in this example, Anna’s vocalizations are produced with a high pitch, above 250 Hz. These vocalizations are also rather short in duration and the pattern with high pitch and short durations is in accordance with the pattern previously described by Cohen-Mansfield et al. 22
On several occasions, Anna’s nonverbal vocalizations are interrupted by responses from the other participants, most often the participating researcher. In these sequences, Anna responds with verbal utterances with a varying degree of intelligibility. The intonational pattern of the verbal utterances are less stereotyped than the nonverbal vocalizations and their mean fundamental frequency is slightly lower (although not significantly).
Example 8
A=Anna, B=other resident, R=researcher, P=pause
Anna keeps making nonverbal singing-like vocalizations in line 1 in overlap with a contribution from B. The participating researcher, R, asks a question in line 3 and Anna responds verbally and initiates a new topic in line 4. The pitch contour in this utterance does not resemble the singing-like pattern demonstrated in examples 1 to 3, the fundamental frequency is slightly lower as well as the pitch range. By asking the question about boredom in line 3, the participating researcher offers an account for the meaning of the nonverbal vocalization. The need for candidate understandings demonstrates that the difficulties of understanding are also a joint problem of interaction. This might be an expression of a “vicarious voice,” 28 from the researcher for Anna, and in this example, this contribution interrupts the singing-like vocalization and elicits verbal communication.
In 1 sequence shown in example 9, Anna demonstrates anger in a verbal utterance, mainly through a prominent prosodic package.
Example 9
A=Anna, CG=caregiver, C=second other resident, P=pause
In line 1, Anna is making a nonverbal vocalization, and in line 2, another participant orients to this as an expression of pain, probably depending on the syllable “aj” that Anna uses in this example. This contribution is not responded to by Anna, and participant C, who is well acquainted with Anna, tells Anna to stop vocalizing in a rather soliciting way (line 3 and 5). Anna responds verbally but unintelligibly in line 6, and in line 7, C tells her to eat her sandwich. In line 8, Anna produces a very distinct phrase with lower mean fundamental frequency and larger pitch range than her previous verbal utterances. C orients to this expression of anger by commenting that the sandwich is good.
In sum, the sequential analysis demonstrates that the most frequent pattern of Anna’s nonverbal vocalizations follow a singing-like pitch contour, which is also oriented to as singing by the other participants in the interaction. Nevertheless, also the singing-like contributions make the other participants address Anna and thereby give her some attention. The analysis also reveals another flatter pattern resembling to the previously described acoustic patterns. 22 In addition, it is demonstrated that the nonverbal vocalizations may be interrupted by contributions by the other participants. Furthermore, Anna demonstrates her interactional competence by expressing anger in an adequate and intelligible way. It also becomes clear that the other participants perceive Anna’s vocalizations as distinct sound patterns, which they can imitate (compare example 5 above) and also a potentially meaningful turn. 7
Concluding Discussion
It is clear that the participant in the present case study has severe language problems, sometimes resulting in, for instance, nonverbal vocalization. In the present analysis, it is demonstrated that there is a singing-like type of nonverbal vocalizations that does not resemble the previously described patterns of nonverbal vocalizations. 15,22 The singing-like intonational pattern of the nonverbal vocalization does not resemble the intonation of Anna’s verbal utterances, which adds to the notion that this is a routinized pattern connected to the nonverbal activity. In previous research, it has also been argued that nonverbal vocalizations are nonsocial and disruptive, 3,4 but in the present data, it is also demonstrated that the participants orient to the nonverbal vocalizations in an interactional way.
The intonational pattern of Anna’s verbal utterances is different from the prosodic package of the nonverbal vocalizations. It also differs to some extent from the other elderly participants so that it has an overall lower mean fundamental frequency and less pitch variation. On one occasion, Anna signals anger (example 9), and in this example, Anna is able to use prosodic features in order to signal emotional state. This example also demonstrates that Anna orients to negative and condescending comments with anger. This is an important finding that may have been overlooked if only the structural aspects of language had been taken into account. This is also an important clinical finding since the prosody of unintelligible verbal utterance is rarely made.
The laughter-like vocalizations described by Rohrer and colleagues 14 is claimed to be replacing speech, which also seems to be the case of the singing-like vocalizations in the present study. However, acoustic analysis of laughter-like vocalizations revealed that they were dysrhythmic and fragmentary indicating motor disorganization. In contrast to this, the singing-like vocalizations described here is very rhythmic and recurrent, which does not indicate any motor disorganization.
There are obvious parallels between vocalizations in infants and nonverbal vocalizations in people with dementia, mainly regarding the acoustic features, such as high pitch and large pitch variation. In previous research, it has been demonstrated that vocalizations in infants has a social functions, and it is most often responded to by caregivers as interactionally relevant. 17,18 The interactional importance of nonverbal vocalizations in people with dementia is less clear, although it is responded to by interactional partners, 7 which is also demonstrated in the present analysis.
It is interesting to note that both the participating researcher and the other elderly participants orient to Anna’s nonverbal vocalizations and respond to them in different ways. Sometimes, they treat them as signs of discomfort responding with questions about pain or boredom. In these sequences, the nonverbal vocalizations are shown to be possible to interrupt by initiatives from the other participants (also see Hydén 7 ). The fact that it is possible to interrupt the nonverbal vocalizations has obvious clinical implications since it points to the importance of addressing this seemingly nonsocial behavior. This finding adds to the knowledge on the effectiveness of communication interventions on agitation in people with dementia, 29,30 stressing the importance of positive and neutral communicative interventions and a dispreference for negative (anger provoking) interventions.
Footnotes
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
The authors disclosed the receipt of the following financial support for the research, authorship, and/or publication of this article: This research has partially been funded by Riksbankens Jubileumsfond, RJ.
