Ultrasound Tongue Imaging in Research and Practice with People with Cleft Palate ± Cleft Lip

Abstract

Ultrasound tongue imaging is becoming popular as a tool for both phonetic research and biofeedback for treating speech sound disorders. Despite this, it has not yet been adopted into cleft palate ± cleft lip care. This paper explores why this might be the case by highlighting recent research in this area and exploring the advantages and disadvantages of using ultrasound in cleft palate ± cleft lip care. Research suggests that technological advances have largely overcome some of the difficulties of employing ultrasound with this population and we predict a future increase in the clinical application of the tool.

Keywords

ultrasound articulation biofeedback

Introduction

Speakers with cleft palate ± cleft lip (CP ± L) often have persistent difficulty with speech. Errors can be obligatory, for example, nasal realisation of voiced plosives, or compensatory, maladaptive articulatory placement to compensate for anatomical differences.¹ Many of these compensatory errors affect lingual articulation, for example backing of alveolar consonants, either to other places within the oral cavity, such as velar/uvular or posterior to the oral cavity, for example glottal. Over the last 40 years there has been interest in measuring this lingual articulation instrumentally. Instrumental techniques can show us articulatory errors which are hard to identify and transcribe and (some) instrumental techniques can also be used in intervention. There are four main techniques for studying tongue movement: electropalatography (EPG), Magnetic Resonance Imaging (MRI), Electromagnetic Articulography (EMA), and Ultrasound Tongue Imaging (UTI).² The technique of choice for speakers with CP ± L has historically been EPG.³ EPG displays tongue-palate contact⁴ but not tongue-shape. From EPG we have learnt that speakers with CP ± L show errors which we cannot always identify with phonetic transcription.⁵ For example, mid-dorsum palatal stops transcribed as velars; increased variability between repeated productions of the same consonant⁶; and covert contrasts in which two contrasting speech sounds such as /t/ and /k/ are perceived by a listener as identical, yet produced in subtly different ways.⁷ Identification of these errors can alter the type of treatment chosen. For example, a child with a covert contrast requires a motor-based articulatory intervention whereas a child with two consonants produced in an identical manner often needs a phonological approach. EPG can also be used to determine treatment targets⁸ and to treat compensatory errors when used as a biofeedback tool.⁴ Of the other techniques, MRI shows the entire vocal tract but it has not been used extensively for biofeedback because it is expensive, speakers must lie prone, affecting the position of the tongue root, and it is noisy, making acoustic recordings challenging. Similarly, EMA, which uses flesh-point tracking of (usually) three points on the tongue, is expensive and invasive and biofeedback applications are scarce in the literature. Ultrasound was first used for biofeedback, though not with CP ± L, in the 1980s. With this technique, an ultrasound probe is placed under the chin and either a mid-sagittal or coronal view of the tongue can be seen in real-time (Figure 1). Ultrasound is easy to use, non-invasive, and safe since it does not use ionising radiation.⁹ Despite this, its adoption into CP ± L research and practice, like most new tools,¹⁰ has been slow. Despite being in use since around the same time as EPG, historically EPG has won out as the tool of choice for CP ± L. Recent advances in ultrasound suggest this position is changing. Aside from the challenges of implementing change in healthcare systems, historically there were technical challenges with ultrasound: machines were cumbersome, expensive, had slow framerates, and ultrasound images were difficult to analyse. Most of these issues are now largely solved with the introduction of affordable compact systems with high framerates and analysis methods have seen significant improvements,¹¹ and are set to improve further with advances in machine learning.¹² Studies using ultrasound to treat speech sound disorders in children without CP ± L have rapidly increased¹³ and further studies are underway.^14,15 A framework for the use of ultrasound in clinical practice is now available and is endorsed by the UK Royal College of Speech and Language Therapists¹⁶ and training in how to use ultrasound is available from universities, in an open access manual,¹⁷ and online at seeingspeech/speechstar.ac.uk.¹⁸ We predict that ultrasound will become an important tool for research and treatment in CP ± L, largely replacing other articulatory instruments as it has done for other types of speech sound disorder.¹³ Here we summarise what ultrasound can, and cannot, be used for by giving an overview of recent research specifically in CP ± L and conclude with future directions for this tool.

Figure 1.

Ultrasound images of the tongue. Top left: mid-sagittal image with the tongue tip to the right. Bottom left: coronal image. In both images the brightest white line is the tongue surface. Right: Ultrafit headset with probe positioned for mid-sagittal view.

Applications of Ultrasound Tongue Imaging

Mid-sagittal ultrasound shows almost the entire surface of the tongue from root to tip (Figure 1, left). Uvular and pharyngeal articulations, which occur in people with CP ± L,¹⁹ are clearly visible.¹¹ Arguably, ultrasound is a far better tool for measuring retracted articulations than EPG or EMA as it can show post-velar articulations. This makes ultrasound ideal for treating compensatory errors involving backing of any type. In EMA the most posterior coil is usually attached to the back, not root, of the tongue²⁰ and in EPG post-velar articulations are displayed as an ambiguous “open pattern” (ie, no tongue-palate contact).⁶ Bressmann et al.²¹ used ultrasound to identify that one speaker with CP ± L showed double articulations of a glottal plus pharyngeal articulation for /k/ which were auditorily perceived as only glottal stops. In the same paper, several speakers were shown to have mid-dorsum palatal stops in place of /k/. This demonstrates the use of ultrasound for identifying both covert contrasts and post-velar articulations. Identification of these errors can help clinicians target the precise error in intervention. Similarly, Cleland and colleagues²² developed a method for classifying lingual errors from ultrasound and demonstrated that in 39 children with CP ± L ultrasound could identify covert errors such as double articulations and retroflexion. When used in the mid-sagittal view, then, ultrasound is arguably the technique of choice for identifying differences in tongue shape and movement, helping to inform theoretical models and influence treatment plans. However, lateralised articulations, such as lateral fricatives, are common in CP ± L⁶ and these are not easily viewed with mid-sagittal ultrasound. In contrast, lateral fricatives are easily visualised and quantified in EPG because this technique shows tongue-palate contact across the entire palate. However, raising or lowering of the sides of the tongue can be visualised with coronal ultrasound^23,24 (Figure 1). Nevertheless, this is problematic because it is very difficult to determine which coronal slice of the tongue is being imaged. This weakness of coronal ultrasound can be overcome using 3D ultrasound imaging,²⁵ but so far this technique is cost-prohibitive and framerates are slow.

Another challenge with ultrasound has been analysis. While EPG consists of a normalised set of on/off contacts and EMA consists of a small, finite, number of sensors, ultrasound images are grainy and suffer from artefacts. Moreover, the image on the screen is not normalised and probe placement affects translation and rotation of the image. Because of these issues, most of the small number of studies using ultrasound with CP ± L have employed visual inspection methods.^21,22,26 This method requires experienced clinical phoneticians viewing recordings of ultrasound videos and making judgments about tongue shape and movement. In a sense, this is like impressionistic phonetic transcription, but with an added visual modality. Cleland et al.²² showed that combining ultrasound and audio leads to better inter-transcriber agreement and identification of covert contrasts. This more accurate assessment method could lead to improved treatment plans, but this is yet to be tested. This qualitative method of evaluating ultrasound is similar to some EPG studies which describe and categorise EPG patterns⁶ and is quick and easy for clinicians wishing to use ultrasound in assessment before intervention.

Quantifying Ultrasound

For ultrasound to be used to compare tongue shapes within and between speakers, either to measure changes post-intervention or to identify tongue shapes which differ from speakers without CP ± L, quantification is needed. In the phonetics literature of typical speech, ultrasound has become an increasingly important tool.²⁷ This has led to several advancements in quantitative ultrasound methods including development of probe stabilising headsets²⁸ (see Figure 1 for an example of a light weight headset) and systems which correct for head movement.²⁹ Both of these make the ultrasound image easier to interpret and measurements more accurate. When the headset is used for biofeedback, it makes the image more stable for the client and clinician. Methods for tracking the surface of the tongue³⁰; and machine learning for classifying speech errors³¹ show promise for developing automatic assessment tools. So far, quantitative ultrasound studies of CP ± L are scarce. Roxburgh and colleagues³² compared tongue contours statistically in covert contrasts in two speakers with CP ± L. They showed that ultrasound can be used to quantify the size of difference between phones produced with a covert contrast, and measure change during intervention. A further recent paper by Cleland et al.³³ used the dorsum excursion index,³⁴ to measure the relative excursion/height of the back of the tongue during production of high-pressure consonants. They hypothesised that children with CP ± L would show increased raising of the tongue back, due to an attempt to compensate for either current or resolved velopharyngeal insufficiency. They compared 31 children with CP ± L to 29 typically developing children. Although some individual children showed an unusually high and back tongue posture, they did not find group differences. They attribute this to the fact that many of the children with CP ± L in the group had normalised speech. Nevertheless, the study provides proof of concept that it is possible to use ultrasound metrics to compare groups of speakers with and without CP ± L and that it is possible to identify unusual raising of the tongue body with ultrasound, which in turn could be treated with ultrasound biofeedback.

Biofeedback Applications

One of the main advantages of ultrasound is that it can be used in real-time, making it ideal as a biofeedback intervention. Biofeedback enables some speakers with persistent speech sound disorders to quickly correct articulations in one to two sessions.³⁵ It therefore has the potential to improve the effectiveness and efficiency of articulation therapy, although children younger than age five or with additional learning needs may have difficulty understanding the ultrasound image.⁸ It has advantages over EPG in this application as it can be used without any individualised hardware and children do not need to have stable dentition. It is surprising, then, that this tool has not yet been widely adopted into cleft palate care. Two small-scale low-quality intervention studies, both with only two children, showed that ultrasound has potential as a biofeedback tool with CP ± L.^36,37 The scarcity of literature in this area is surprising given that the number of studies has been increasing in other populations over the last 10 years.¹³ Possible reasons for the slow adoption in CP ± L include the fact that lateralisation errors are sometimes an intervention target²⁴ but lowering and raising of the sides of the tongue can only be viewed in the coronal plane and this is technically difficult. However, backing errors including palatalisation of fricatives and backing to velar/uvular and pharyngeal, which are common, are ideal for treatment and slow adoption is likely due to the fact equipment costs have only just begun to become affordable in the last few years and it is only in the last year that a framework for clinical practice¹⁶ has been published. We predict that ultrasound will increasingly be adopted as a clinical tool, however, larger, more robust intervention studies specifically with CP ± L are clearly required alongside improvements in analysing ultrasound images automatically using machine learning approaches. One such study is currently underway in the UK,¹⁴ but further larger-scale studies will be needed.

Conclusion

Ultrasound shows promise as an articulatory tool for research and practice with people with CP ± L. There is still much work to do on improving the affordability of the technology and streamlining analysis. Implementation of ultrasound into clinical practice will require more robust evidence of its effectiveness, and a greater understanding of the barriers to its use in speech and language therapy clinics, for example training needs and ongoing support needs. In the meantime, we encourage researchers and clinicians interested in adopting this technique to consult the increasing evidence base including that in the phonetics literature and the literature on other types of speech sound disorders, and the growing number of online resources such as www.seeingspeech.ac.uk/speechstar before applying this knowledge to CP ± L.

Footnotes

Acknowledgements

This perspectives piece is underpinned by research made possible by grants from the Engineering and Physical Sciences Research Council EP/I027696/1, Action Medical Research GN2544 and the Chief Scientist Office TCS/20/02.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Action Medical Research, Engineering and Physical Sciences Research Council, Chief Scientist Office, (grant number GN2544, EP/I027696/1, TCS/20/02).

ORCID iD

Joanne Cleland

References

Kotlarek

Krueger

. Treatment of speech sound errors in cleft palate: A tutorial for speech-language pathology assistants. Lang Speech Hear Serv Sch. 2023;54(1):171-188. doi:https://doi.org/10.1044/2022_LSHSS-22-00071

Cleland

Scobbie

. Acquisition of new speech motor plans via articulatory visual biofeedback. In: Fuchs

Cleland

Rochet-Cappelan

, eds. Speech perception and production: learning and memory. Peter Lang; 2018:139-159.

Lee

. Electropalatography. In: Manual of clinical phonetics. Routledge; 2021:339-355.

Lee

ASY

Law

Gibbon

. Electropalatography for articulation disorders associated with cleft palate. Cochrane Database Syst Rev. 2009(3):1-22. doi:https://doi.org/10.1002/14651858.CD006854.pub2

Michi

K-I

Suzuki

Yamashita

Imai

. Visual training and correction of articulation disorders by use of dynamic palatography. J Speech Hear Disord. 1986;51(3):226-238. doi:https://doi.org/10.1044/jshd.5103.226

Gibbon

. Abnormal patterns of tongue-palate contact in the speech of individuals with cleft palate. Clin Linguist Phon. 2004;18(4–5):285-311. doi:https://doi.org/10.1080/02699200410001663362

Gibbon

Crampin

. An electropalatographic investigation of middorsum palatal stops in an adult with repaired cleft palate. Cleft Palate Craniofac J. 2001;38(2):96-105. doi:https://doi.org/10.1597/1545-1569_2001_038_0096_aeiomp_2.0.co_2

Cleland

Preston

. Biofeedback interventions. In: Williams

McLeod

McCauley

, eds. Interventions for speech sound disorders in children. Pearson; 2021:573-599:chap 20.

Allen

Clunie

Slinger

, et al. Utility of ultrasound in the assessment of swallowing and laryngeal function: A rapid review and critical appraisal of the literature. Int J Lang Commun Disord. 2021;56(1):174-204. doi:https://doi.org/10.1111/1460-6984.12584

10.

Morris

Wooding

Grant

. The answer is 17 years, what is the question: Understanding time lags in translational research. J R Soc Med. 2011;104(12):510-520. doi:https://doi.org/10.1258/jrsm.2011.110180

11.

Cleland

. Ultrasound tongue imaging. In: Manual of clinical phonetics. Routledge; 2021:399-416.

12.

Al Ani

. Systematic review of deep learning models in ultrasound tongue imaging for the detection of speech disorders. TechRxiv; 2023.

13.

Sugden

Lloyd

Lam

Cleland

. Systematic review of ultrasound visual biofeedback in intervention for speech sound disorders. Int J Lang Commun Disord. 2019;54(5):705-728. doi:https://doi.org/10.1111/1460-6984.12478

14.

Cleland

Crampin

Campbell

Dokovova

. Protocol for SonoSpeech cleft pilot: A mixed-methods pilot randomized control trial of ultrasound visual biofeedback versus standard intervention for children with cleft lip and palate. Pilot Feasibility Stud. 2022;8(1):93. doi:https://doi.org/10.1186/s40814-022-01051-x

15.

McAllister

Preston

Hitchcock

Hill

. Protocol for correcting residual errors with spectral, ultrasound, traditional speech therapy randomized controlled trial (C-RESULTS RCT). BMC Pediatr. 2020;20(1):66. doi:https://doi.org/10.1186/s12887-020-1941-5

16.

Allen

Cleland

Smith

. An initial framework for use of ultrasound by speech and language therapists in the UK: Scope of practice, education and governance. Ultrasound. 2023;31(2):92-103. doi:https://doi.org/10.1177/1742271x221122562

17.

Cleland

Wrench

Lloyd

Sugden

. ULTRAX2020: ultrasound technology for optimising the treatment of speech disorders : clinicians’ resource manual. Glasgow: University of Strathclyde; 2018: 87.

18.

Lawson

Cleland

Stuart-Smith

. STAR teaching: a speech therapy animation and imaging resource. University of Glasgow; 2023. Accessed 28th July 2023, https://seeingspeech.ac.uk/speechstar/

19.

Trost

. Articulatory additions to the classical description of the speech of persons with cleft palate. Cleft Palate J. 1981;18(3):193-203.

20.

Rebernik

Jacobi

Jonkers

Noiray

Wieling

. A review of data collection practices using electromagnetic articulography. Lab Phonol. 2021;12(1):1-42.

21.

Bressmann

Radovanovic

Kulkarni

Klaiman

Fisher

. An ultrasonographic investigation of cleft-type compensatory articulations of voiceless velar stops. Clin Linguist Phon. 2011;25(11–12):1028-1033. doi:https://doi.org/10.3109/02699206.2011.599472

22.

Cleland

Lloyd

Campbell

, et al. The impact of real-time articulatory information on phonetic transcription: Ultrasound-aided transcription in cleft lip and palate speech. Folia Phoniatr Logop. 2020;72(2):120-130. doi:https://doi.org/10.1159/000499753

23.

Bressmann

Flowers

Wong

Irish

. Coronal view ultrasound imaging of movement in different segments of the tongue during paced recital: Findings from four normal speakers and a speaker with partial glossectomy. Clin Linguist Phon. 2010;24(8):589-601. doi:https://doi.org/10.3109/02699201003687309

24.

Zhu

Zhou

Wang

Jiang

Shi

. Ultrasonic imaging investigation of tongue movement patterns of cleft-related lateralized and palatalized misarticulation. J Craniofac Surg. 2022;33(4):e421-e426. doi:https://doi.org/10.1097/scs.0000000000008366

25.

Lulich

Pearson

. Three-/four-dimensional ultrasound technology in speech research. Perspect ASHA Spec Interest Groups. 2019;4(4):733-747. doi:https://doi.org/10.1044/2019_PERS-SIG19-2019-0001

26.

Bressmann

Radovanovic

Harper

Klaiman

Fisher

Kulkarni

. Production of two nasal sounds by speakers with cleft palate. Cleft Palate Craniofac J. 2018;55(6):876-882. doi:https://doi.org/10.1597/16-096

27.

Kochetov

. Research methods in articulatory phonetics I: Introduction and studying oral gestures. Lang Linguist Compass. 2020;14(4):e12368. doi:https://doi.org/10.1111/lnc3.12368

28.

Pucher

Klingler

Luttenberger

Spreafico

. Accuracy, recording interference, and articulatory quality of headsets for ultrasound recordings. Speech Commun. 2020;123(1):83-97. doi:https://doi.org/10.1016/j.specom.2020.07.001

29.

Noiray

Ries

Tiede

Rubertus

Laporte

Ménard

. Recording and analyzing kinematic data in children and adults with SOLLAR: Sonographic & optical linguo-labial articulation recording system. 2020.

30.

Al-hammuri

Gebali

Thirumarai Chelvan

Kanan

. Tongue contour tracking and segmentation in lingual ultrasound for speech recognition: A review. Diagnostics. 2022;12(11):2811.

31.

Ribeiro

Cleland

Eshky

Richmond

Renals

. Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors. Speech Commun. 2021;128(1):24-34. doi:https://doi.org/10.1016/j.specom.2021.02.001

32.

Roxburgh

Cleland

Scobbie

Wood

. Quantifying changes in ultrasound tongue-shape pre- and post-intervention in speakers with submucous cleft palate: An illustrative case study. Clin Linguist Phon. 2021;36(2-3):1-19. doi:https://doi.org/10.1080/02699206.2021.1973566

33.

Cleland

Dokovova

Crampin

Campbell

. An ultrasound investigation of tongue dorsum raising in children with cleft palate ± cleft lip. Cleft Palate Craniofac J. 2023;0(0):10556656231158965. doi:https://doi.org/10.1177/10556656231158965

34.

Zharkova

. Using ultrasound to quantify tongue shape and movement characteristics. Cleft Palate Craniofac J. 2013;50(1):76-81. doi:https://doi.org/10.1597/11-196

35.

Cleland

Scobbie James

Roxburgh

Heyde

Wrench

. Enabling new articulatory gestures in children with persistent speech sound disorders using ultrasound visual biofeedback. J Speech Lang Hear Res. 2019;62(2):229-246. doi:https://doi.org/10.1044/2018_JSLHR-S-17-0360

36.

Roxburgh

Scobbie

Cleland

. Articulation therapy for children with cleft palate using visual articulatory models and ultrasound biofeedback. International Phonetic Association; 2015.

37.

Parks

. The effectiveness of ultrasound biofeedback therapy in children with repaired cleft palate. 2018.