Abstract
Background
Most common forms of dementia, including Alzheimer's disease (AD), are associated with alterations in spoken language.
Objective
This study explores the potential of a speech-based machine learning (ML) approach for estimating cognitive impairment from speech audio recordings.
Methods
We developed an automatic ML pipeline that ingests multimodal inputs of audio and transcribed text, mapping speech and language to domain-specific biomarkers optimized for explainability and predictive power. The resulting features are fed through a multi-stage pipeline to identify efficient classification configurations.
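The abstract does not specify the biomarkers or models used; the following is a minimal, hypothetical sketch of the featurization stage, mapping one multimodal sample (audio plus transcript) to a few named, interpretable features. All feature definitions here are illustrative assumptions, not the paper's actual feature set.

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class SpeechSample:
    """A hypothetical multimodal sample: an amplitude envelope plus its transcript."""
    audio: List[float]
    transcript: str

def acoustic_features(audio: List[float], silence_thresh: float = 0.1) -> Dict[str, float]:
    """Illustrative acoustic biomarkers: pause ratio and mean energy."""
    silent = sum(1 for a in audio if abs(a) < silence_thresh)
    return {
        "pause_ratio": silent / len(audio),
        "mean_energy": sum(abs(a) for a in audio) / len(audio),
    }

def linguistic_features(transcript: str) -> Dict[str, float]:
    """Illustrative linguistic biomarkers: lexical diversity and mean word length."""
    words = transcript.lower().split()
    return {
        "type_token_ratio": len(set(words)) / len(words),
        "mean_word_len": sum(len(w) for w in words) / len(words),
    }

def featurize(sample: SpeechSample) -> Dict[str, float]:
    """Map one multimodal sample to named, interpretable features."""
    return {**acoustic_features(sample.audio),
            **linguistic_features(sample.transcript)}

sample = SpeechSample(audio=[0.0, 0.5, 0.02, 0.4, 0.0, 0.3],
                      transcript="the cat sat on the mat")
feats = featurize(sample)
```

A downstream classifier (the abstract does not name one) would consume `feats`; keeping each feature named and interpretable is what supports the explainability goal stated above.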
Results
We evaluated the system on large real-world datasets, achieving weighted average F1 scores above 90% and 70% for the two-class (AD versus normal controls) and three-class (AD versus mild cognitive impairment [MCI] versus normal controls) classification tasks, respectively. Model performance remained stable across different population characteristics.
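The reported metric, the weighted average F1 score, averages per-class F1 scores weighted by each class's support (its share of true samples). A self-contained sketch of the computation (the class labels in the example are illustrative):

```python
from collections import Counter
from typing import List

def weighted_f1(y_true: List[str], y_pred: List[str]) -> float:
    """Support-weighted average of per-class F1 scores."""
    support = Counter(y_true)
    total = 0.0
    for c in sorted(support):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        # Weight each class's F1 by its share of the true labels.
        total += (support[c] / len(y_true)) * f1
    return total

score = weighted_f1(["AD", "AD", "NC", "NC"], ["AD", "NC", "NC", "NC"])
```

Unlike the unweighted (macro) average, this weighting keeps the score representative when the diagnostic classes are imbalanced, as is typical in clinical cohorts.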
Conclusions
The study introduces a robust, non-invasive method for gauging the cognitive status of AD and MCI patients from speech samples, with the potential to generalize effectively to other diseases and disorders that impair language.
