Abstract
The ability to distinguish spontaneous from non-spontaneous speech can prove helpful, such as in forensic evidence situations, sorting voice-mail responses from voice-mail menus, and automatic segmentation of spontaneous responses from prepared questions. The latter situation occurs when trying to create a database of spontaneous data from data of a speaker responding spontaneously to prepared prompts. This paper outlines and compares three methods for automatically classifying spontaneous and non- spontaneous speech, and presents the experimental results of the performance of all three methods, evaluated on high quality simulated data. All three methods are based on an analysis of the probability distributions of prosodic features extracted from speech signals.
Keywords
Get full access to this article
View all access options for this article.
