Automatic speech recognition of Urdu words using linear discriminant analysis

Abstract

Urdu is amongst the five largest languages of the world and possess a very important role as it shares its vocabulary with languages as Arabic, Persian, Hindi and several other languages of the Indo-Pak. The Automatic Speech Recognition task of Urdu has not been addressed significantly. This paper presents the statistical based classification technique to achieve the task of Automatic Speech Recognition of isolated words in Urdu. The proposed approach is based on calculation of 52 Mel Frequency Cepstral Coefficients for each isolated word. The classification has been achieved with Linear Discriminant Analysis. The successful or incorrect matches have been presented in the Confusion Matrix. As a prototype, the framework has been trained with audio samples of seven speakers including male/female, native/non-native and speakers with different ages. The test set comprises of audio data of three speaker. For each isolated, percentage error has been calculated. It was found that majority of the words are recognized with percentage error less than 33% . Some words suffer 100% error and were referred to be the bad words. This work may provide a baseline for further research on Urdu Automatic Speech Recognition.

Keywords

Urdu automatic speech recognition mel frequency cepstral coefficients linear discriminant analysis

Get full access to this article

View all access options for this article.

References

10.

11.

12.

13.

14.

15.

16.

Center for Language Engineering. (2012) [Online]. http://www.cle.org.pk/

17.

18.

19.

20.

21.

22.

23.

24.

S. Balakrishnama and A. Ganapathiraju. (Accessed: 2012, March) Linear Discriminant Analysis; A Brief Tutorial. [Online]. http://www.music.mcgill.ca/~ich

25.

26.

27.

H. Ali, N. Ahmad, X. Zhou, M. Ali and A. Asghar, Linear Discriminant Analysis Based Approach for Automatic Speech Recognition of Urdu IsolatedWords, in Communication Technologies, Information Security and Sustainable Development, in Springer CCIS series, vol. 414, 2014, pp. 24–34