Abstract
The present paper proposes Random Kitchen Sink based music/speech classification. The temporal and spectral features such as spectral centroid, Spectral roll-off, spectral flux, Mel-frequency cepstral coefficients, entropy, and Zero-crossing rate are extracted from the signals. In order to show the competence of the proposed approach, experimental evaluations and comparisons are performed. Even though both speech and music signals differ in their production mechanisms, those share many common characteristics such as a common spectrum of frequency and are comparatively non-stationary which makes the classification difficult. The proposed approach explicitly maps the data to a feature space where it is linearly separable. The evaluation results shows that the proposed approach provides competing scores with the methods in the available literature.
Get full access to this article
View all access options for this article.
