Abstract
Honking contributes significantly to traffic noise but automotive horns are selected mainly based on sound pressure level (SPL) without considering sound quality (SQ). Horns with the same SPL may be perceived differently because of variation in sound quality parameters. Traditional SQ study relies on jury tests which are time and resource intensive. Present study proposes machine learning (ML) and deep learning (DL) models for horn classification, minimizing the need for jury tests. Previous horn sound classifications were performed using audio features like, psychoacoustics, time, frequency attributes separately measured using single microphone data. As human hearing is a complex phenomenon, consideration of individual features may not capture perception of sound quality. This work explores different feature combinations with ML/DL models for binary or multi-class horn classification. Twenty-two horn sound samples are recorded using Binaural Head Measurement System (BHMS) and then split into segments. After signal augmentation, samples are labeled as ‘pleasant’, ‘unpleasant’ under binary class using subjective rating data of jury tests. Then, audio features like psychoacoustics, time-frequency features, mel spectrogram (MS), mel Frequency cepstral coefficients (MFCCs), and chromagram are calculated. These features are used to obtain a multi-class horn classification dataset using unsupervised clustering. Design of experiments (DOE) strategy is implemented to find the key features. After hyperparameter tuning, effectiveness of all models is evaluated through comparison of Accuracy, F1-score, Precision, Recall, and training time. Significant attention is given for audio augmentation by injecting noise, stretching, pitching, and shifting to original sound samples. Similarly, Synthetic Minority Over-sampling Technique (SMOTE) is applied to address imbalance in sample numbers. Present study reveals that Random Forest (RF) and Convolutional Neural Network (CNN) are the best performing models. However, the CNN model requires high training time. Developed models can be used to classify horn samples at design stage reducing the need for jury tests.
Keywords
Get full access to this article
View all access options for this article.
