Abstract
In the digital music era, music aesthetic classification has become a core problem in music information retrieval and personalized recommendation systems. The diversity and subjectivity of musical aesthetics pose great challenges to traditional classification methods, while the rise of deep learning offers new opportunities in this field. This study proposes an algorithm that fuses a temporal generative adversarial network (Time GAN) with a long short-term memory network (LSTM) to build a music aesthetic classification model that identifies and classifies musical works more accurately, providing a new perspective on the automatic classification of music aesthetics. For the experiments, we selected a music database of 10,000 songs covering styles such as classical, jazz, rock, and pop. Each song was preprocessed and its Mel spectrogram extracted as the input feature. On this basis, we adopted Time GAN to generate additional training samples and so enhance the generalization ability of the model; Time GAN successfully generated 5,000 high-quality music samples, which, together with the raw data, constitute our training set. Comparative experiments show that the Time GAN-LSTM model achieves remarkable results on the music aesthetic classification task: in cross-validation, its classification accuracy reached 89.7%, a significant improvement over the 82.1% of LSTM alone and the 75.4% of traditional machine learning methods.
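The abstract names log-Mel spectrograms as the per-song input features for the LSTM. As a minimal sketch of that preprocessing step, the function below frames a waveform, computes the magnitude STFT, and applies a triangular Mel filterbank; the frame size, hop length, and number of Mel bands are illustrative assumptions, not the paper's actual settings, and a production pipeline would more likely use a library such as librosa.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(sr, n_fft, n_mels):
    # Triangular filters spaced evenly on the Mel scale, from 0 Hz to Nyquist.
    mels = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mels) / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(1, n_mels + 1):
        lo, ctr, hi = bins[i - 1], bins[i], bins[i + 1]
        for k in range(lo, ctr):          # rising edge of the triangle
            if ctr > lo:
                fb[i - 1, k] = (k - lo) / (ctr - lo)
        for k in range(ctr, hi):          # falling edge of the triangle
            if hi > ctr:
                fb[i - 1, k] = (hi - k) / (hi - ctr)
    return fb

def log_mel_spectrogram(y, sr, n_fft=1024, hop=256, n_mels=64):
    # Frame and window the signal, then take the power spectrum per frame.
    n_frames = 1 + (len(y) - n_fft) // hop
    window = np.hanning(n_fft)
    frames = np.stack([y[i * hop:i * hop + n_fft] * window
                       for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    mel = power @ mel_filterbank(sr, n_fft, n_mels).T
    # Log compression; the result is a (time, n_mels) sequence, the shape
    # an LSTM expects as input.
    return np.log(mel + 1e-10)
```

For example, one second of audio at 22,050 Hz with these settings yields a sequence of 83 frames, each a 64-dimensional Mel feature vector, which can be fed to the LSTM one frame per time step.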
