Deep Learning–Based Emotion Classification Models for Chinese and Korean OST Music
DOI:
https://doi.org/10.56979/1002/2026/1244
Keywords:
Music Emotion Recognition (MER), Deep Learning, Chinese OST, Korean OST, PMEmo, EMOPIA, Attention Mechanism
Abstract
Music Emotion Recognition (MER) has advanced significantly with deep learning; however, existing models tend to exhibit a cultural bias and perform poorly when recognizing emotion in non-Western musical structures. This paper proposes a deep learning framework designed specifically for emotion classification in Chinese and Korean Original Soundtracks (OSTs), which feature distinctive tonal dynamics and high emotional variance. We propose a Dual-Stream Convolutional Recurrent Neural Network (CRNN) with self-attention that captures both the spectral-spatial characteristics and the temporal melodic developments commonly found in Asian cinematic music. To validate the model, we use two region-specific datasets: PMEmo (Chinese popular music) and EMOPIA (Korean/Asian piano OSTs). Experimental results show that the proposed architecture achieves an accuracy of 88.4% and an F1-score of 0.87, outperforming baseline models (ResNet-50 and a standard LSTM) by a 5.2% margin. The findings confirm that culturally aware training data is vital for accurate affective computing in the music domain.
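The abstract does not give implementation details, but a dual-stream CRNN with self-attention of the kind described could be sketched as follows. All layer sizes, the four-class output, and the fusion strategy (a CNN stream for spectral-spatial features in parallel with a BiGRU stream for temporal development, pooled by attention and concatenated) are illustrative assumptions, not the paper's reported configuration.

```python
# Hypothetical sketch of a dual-stream CRNN with self-attention pooling;
# all dimensions and the fusion design are assumptions for illustration.
import torch
import torch.nn as nn

class DualStreamCRNN(nn.Module):
    def __init__(self, n_mels=128, n_classes=4):
        super().__init__()
        # Spectral stream: 2-D CNN over the mel-spectrogram.
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),            # -> (batch, 32, 1, 1)
        )
        # Temporal stream: bidirectional GRU over spectrogram frames.
        self.rnn = nn.GRU(n_mels, 64, batch_first=True, bidirectional=True)
        # Self-attention pooling over the GRU time steps.
        self.attn = nn.Linear(128, 1)
        self.fc = nn.Linear(32 + 128, n_classes)

    def forward(self, x):                        # x: (batch, 1, n_mels, time)
        spec = self.cnn(x).flatten(1)            # (batch, 32)
        frames = x.squeeze(1).transpose(1, 2)    # (batch, time, n_mels)
        h, _ = self.rnn(frames)                  # (batch, time, 128)
        w = torch.softmax(self.attn(h), dim=1)   # attention weights over time
        temp = (w * h).sum(dim=1)                # attention-weighted pooling
        return self.fc(torch.cat([spec, temp], dim=1))   # emotion logits

logits = DualStreamCRNN()(torch.randn(2, 1, 128, 100))   # -> shape (2, 4)
```

The parallel-stream fusion here is one plausible reading of "dual-stream"; a stacked CNN-into-RNN variant would also fit the CRNN label.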
License
This is an Open Access article published by the Research Center of Computing & Biomedical Informatics (RCBI), Lahore, Pakistan, under a CC BY 4.0 International License.



