CROSS-CORPUS SPEECH EMOTION RECOGNITION BASED ON FEW-SHOT LEARNING AND DOMAIN ADAPTATION

Youngdo Ahn, Jong Won Shin, Sung Joo Lee

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

Length: 00:08:13

11 May 2022

Within a single speech emotion corpus, deep neural networks have shown decent performance in speech emotion recognition. However, the performance of the emotion recognition based on data-driven learning methods degrades significantly for the cross-corpus scenario. To relieve this issue without any labeled samples from the target domain, we propose a cross-corpus speech emotion recognition based on few-shot learning and unsupervised domain adaptation, which is trained to learn the class (emotion) similarity from the source domain samples adapted to the target domain. In addition, we utilize multiple corpora in training to enhance the robustness of the emotion recognition to the unseen samples. Experiments on emotional speech corpora with three different languages showed that the proposed method outperformed other approaches.

Tags:

null

CROSS-CORPUS SPEECH EMOTION RECOGNITION BASED ON FEW-SHOT LEARNING AND DOMAIN ADAPTATION

Youngdo Ahn, Jong Won Shin, Sung Joo Lee

Value-Added Bundle(s) Including this Product

ICASSP 2022, May 2022 Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

PROGRESS-ICASSP 2022: Introduction by Farokh Atashzar and Nancy F. Chen

PROGRESS-ICASSP 2022: Opening Speech

MULTIMODAL DATA FUSION IN HIGH-DIMENSIONAL HETEROGENEOUS DATASETS VIA GENERATIVE MODELS

Join the IEEE Signal Processing Society