A Dialogical Emotion Decoder For Speech Emotion Recognition In Spoken Dialog

Sung-Lin Yeh, Yun-Shao Lin, Chi-Chun Lee

Length: 08:24

04 May 2020

Developing a robust speech emotion recognition (SER) system for human dialog is important for advancing conversational agent design. In this paper, we propose a novel inference algorithm, dialogical emotion decoding (DED), that treats a dialog as a sequence and consecutively decodes the emotion state of each utterance over time with a given recognition engine. The decoder is trained by incorporating intra- and inter-speaker emotion influences within a conversation. Our approach achieves 70.1% in four-class emotion recognition on the IEMOCAP database, which is 3% above the state-of-the-art model. The evaluation is further conducted on a multi-party interaction database, MELD, which shows a similar effect. The proposed DED is in essence a conversational emotion rescoring decoder that can also be flexibly combined with different SER engines.
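
To make the idea of conversational rescoring concrete, here is a minimal Python sketch. It is not the DED algorithm from the paper: it is a simplified greedy decoder, written under stated assumptions, that combines the per-utterance posteriors of any SER engine with two hand-set "sticky" priors, one for the same speaker's previous emotion (intra-speaker) and one for the most recent emotion of another speaker (inter-speaker). The function names, the label set ordering, the prior values (`intra_stay`, `inter_stay`), and the example posteriors are all hypothetical and chosen only for illustration.

```python
import math

# Four emotion classes, as in a four-class IEMOCAP setup (label names are assumed).
EMOTIONS = ["angry", "happy", "neutral", "sad"]


def sticky_prior(stay):
    """Hypothetical transition prior: keep the previous emotion with probability `stay`."""
    move = (1.0 - stay) / (len(EMOTIONS) - 1)
    return {a: {b: (stay if a == b else move) for b in EMOTIONS} for a in EMOTIONS}


def rescore_dialog(utterances, intra_stay=0.7, inter_stay=0.4):
    """Greedily decode one emotion per utterance, in dialog order.

    utterances: list of (speaker_id, posterior) pairs, where posterior maps
    each emotion to the probability produced by an utterance-level SER engine.
    """
    intra = sticky_prior(intra_stay)  # influence of the same speaker's last emotion
    inter = sticky_prior(inter_stay)  # influence of the other speaker's last emotion
    last_by_speaker = {}
    decoded = []  # (speaker, emotion) pairs decoded so far
    for speaker, posterior in utterances:
        prev_self = last_by_speaker.get(speaker)
        # Most recent decoded emotion from a different speaker, if any.
        prev_other = next((e for s, e in reversed(decoded) if s != speaker), None)
        best, best_score = None, -math.inf
        for emotion in EMOTIONS:
            # Log-domain combination of the SER posterior and the two priors.
            score = math.log(max(posterior[emotion], 1e-12))
            if prev_self is not None:
                score += math.log(intra[prev_self][emotion])
            if prev_other is not None:
                score += math.log(inter[prev_other][emotion])
            if score > best_score:
                best, best_score = emotion, score
        last_by_speaker[speaker] = best
        decoded.append((speaker, best))
    return [emotion for _, emotion in decoded]


# Made-up posteriors for a three-utterance, two-speaker dialog.
dialog = [
    ("A", {"angry": 0.60, "happy": 0.10, "neutral": 0.20, "sad": 0.10}),
    ("B", {"angry": 0.30, "happy": 0.10, "neutral": 0.40, "sad": 0.20}),
    ("A", {"angry": 0.35, "happy": 0.05, "neutral": 0.40, "sad": 0.20}),
]
print(rescore_dialog(dialog))
```

In this toy dialog the second and third utterances would be labeled neutral if each were classified in isolation, but the conversational priors pull both toward the angry state established by the first utterance. Because the decoder only consumes posteriors, the underlying SER engine can be swapped without changing the rescoring step.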

Tags: sps conference, icassp 2020 virtual conference, May 2020, icassp 2020
