SDTN: SPEAKER DYNAMICS TRACKING NETWORK FOR EMOTION RECOGNITION IN CONVERSATION
Jiawei Chen (South China Agricultural University); Peijie Huang (South China Agricultural University); Guotai Huang ( South China Agricultural University); Qianer Li ( South China Agricultural University); Yuhong Xu ( South China Agricultural University)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
Emotion Recognition in Conversation (ERC) has considerable prospects due to its wide range of applications. Most existing works integrate speaker information statically and capture a relatively consistent atmosphere in conversation. However, these works poorly track the emotional state dynamics of each party in a conversation and focus on emotion consistency. The speakers’ emotional states are independent but influence each other during the conversation. To address the above issues, we propose a Speaker Dynamics Tracking Network (SDTN) for ERC. Specifically, SDTN can dynamically track the local and global speaker states during emotional flow in conversation and capture implicit stimulation of emotional shift. Extensive experiments on MELD and EmoryNLP datasets demonstrate the superiority and effectiveness of our proposed SDTN model, and confirm that every designed module consistently benefits the performance.