A COMMONSENSE KNOWLEDGE ENHANCED NETWORK WITH RETROSPECTIVE LOSS FOR EMOTION RECOGNITION IN SPOKEN DIALOG
Yunhe Xie, Chengjie Sun, Zhenzhou Ji
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 00:05:17
The recent surges in the open conversational data caused Emotion Recognition in Spoken Dialog (ERSD) to gain much attention. However, the existing ERSD datasets' scale limits the model's complete reasoning. Moreover, the artificial dialogue agent is ideally able to reference past dialogue experiences. This paper proposes a Commonsense Knowledge Enhanced Network with a retrospective loss, namely CKE-Net, to hierarchically perform dialog modeling, external knowledge integration, and historical state retrospect. Specifically, we first adopt a transformer-based encoder to model context in multi-view by elaborating different mask matrices. Then, the graph attention network is used to introduce commonsense knowledge, which benefits the complex emotional reasoning. Finally, a retrospective loss is added to utilize the model's prior experience during training. Experiments on IEMOCAP and MELD datasets demonstrate that every designed module is consistently beneficial to the performance. Extensive experimental results show that our model outperforms the state-of-the-art models across the two benchmark datasets.