PAIRWISE ADJACENCY MATRIX ON SPATIAL TEMPORAL GRAPH CONVOLUTION NETWORK FOR SKELETON-BASED TWO-PERSON INTERACTION RECOGNITION
Chao-Lung Yang, Aji Setyoko, Hendrik Tampubolon, Kai-Lung Hua
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 15:00
Spatial-temporal graph convolutional networks (ST-GCN) have achieved outstanding performances on human action recognition, however, it might be less superior on a two-person interaction recognition (TPIR) task due to the relationship of each skeleton is not considered. In this study, we present an improvement of the ST-GCN model that focused on TPIR by employing the pairwise adjacency matrix to capture the relationship of person-person skeletons (ST-GCN-PAM). To validate the effectiveness of the proposed ST-GCN-PAM model on TPIR, experiments were conducted on NTU RGB+D 120. Additionally, the model was also examined on the Kinetics dataset and NTU RGB+D 60. The results show that the proposed ST-GCN-PAM outperforms the-state-of-the-art methods on mutual action of NTU RGB+D 120 by achieving 83.28% (cross-subject) and 88.31% (cross-view) accuracy. The model is also superior to the original ST-GCN on the multi-human action of the Kinetics dataset by achieving 41.68% in Top-1 and 88.91% in Top-5.