ONE-SHOT MEDICAL ACTION RECOGNITION WITH A CROSS-ATTENTION MECHANISM AND DYNAMIC TIME WARPING
Leiyu Xie (Newcastle University); Yuxing Yang (Newcastle University); Zeyu Fu (University of Exeter); Syed Mohsen Naqvi (Newcastle University)
-
SPS
IEEE Members: $11.00
Non-members: $15.00
In this paper, we address the classification of medical actions with only one single sample by developing a novel one-shot learning framework which contains both cross-attention and dynamic time warping (DTW) modules. To be concrete, we
firstly transform the raw skeleton sequence into the signal-level image representation. We exploit a metric learning approach, which is the prototypical network for the proposed one-shot learning framework and choose the residual network (ResNet18) as the backbone which is widely used in recent years. Cross-attention is applied for guiding the network to focus on the more important joints from each specific action. The cross-attention mechanism that applies between the
support and query set will be adapted for mining and matching the relationships with the human body. Furthermore, a DTW module is introduced to mitigate the temporal information mismatching issue between the actions from the support
and query sets. The experimental results on the NTU RGB+D 120 dataset demonstrate the effectiveness of our proposed approach and the improved performance compared to the baseline approach.