Action Segmentation On Representations Of Skeleton Sequences Using Transformer Networks
Simon H??ring, Raphael Memmesheimer, Dietrich Paulus
-
SPS
IEEE Members: $11.00
Non-members: $15.00Length: 00:07:31
We propose an approach for action segmentation by representing motions as images. A transformer object detection network is used to segment the sequences from the representation images. We examine different encoding approaches, normalization strategies and skeleton joint orders in an extensive experiment study. Our approach is evaluated on skeleton sequences from the PKU-MMD dataset. We successfully apply transformer networks for action segmentation on skeleton sequences. Our proposed approach achieves high class accuracies while start and end-time estimation of the action segments are subject to further improvement.