Synchronized Audio-Visual Frames With Fractional Positional Encoding For Transformers in Video-To-Text Translation
Philipp Harzig, Moritz Einfalt, Rainer Lienhart
SPS
IEEE Members: $11.00
Non-members: $15.00
Length: 00:15:13