Free-view Expressive Talking Head Video Editing

Yuantian Huang (University of Tsukuba); Satoshi Iizuka (University of Tsukuba); Kazuhiro Fukui (University of Tsukuba)

DOI

SPS

Members: Free
IEEE Members: $11.00
Non-members: $15.00

08 Jun 2023

We present a novel framework for talking head video editing, allowing users to freely edit head pose, emotion, and eye blink while maintaining audio-visual synchronization. Unlike previous approaches that mainly focus on generating a talking head video, our proposed model is able to edit the talking heads of an input video and restore it to full frames, which supports a broader range of applications. Our proposed framework consists of two parts: a) a reconstruction-based generator that can generate talking heads fitting to the original frame while corresponding to freely controllable attributes, including head pose, emotion, and eye blink. b) a multiple-attribute discriminator that enforces attribute-visual synchronization. We additionally introduce attention modules and perceptual loss to improve the overall generation quality. We compare existing approaches as corroborated by quantitative metrics and qualitative comparisons.

Tags:

Image and video synthesis, rendering, and visualization

Free-view Expressive Talking Head Video Editing

Yuantian Huang (University of Tsukuba); Satoshi Iizuka (University of Tsukuba); Kazuhiro Fukui (University of Tsukuba)

Value-Added Bundle(s) Including this Product

IEEE ICASSP 2023, 4-10 June 2023, Greece. Virtual and In-Person Conference - Presentation Videos Product Bundle

More Like This

SVMV: SPATIOTEMPORAL VARIANCE-SUPERVISED MOTION VOLUME FOR VIDEO FRAME INTERPOLATION

Flow-Guided Deformable Alignment Network with Self-Supervision for Video Inpainting

ACTIVE PERCEPTION SYSTEM FOR ENHANCED VISUAL SIGNAL RECOVERY USING DEEP REINFORCEMENT LEARNING

Join the IEEE Signal Processing Society