Speech-driven facial animation using polynomial fusion of features

Kefalas, Triantafyllos; Vougioukas, Konstantinos; Panagakis, Yannis; Petridis, Stavros; Kossaifi, Jean; Pantic, Maja

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1912

Computer Science > Machine Learning

Title: Speech-driven facial animation using polynomial fusion of features

Authors: Triantafyllos Kefalas, Konstantinos Vougioukas, Yannis Panagakis, Stavros Petridis, Jean Kossaifi, Maja Pantic

(Submitted on 12 Dec 2019 (this version), latest version 19 Feb 2020 (v2))

Abstract: Speech-driven facial animation involves using a speech signal to generate realistic videos of talking faces. Recent deep learning approaches to facial synthesis rely on extracting low-dimensional representations and concatenating them, followed by a decoding step of the concatenated vector. This accounts for only first-order interactions of the features and ignores higher-order interactions. In this paper we propose a polynomial fusion layer that models the joint representation of the encodings by a higher-order polynomial, with the parameters modelled by a tensor decomposition. We demonstrate the the suitability of this approach through experiments on generated videos evaluated on a range of metrics on video quality, audiovisual synchronisation and generation of blinks.

Subjects:	Machine Learning (cs.LG); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
Cite as:	arXiv:1912.05833 [cs.LG]
	(or arXiv:1912.05833v1 [cs.LG] for this version)

Submission history

From: Triantafyllos Kefalas [view email]
[v1] Thu, 12 Dec 2019 08:46:57 GMT (698kb,D)
[v2] Wed, 19 Feb 2020 14:36:16 GMT (694kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1912.05833v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Speech-driven facial animation using polynomial fusion of features

Submission history