We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor

Abstract: This paper proposes a video editor based on OpenShot with several state-of-the-art facial video editing algorithms as added functionalities. Our editor provides an easy-to-use interface to apply modern lip-syncing algorithms interactively. Apart from lip-syncing, the editor also uses audio and facial re-enactment to generate expressive talking faces. The manual control improves the overall experience of video editing without missing out on the benefits of modern synthetic video generation algorithms. This control enables us to lip-sync complex dubbed movie scenes, interviews, television shows, and other visual content. Furthermore, our editor provides features that automatically translate lectures from spoken content, lip-sync of the professor, and background content like slides. While doing so, we also tackle the critical aspect of synchronizing background content with the translated speech. We qualitatively evaluate the usefulness of the proposed editor by conducting human evaluations. Our evaluations show a clear improvement in the efficiency of using human editors and an improved video generation quality. We attach demo videos with the supplementary material clearly explaining the tool and also showcasing multiple results.
Comments: 9 pages, 7 figures, accepted in ICVGIP 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV)
DOI: 10.1145/3490035.3490284
Cite as: arXiv:2110.08580 [cs.CV]
  (or arXiv:2110.08580v1 [cs.CV] for this version)

Submission history

From: Anchit Gupta [view email]
[v1] Sat, 16 Oct 2021 14:19:12 GMT (3185kb,D)

Link back to: arXiv, form interface, contact.