Title: Video Transformer for Deepfake Detection with Incremental Learning

Abstract: Deepfake face forgeries are widespread on the internet, raising severe societal concerns. In this paper, we propose a novel video transformer with incremental learning for detecting deepfake videos. To better align the input face images, we use a 3D face reconstruction method to generate a UV texture map from a single input face image. The aligned face image also provides pose, eye-blink, and mouth-movement information that cannot be perceived in the UV texture image, so we use both the face images and their UV texture maps to extract image features. We present an incremental learning strategy that fine-tunes the proposed model on a smaller amount of data and achieves better deepfake detection performance. Comprehensive experiments on various public deepfake datasets demonstrate that the proposed video transformer model with incremental learning achieves state-of-the-art performance on the deepfake video detection task, with enhanced feature learning from the sequential data.
Comments: Accepted at ACM International Conference on Multimedia, October 20 to 24, 2021, Virtual Event, China
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2108.05307 [cs.CV]
  (or arXiv:2108.05307v1 [cs.CV] for this version)

Submission history

From: Sohail Ahmed Khan
[v1] Wed, 11 Aug 2021 16:22:56 GMT (3383kb)
