Title: FakeOut: Leveraging Out-of-domain Self-supervision for Multi-modal Video Deepfake Detection

Abstract: Video synthesis methods have improved rapidly in recent years, allowing easy creation of synthetic humans. This poses a problem, especially in the era of social media, as synthetic videos of speaking humans can be used to spread misinformation in a convincing manner. Thus, there is a pressing need for accurate and robust deepfake detection methods that can detect forgery techniques not seen during training. In this work, we explore whether this can be done by leveraging a multi-modal, out-of-domain backbone trained in a self-supervised manner and adapted to the video deepfake domain. We propose FakeOut, a novel approach that relies on multi-modal data throughout both the pre-training phase and the adaptation phase. We demonstrate the efficacy and robustness of FakeOut in detecting various types of deepfakes, especially manipulations that were not seen during training. Our method achieves state-of-the-art results in cross-dataset generalization on audio-visual datasets. This study shows that, perhaps surprisingly, training on out-of-domain videos (i.e., videos not specifically featuring speaking humans) can lead to better deepfake detection systems. Code is available on GitHub.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2212.00773 [cs.CV]
  (or arXiv:2212.00773v2 [cs.CV] for this version)

Submission history

From: Gil Knafo
[v1] Thu, 1 Dec 2022 18:56:31 GMT (1650kb,D)
[v2] Wed, 7 Feb 2024 22:55:29 GMT (1144kb,D)
