Iterative Facial Image Inpainting Based on an Encoder-Generator Architecture

Dogan, Yahya; Keles, Hacer Yalim

Full-text links:

Download:

Current browse context:

eess.IV

< prev | next >

new | recent | 2101

Electrical Engineering and Systems Science > Image and Video Processing

Title: Iterative Facial Image Inpainting Based on an Encoder-Generator Architecture

Authors: Yahya Dogan, Hacer Yalim Keles

(Submitted on 18 Jan 2021 (v1), last revised 13 Feb 2022 (this version, v2))

Abstract: Facial image inpainting is a challenging problem as it requires generating new pixels that include semantic information for masked key components in a face, e.g., eyes and nose. Recently, remarkable methods have been proposed in this field. Most of these approaches use encoder-decoder architectures and have different limitations such as allowing unique results for a given image and a particular mask. Alternatively, some optimization-based approaches generate promising results using different masks with generator networks. However, these approaches are computationally more expensive. In this paper, we propose an efficient solution to the facial image inpainting problem using the Cyclic Reverse Generator (CRG) architecture, which provides an encoder-generator model. We use the encoder to embed a given image to the generator space and incrementally inpaint the masked regions until a plausible image is generated; we trained a discriminator model to assess the quality of the generated images during the iterations and determine the convergence. After the generation process, for the post-processing, we utilize a Unet model that we trained specifically for this task to remedy the artifacts close to the mask boundaries. We empirically observed that only a few iterations are sufficient to generate realistic images with the proposed model. Since the models are not trained for particular mask types, our method allows applying sketch-based inpaintings, using a variety of mask types, and producing multiple and diverse results. We compared our method with the state-of-the-art models both quantitatively and qualitatively, and observed that our method can compete with the other models in all mask types; it is particularly better in images where larger masks are utilized. Our code, dataset and models are available at: this https URL facial image inpainting.

Comments:	This paper is the preprint of the accepted manuscript in Neural Computing and Applications Journal
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2101.07036 [eess.IV]
	(or arXiv:2101.07036v2 [eess.IV] for this version)

Submission history

From: Hacer Yalim Keles [view email]
[v1] Mon, 18 Jan 2021 12:19:58 GMT (22220kb,D)
[v2] Sun, 13 Feb 2022 11:11:59 GMT (29501kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2101.07036

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Image and Video Processing

Title: Iterative Facial Image Inpainting Based on an Encoder-Generator Architecture

Submission history