Fader Networks: Manipulating Images by Sliding Attributes

Lample, Guillaume; Zeghidour, Neil; Usunier, Nicolas; Bordes, Antoine; Denoyer, Ludovic; Ranzato, Marc'Aurelio

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 1706

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Fader Networks: Manipulating Images by Sliding Attributes

Authors: Guillaume Lample, Neil Zeghidour, Nicolas Usunier, Antoine Bordes, Ludovic Denoyer, Marc'Aurelio Ranzato

(Submitted on 1 Jun 2017 (v1), last revised 28 Jan 2018 (this version, v2))

Abstract: This paper introduces a new encoder-decoder architecture that is trained to reconstruct images by disentangling the salient information of the image and the values of attributes directly in the latent space. As a result, after training, our model can generate different realistic versions of an input image by varying the attribute values. By using continuous attribute values, we can choose how much a specific attribute is perceivable in the generated image. This property could allow for applications where users can modify an image using sliding knobs, like faders on a mixing console, to change the facial expression of a portrait, or to update the color of some objects. Compared to the state-of-the-art which mostly relies on training adversarial networks in pixel space by altering attribute values at train time, our approach results in much simpler training schemes and nicely scales to multiple attributes. We present evidence that our model can significantly change the perceived value of the attributes while preserving the naturalness of images.

Comments:	NIPS 2017
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1706.00409 [cs.CV]
	(or arXiv:1706.00409v2 [cs.CV] for this version)

Submission history

From: Guillaume Lample [view email]
[v1] Thu, 1 Jun 2017 17:48:24 GMT (8462kb,D)
[v2] Sun, 28 Jan 2018 16:12:14 GMT (8462kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1706.00409

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Fader Networks: Manipulating Images by Sliding Attributes

Submission history