Disentangling Generative Factors in Natural Language with Discrete Variational Autoencoders

Mercatali, Giangiacomo; Freitas, André



(Submitted on 15 Sep 2021)

Abstract: The ability to learn disentangled representations is a major step toward interpretable NLP systems, as it allows latent linguistic features to be controlled. Most approaches to disentanglement rely on continuous variables, both for images and text. We argue that, although suitable for image datasets, continuous variables may not be ideal for modeling features of textual data, because most generative factors in text are discrete. We propose a Variational Autoencoder-based method that models language features as discrete variables and encourages independence between variables in order to learn disentangled representations. The proposed model outperforms continuous and discrete baselines on several qualitative and quantitative benchmarks for disentanglement, as well as on a downstream text style transfer application.

Comments:	Findings of EMNLP 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.07169 [cs.CL]
	(or arXiv:2109.07169v1 [cs.CL] for this version)

Submission history

From: Giangiacomo Mercatali
[v1] Wed, 15 Sep 2021 09:10:05 GMT (193kb,D)
