Don't Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training

Li, Margaret; Roller, Stephen; Kulikov, Ilia; Welleck, Sean; Boureau, Y-Lan; Cho, Kyunghyun; Weston, Jason

(Submitted on 10 Nov 2019 (v1), last revised 6 May 2020 (this version, v2))

Abstract: Generative dialogue models currently suffer from a number of problems which standard maximum likelihood training does not address. They tend to produce generations that (i) rely too much on copying from the context, (ii) contain repetitions within utterances, (iii) overuse frequent words, and (iv) at a deeper level, contain logical flaws. In this work we show how all of these problems can be addressed by extending the recently introduced unlikelihood loss (Welleck et al., 2019) to these cases. We show that appropriate loss functions which regularize generated outputs to match human distributions are effective for the first three issues. For the last important general issue, we show applying unlikelihood to collected data of what a model should not do is effective for improving logical consistency, potentially paving the way to generative models with greater reasoning ability. We demonstrate the efficacy of our approach across several dialogue tasks.
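For readers unfamiliar with the unlikelihood loss the abstract refers to, the following is a minimal sketch of the token-level form from Welleck et al. (2019): the usual negative log-likelihood of the target token, plus a penalty of -log(1 - p(c)) for each "negative candidate" token c that the model should not produce. The function names, the mixing weight `alpha`, and the toy logits are assumptions made for illustration. How the negative candidates are chosen (copied context tokens, repeated n-grams, overused words, contradicting utterances) is the contribution of the paper and is not reproduced here.

```python
import math


def softmax(logits):
    """Convert raw scores into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def token_loss(logits, target, negatives, alpha=1.0):
    """Likelihood plus unlikelihood loss for one decoding step.

    logits    -- model scores over the vocabulary at this step
    target    -- index of the gold next token
    negatives -- indices of tokens to push down (hypothetical candidate set)
    alpha     -- weight on the unlikelihood term (an assumed hyperparameter)
    """
    probs = softmax(logits)
    # Standard maximum likelihood term: raise p(target).
    mle = -math.log(probs[target])
    # Unlikelihood term: lower p(c) for every negative candidate c.
    # The clamp avoids log(0) when a negative token has probability near 1.
    ul = -sum(math.log(max(1.0 - probs[c], 1e-12)) for c in negatives)
    return mle + alpha * ul


# Toy three-token vocabulary: token 0 is the target, token 1 is unwanted.
logits = [2.0, 1.0, 0.0]
print(token_loss(logits, target=0, negatives=[]))   # likelihood term only
print(token_loss(logits, target=0, negatives=[1]))  # with the unlikelihood penalty
```

With an empty negative set the function reduces to ordinary maximum likelihood training, so the penalty only changes the objective on steps where a negative candidate is supplied.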

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1911.03860 [cs.CL]
	(or arXiv:1911.03860v2 [cs.CL] for this version)

Submission history

From: Jason Weston [view email]
[v1] Sun, 10 Nov 2019 05:53:40 GMT (661kb,D)
[v2] Wed, 6 May 2020 14:13:02 GMT (1822kb,D)
