Adversarial learning of neural user simulators for dialogue policy optimisation

Keizer, Simon; Dockes, Caroline; Braunschweiler, Norbert; Stoyanchev, Svetlana; Doddipatla, Rama

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2306

Change to browse by:

Computer Science > Computation and Language

Title: Adversarial learning of neural user simulators for dialogue policy optimisation

Authors: Simon Keizer, Caroline Dockes, Norbert Braunschweiler, Svetlana Stoyanchev, Rama Doddipatla

(Submitted on 1 Jun 2023)

Abstract: Reinforcement learning based dialogue policies are typically trained in interaction with a user simulator. To obtain an effective and robust policy, this simulator should generate user behaviour that is both realistic and varied. Current data-driven simulators are trained to accurately model the user behaviour in a dialogue corpus. We propose an alternative method using adversarial learning, with the aim to simulate realistic user behaviour with more variation. We train and evaluate several simulators on a corpus of restaurant search dialogues, and then use them to train dialogue system policies. In policy cross-evaluation experiments we demonstrate that an adversarially trained simulator produces policies with 8.3% higher success rate than those trained with a maximum likelihood simulator. Subjective results from a crowd-sourced dialogue system user evaluation confirm the effectiveness of adversarially training user simulators.

Comments:	UK Speech 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2306.00858 [cs.CL]
	(or arXiv:2306.00858v1 [cs.CL] for this version)

Submission history

From: Simon Keizer [view email]
[v1] Thu, 1 Jun 2023 16:17:16 GMT (126kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2306.00858

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Adversarial learning of neural user simulators for dialogue policy optimisation

Submission history