Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation

Tegho, Christopher; Budzianowski, Paweł; Gašić, Milica

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1711

Statistics > Machine Learning

Title: Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation

Authors: Christopher Tegho, Paweł Budzianowski, Milica Gašić

(Submitted on 30 Nov 2017)

Abstract: In statistical dialogue management, the dialogue manager learns a policy that maps a belief state to an action for the system to perform. Efficient exploration is key to successful policy optimisation. Current deep reinforcement learning methods are very promising but rely on epsilon-greedy exploration, thus subjecting the user to a random choice of action during learning. Alternative approaches such as Gaussian Process SARSA (GPSARSA) estimate uncertainties and are sample efficient, leading to better user experience, but on the expense of a greater computational complexity. This paper examines approaches to extract uncertainty estimates from deep Q-networks (DQN) in the context of dialogue management. We perform an extensive benchmark of deep Bayesian methods to extract uncertainty estimates, namely Bayes-By-Backprop, dropout, its concrete variation, bootstrapped ensemble and alpha-divergences, combining it with DQN algorithm.

Comments:	Accepted at the Bayesian Deep Learning Workshop, 31st Conference on Neural Information Processing Systems (NIPS 2017)
Subjects:	Machine Learning (stat.ML); Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1711.11486 [stat.ML]
	(or arXiv:1711.11486v1 [stat.ML] for this version)

Submission history

From: Paweł Budzianowski [view email]
[v1] Thu, 30 Nov 2017 16:09:02 GMT (101kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1711.11486

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation

Submission history