Policy Search with High-Dimensional Context Variables

Tangkaratt, Voot; van Hoof, Herke; Parisi, Simone; Neumann, Gerhard; Peters, Jan; Sugiyama, Masashi

(Submitted on 10 Nov 2016)

Abstract: Direct contextual policy search methods learn to improve policy parameters and simultaneously generalize these parameters to different context or task variables. However, learning from high-dimensional context variables, such as camera images, is still a prominent problem in many real-world tasks. A naive application of unsupervised dimensionality reduction methods to the context variables, such as principal component analysis, is insufficient as task-relevant input may be ignored. In this paper, we propose a contextual policy search method in the model-based relative entropy stochastic search framework with integrated dimensionality reduction. We learn a model of the reward that is locally quadratic in both the policy parameters and the context variables. Furthermore, we perform supervised linear dimensionality reduction on the context variables by nuclear norm regularization. The experimental results show that the proposed method outperforms naive dimensionality reduction via principal component analysis and a state-of-the-art contextual policy search method.
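
The abstract names nuclear norm regularization as the mechanism for supervised linear dimensionality reduction but does not say how the resulting problem is optimized. As a rough illustration only, the sketch below fits a simplified bilinear reward model r ≈ θᵀBc with a nuclear norm penalty on B, using proximal gradient descent with singular value thresholding. The bilinear model, the solver, and every name here (`singular_value_threshold`, `fit_low_rank_bilinear`, `lam`, `iters`) are assumptions made for this example, not the paper's model or algorithm.

```python
import numpy as np


def singular_value_threshold(M, tau):
    """Proximal operator of tau * (nuclear norm): soft-threshold singular values."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    s_shrunk = np.maximum(s - tau, 0.0)
    return (U * s_shrunk) @ Vt


def fit_low_rank_bilinear(thetas, contexts, rewards, lam=0.1, iters=500):
    """Hypothetical helper: fit r ~ theta^T B c with a nuclear norm penalty on B.

    Minimizes (1 / 2n) * sum_i (theta_i^T B c_i - r_i)^2 + lam * ||B||_*
    by proximal gradient descent. This is an illustrative stand-in, not the
    locally quadratic model described in the abstract.
    """
    n, d_theta = thetas.shape
    d_c = contexts.shape[1]

    # Step size 1/L, where L is the Lipschitz constant of the smooth part:
    # the largest eigenvalue of (1/n) X^T X, with rows of X = vec(theta c^T).
    X = np.einsum("ni,nj->nij", thetas, contexts).reshape(n, -1)
    step = n / np.linalg.norm(X, 2) ** 2

    B = np.zeros((d_theta, d_c))
    for _ in range(iters):
        pred = np.einsum("ni,ij,nj->n", thetas, B, contexts)
        grad = np.einsum("n,ni,nj->ij", pred - rewards, thetas, contexts) / n
        # Gradient step on the squared error, then shrink singular values.
        B = singular_value_threshold(B - step * grad, step * lam)
    return B


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n, d_theta, d_c = 200, 3, 20  # d_c stands in for a high-dimensional context
    B_true = np.outer(rng.normal(size=d_theta), rng.normal(size=d_c))  # rank 1
    thetas = rng.normal(size=(n, d_theta))
    contexts = rng.normal(size=(n, d_c))
    rewards = np.einsum("ni,ij,nj->n", thetas, B_true, contexts)

    B_hat = fit_low_rank_bilinear(thetas, contexts, rewards, lam=0.05)
    print("singular values of the estimate:", np.linalg.svd(B_hat, compute_uv=False))
```

In this toy setting, singular values that are shrunk to zero correspond to context directions the fitted model ignores, which is the sense in which the penalty performs dimensionality reduction guided by the reward rather than by the variance of the context alone.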

Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:1611.03231 [stat.ML] (or arXiv:1611.03231v1 [stat.ML] for this version)

Submission history

From: Voot Tangkaratt
[v1] Thu, 10 Nov 2016 09:25:12 GMT (749kb)
