Factored Contextual Policy Search with Bayesian Optimization

Karkus, Peter; Kupcsik, Andras; Hsu, David; Lee, Wee Sun

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1612

Computer Science > Machine Learning

Title: Factored Contextual Policy Search with Bayesian Optimization

Authors: Peter Karkus, Andras Kupcsik, David Hsu, Wee Sun Lee

(Submitted on 6 Dec 2016 (v1), last revised 28 May 2019 (this version, v2))

Abstract: Scarce data is a major challenge to scaling robot learning to truly complex tasks, as we need to generalize locally learned policies over different "contexts". Bayesian optimization approaches to contextual policy search (CPS) offer data-efficient policy learning that generalize over a context space. We propose to improve data-efficiency by factoring typically considered contexts into two components: target-type contexts that correspond to a desired outcome of the learned behavior, e.g. target position for throwing a ball; and environment type contexts that correspond to some state of the environment, e.g. initial ball position or wind speed. Our key observation is that experience can be directly generalized over target-type contexts. Based on that we introduce Factored Contextual Policy Search with Bayesian Optimization for both passive and active learning settings. Preliminary results show faster policy generalization on a simulated toy problem. A full paper extension is available at arXiv:1904.11761

Comments:	BayesOpt 2016, NeurIPS Workshop. A full paper extension is available at arXiv:1904.11761
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as:	arXiv:1612.01746 [cs.LG]
	(or arXiv:1612.01746v2 [cs.LG] for this version)

Submission history

From: Peter Karkus [view email]
[v1] Tue, 6 Dec 2016 10:51:51 GMT (1033kb,D)
[v2] Tue, 28 May 2019 04:08:29 GMT (1031kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1612.01746

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Factored Contextual Policy Search with Bayesian Optimization

Submission history