Risk Averse Bayesian Reward Learning for Autonomous Navigation from Human Demonstration

Ellis, Christian; Wigness, Maggie; Rogers III, John G.; Lennon, Craig; Fiondella, Lance

(Submitted on 31 Jul 2021)

Abstract: Traditional imitation learning provides a set of methods and algorithms to learn a reward function or policy from expert demonstrations. Learning from demonstration has been shown to be advantageous for navigation tasks as it allows machine learning non-experts to quickly provide the information needed to learn complex traversal behaviors. However, a minimal set of demonstrations is unlikely to capture all relevant information needed to achieve the desired behavior in every possible future operational environment. Due to distributional shift among environments, a robot may encounter features that were rarely or never observed during training, for which the appropriate reward value is uncertain, leading to undesired outcomes. This paper proposes a Bayesian technique that quantifies uncertainty over the weights of a linear reward function, given a dataset of minimal human demonstrations, so that the robot can operate safely in dynamic environments. This uncertainty is then incorporated into a risk averse set of weights used to generate cost maps for planning. Experiments in a 3-D environment with a simulated robot show that our proposed algorithm enables a robot to avoid dangerous terrain completely in two out of three test scenarios and accumulates a lower amount of risk than related approaches in all scenarios without requiring any additional demonstrations.
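
The pipeline outlined in the abstract (posterior over linear reward weights, then risk averse weights, then a cost map) can be illustrated with a minimal sketch. The abstract does not state which risk measure the authors use, so everything below is an assumption made for illustration: the rule "posterior mean minus a multiple of the posterior standard deviation", the function names `risk_averse_weights` and `cost_map`, the `risk_aversion` parameter, the three terrain features, and the synthetic posterior samples. The rule also assumes non-negative features, so that lowering a weight can only lower the reward.

```python
import numpy as np


def risk_averse_weights(weight_samples, risk_aversion=1.0):
    """Collapse posterior samples (n_samples, n_features) into one weight vector.

    Assumed rule, not taken from the paper: per-feature posterior mean
    minus risk_aversion times the per-feature posterior standard deviation.
    """
    mean = weight_samples.mean(axis=0)
    std = weight_samples.std(axis=0)
    return mean - risk_aversion * std


def cost_map(features, weights):
    """Turn a feature grid (H, W, n_features) into a non-negative cost grid (H, W).

    The reward is linear in the features; cost is the gap to the best reward.
    """
    reward = features @ weights
    return reward.max() - reward


# Hypothetical posterior samples over three terrain features.
rng = np.random.default_rng(0)
samples = np.column_stack([
    rng.normal(1.0, 0.05, 1000),  # road: high reward, well observed
    rng.normal(0.2, 0.05, 1000),  # grass: low reward, well observed
    rng.normal(0.2, 1.00, 1000),  # rarely observed terrain: very uncertain
])

w_mean = samples.mean(axis=0)
w_risk = risk_averse_weights(samples, risk_aversion=2.0)

# A 1x3 grid in which each cell contains exactly one terrain type.
grid = np.eye(3).reshape(1, 3, 3)
costs = cost_map(grid, w_risk)
print("mean weights:       ", np.round(w_mean, 2))
print("risk averse weights:", np.round(w_risk, 2))
print("cost map:           ", np.round(costs, 2))
```

Under the posterior mean alone, grass and the rarely observed terrain look about equally attractive. Under the assumed risk averse rule, the uncertain terrain receives a much lower weight and therefore the highest cost, which is the qualitative behavior the abstract describes.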

Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2108.00276 [cs.RO]
	(or arXiv:2108.00276v1 [cs.RO] for this version)

Submission history

From: Christian Ellis
[v1] Sat, 31 Jul 2021 16:08:34 GMT (2897kb,D)
