Forethought and Hindsight in Credit Assignment

Chelu, Veronica; Precup, Doina; van Hasselt, Hado

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2010

Computer Science > Machine Learning

Title: Forethought and Hindsight in Credit Assignment

Authors: Veronica Chelu, Doina Precup, Hado van Hasselt

(Submitted on 26 Oct 2020)

Abstract: We address the problem of credit assignment in reinforcement learning and explore fundamental questions regarding the way in which an agent can best use additional computation to propagate new information, by planning with internal models of the world to improve its predictions. Particularly, we work to understand the gains and peculiarities of planning employed as forethought via forward models or as hindsight operating with backward models. We establish the relative merits, limitations and complementary properties of both planning mechanisms in carefully constructed scenarios. Further, we investigate the best use of models in planning, primarily focusing on the selection of states in which predictions should be (re)-evaluated. Lastly, we discuss the issue of model estimation and highlight a spectrum of methods that stretch from explicit environment-dynamics predictors to more abstract planner-aware models.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2010.13685 [cs.LG]
	(or arXiv:2010.13685v1 [cs.LG] for this version)

Submission history

From: Veronica Chelu [view email]
[v1] Mon, 26 Oct 2020 16:00:47 GMT (8976kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2010.13685

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Forethought and Hindsight in Credit Assignment

Submission history