Title: Some Simulation Results for Emphatic Temporal-Difference Learning Algorithms
(Submitted on 6 May 2016)
Abstract: This is a companion note to our recent study of the weak convergence properties of constrained emphatic temporal-difference learning (ETD) algorithms from a theoretical perspective. It supplements that analysis with simulation results and illustrates the behavior of some of the ETD algorithms using three example problems.