We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:

References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Machine Learning

Title: Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms?

Abstract: We study how the behavior of deep policy gradient algorithms reflects the conceptual framework motivating their development. We propose a fine-grained analysis of state-of-the-art methods based on key aspects of this framework: gradient estimation, value prediction, optimization landscapes, and trust region enforcement. We find that from this perspective, the behavior of deep policy gradient algorithms often deviates from what their motivating framework would predict. Our analysis suggests first steps towards solidifying the foundations of these algorithms, and in particular indicates that we may need to move beyond the current benchmark-centric evaluation methodology.
Subjects: Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as: arXiv:1811.02553 [cs.LG]
  (or arXiv:1811.02553v2 [cs.LG] for this version)

Submission history

From: Andrew Ilyas [view email]
[v1] Tue, 6 Nov 2018 18:54:21 GMT (2633kb,D)
[v2] Mon, 12 Nov 2018 18:54:30 GMT (4798kb,D)
[v3] Sun, 2 Dec 2018 02:45:35 GMT (4801kb,D)
[v4] Mon, 25 May 2020 16:24:26 GMT (826kb,D)

Link back to: arXiv, form interface, contact.