We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.AI

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Artificial Intelligence

Title: Hierarchical Actor-Critic

Abstract: The ability to learn at different resolutions in time may help overcome one of the main challenges in deep reinforcement learning -- sample efficiency. Hierarchical agents that operate at different levels of temporal abstraction can learn tasks more quickly because they can divide the work of learning behaviors among multiple policies and can also explore the environment at a higher level. In this paper, we present a novel approach to hierarchical reinforcement learning called Hierarchical Actor-Critic (HAC) that enables agents to learn to break down problems involving continuous action spaces into simpler subproblems belonging to different time scales. HAC has two key advantages over most existing hierarchical learning methods: (i) the potential for faster learning as agents learn short policies at each level of the hierarchy and (ii) an end-to-end approach. We demonstrate that HAC significantly accelerates learning in a series of tasks that require behavior over a relatively long time horizon and involve sparse rewards.
Comments: Changes include (i) more thorough explanation of how temporal abstraction is used and (ii) new results
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
Cite as: arXiv:1712.00948 [cs.AI]
  (or arXiv:1712.00948v2 [cs.AI] for this version)

Submission history

From: Andrew Levy [view email]
[v1] Mon, 4 Dec 2017 08:18:08 GMT (7444kb,D)
[v2] Tue, 27 Feb 2018 16:01:40 GMT (2159kb,D)
[v3] Wed, 28 Feb 2018 17:45:42 GMT (4505kb,D)
[v4] Fri, 1 Mar 2019 18:21:33 GMT (2016kb,D)
[v5] Tue, 3 Sep 2019 21:05:21 GMT (2286kb,D)

Link back to: arXiv, form interface, contact.