References & Citations
Computer Science > Artificial Intelligence
Title: Hierarchical Width-Based Planning and Learning
(Submitted on 15 Jan 2021 (v1), last revised 1 Sep 2021 (this version, v3))
Abstract: Width-based search methods have demonstrated state-of-the-art performance in a wide range of testbeds, from classical planning problems to image-based simulators such as Atari games. These methods scale independently of the size of the state-space, but exponentially in the problem width. In practice, running the algorithm with a width larger than 1 is computationally intractable, prohibiting IW from solving higher width problems. In this paper, we present a hierarchical algorithm that plans at two levels of abstraction. A high-level planner uses abstract features that are incrementally discovered from low-level pruning decisions. We illustrate this algorithm in classical planning PDDL domains as well as in pixel-based simulator domains. In classical planning, we show how IW(1) at two levels of abstraction can solve problems of width 2. For pixel-based domains, we show how in combination with a learned policy and a learned value function, the proposed hierarchical IW can outperform current flat IW-based planners in Atari games with sparse rewards.
Submission history
From: Miquel Junyent [view email][v1] Fri, 15 Jan 2021 15:37:46 GMT (2562kb,D)
[v2] Tue, 23 Mar 2021 15:42:37 GMT (2197kb,D)
[v3] Wed, 1 Sep 2021 09:21:22 GMT (2194kb,D)
Link back to: arXiv, form interface, contact.