Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: Open Loop Execution of Tree-Search Algorithms, extended version
(Submitted on 3 May 2018 (v1), last revised 12 Feb 2019 (this version, v2))
Abstract: In the context of tree-search stochastic planning algorithms where a generative model is available, we consider on-line planning algorithms building trees in order to recommend an action. We investigate the question of avoiding re-planning in subsequent decision steps by directly using sub-trees as action recommender. Firstly, we propose a method for open loop control via a new algorithm taking the decision of re-planning or not at each time step based on an analysis of the statistics of the sub-tree. Secondly, we show that the probability of selecting a suboptimal action at any depth of the tree can be upper bounded and converges towards zero. Moreover, this upper bound decays in a logarithmic way between subsequent depths. This leads to a distinction between node-wise optimality and state-wise optimality. Finally, we empirically demonstrate that our method achieves a compromise between loss of performance and computational gain.
Submission history
From: Erwan Lecarpentier [view email][v1] Thu, 3 May 2018 15:20:10 GMT (325kb,D)
[v2] Tue, 12 Feb 2019 21:42:21 GMT (325kb,D)
Link back to: arXiv, form interface, contact.