References & Citations
Mathematics > Optimization and Control
Title: An axiomatic basis for Blackwell optimality
(Submitted on 11 Jan 2017 (v1), revised 12 Dec 2017 (this version, v3), latest version 22 Nov 2022 (v6))
Abstract: In the theory of Markov decision processes (MDPs), a Blackwell optimal policy is a policy that is optimal for every discount factor sufficiently close to one. In this paper I provide an axiomatic basis for Blackwell optimality in discrete-time MDPs with finitely many states and finitely many actions.
Submission history
From: Adam Jonsson L [view email][v1] Wed, 11 Jan 2017 08:05:26 GMT (11kb)
[v2] Tue, 25 Jul 2017 17:49:01 GMT (8kb)
[v3] Tue, 12 Dec 2017 14:50:18 GMT (8kb)
[v4] Mon, 29 Jan 2018 23:29:16 GMT (7kb)
[v5] Tue, 15 Dec 2020 16:18:05 GMT (13kb)
[v6] Tue, 22 Nov 2022 15:59:22 GMT (31kb)
Link back to: arXiv, form interface, contact.