Mathematics > Optimization and Control
Title: An axiomatic approach to Markov decision processes
(Submitted on 11 Jan 2017 (v1), last revised 22 Nov 2022 (this version, v6))
Abstract: This paper presents an axiomatic approach to finite Markov decision processes where the discount rate is zero. One of the principal difficulties in the undiscounted case is that, even if attention is restricted to stationary policies, a strong overtaking optimal policy need not exist. We provide preference foundations for two criteria that do admit optimal policies: $0$-discount optimality and average overtaking optimality. As a corollary of our results, we obtain conditions on a decision maker's preferences which ensure that an optimal policy exists. These results have implications for disciplines where stochastic dynamic programming problems arise, including automatic control, dynamic games, and economic development.
Submission history
From: Adam Jonsson L
[v1] Wed, 11 Jan 2017 08:05:26 GMT (11kb)
[v2] Tue, 25 Jul 2017 17:49:01 GMT (8kb)
[v3] Tue, 12 Dec 2017 14:50:18 GMT (8kb)
[v4] Mon, 29 Jan 2018 23:29:16 GMT (7kb)
[v5] Tue, 15 Dec 2020 16:18:05 GMT (13kb)
[v6] Tue, 22 Nov 2022 15:59:22 GMT (31kb)