We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.OC

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Optimization and Control

Title: Average cost optimal control under weak ergodicity hypotheses: Relative value iterations

Abstract: We study Markov decision processes with Polish state and action spaces. The action space is state dependent and is not necessarily compact. We first establish the existence of an optimal ergodic occupation measure using only a near-monotone hypothesis on the running cost. Then we study the well-posedness of Bellman equation, or what is commonly known as the average cost optimality equation, under the additional hypothesis of the existence of a small set. We deviate from the usual approach which is based on the vanishing discount method and instead map the problem to an equivalent one for a controlled split chain. We employ a stochastic representation of the Poisson equation to derive the Bellman equation. Next, under suitable assumptions, we establish convergence results for the 'relative value iteration' algorithm which computes the solution of the Bellman equation recursively. In addition, we present some results concerning the stability and asymptotic optimality of the associated rolling horizon policies.
Comments: 32 pages
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
MSC classes: 90C40 (93E20)
Cite as: arXiv:1902.01048 [math.OC]
  (or arXiv:1902.01048v7 [math.OC] for this version)

Submission history

From: Vivek Borkar [view email]
[v1] Mon, 4 Feb 2019 06:38:48 GMT (25kb)
[v2] Mon, 17 Feb 2020 10:19:15 GMT (32kb)
[v3] Fri, 29 Jan 2021 06:08:57 GMT (38kb)
[v4] Wed, 10 Feb 2021 04:10:16 GMT (40kb)
[v5] Fri, 22 Oct 2021 07:30:21 GMT (43kb)
[v6] Thu, 6 Apr 2023 05:46:51 GMT (54kb)
[v7] Mon, 14 Aug 2023 10:04:13 GMT (46kb)

Link back to: arXiv, form interface, contact.