Title: Unbounded Dynamic Programming via the Q-Transform

Abstract: We propose a new approach to solving dynamic decision problems with unbounded rewards based on the transformations used in Q-learning. In our case, the objective of the transform is to convert an unbounded dynamic program into a bounded one. The approach is general enough to handle problems on which existing methods struggle, yet it is simple relative to other techniques and accessible for applied work. We show by example that many common decision problems satisfy our conditions.
Comments: arXiv admin note: text overlap with arXiv:1911.13025
Subjects: Optimization and Control (math.OC); Theoretical Economics (econ.TH)
DOI: 10.1016/j.jmateco.2022.102652
Cite as: arXiv:2012.00219 [math.OC]
  (or arXiv:2012.00219v2 [math.OC] for this version)

Submission history

From: Alexis Akira Toda
[v1] Tue, 1 Dec 2020 02:26:07 GMT (30kb)
[v2] Wed, 17 Mar 2021 19:33:13 GMT (34kb)
