We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Mathematics > Optimization and Control

Title: On the Optimization Landscape of Dynamic Output Feedback Linear Quadratic Control

Abstract: The optimization landscape of optimal control problems plays an important role in the convergence of many policy gradient methods. Unlike state-feedback Linear Quadratic Regulator (LQR), static output-feedback policies are typically insufficient to achieve good closed-loop control performance. We investigate the optimization landscape of linear quadratic control using dynamic output feedback policies, denoted as dynamic LQR (dLQR) in this paper. We first show that the dLQR cost varies with similarity transformations. We then derive an explicit form of the optimal similarity transformation for a given observable stabilizing controller. We further characterize the unique observable stationary point of dLQR. This provides an optimality certificate for policy gradient methods under mild assumptions. Finally, we discuss the differences and connections between dLQR and the canonical linear quadratic Gaussian (LQG) control. These results shed light on designing policy gradient algorithms for decision-making problems with partially observed information.
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
Cite as: arXiv:2201.09598 [math.OC]
  (or arXiv:2201.09598v2 [math.OC] for this version)

Submission history

From: Jingliang Duan [view email]
[v1] Mon, 24 Jan 2022 11:14:07 GMT (3366kb,D)
[v2] Sat, 29 Jan 2022 06:50:17 GMT (3366kb,D)

Link back to: arXiv, form interface, contact.