Title: Deep neural network approximations for the stable manifolds of the Hamilton-Jacobi equations

Authors: Guoyuan Chen
Abstract: Like the Riccati equation in the control of linear systems, the Hamilton-Jacobi-Bellman (HJB) equation plays a fundamental role in the optimal control of nonlinear systems. For infinite-horizon control problems, the optimal control can be represented by the stable manifold of the characteristic Hamiltonian system of the HJB equation. In this paper, we study the semiglobal approximation of the stable manifold by deep neural networks (NNs). Our contribution is twofold. First, from the mathematical point of view, we prove that if an approximation is sufficiently close to the exact stable manifold of the HJB equation, then the control derived from this approximation stabilizes the system and is nearly optimal. Second, based on this theoretical result, we propose a deep learning approach to approximate the stable manifold and compute the optimal feedback control numerically. Our algorithm is based on the geometric features of the stable manifold and relies on adaptive data generation by finding random trajectories within the stable manifold. To achieve this, we solve two-point boundary value problems (BVPs) locally near the equilibrium and extend the local solutions using initial value problems (IVPs) for the characteristic Hamiltonian system. We randomly choose a number of samples along each trajectory and adaptively select additional samples near points with large errors from the previous round of training. Our algorithm is essentially causality-free, so it has the potential to apply to a wide range of high-dimensional nonlinear systems. We demonstrate the effectiveness of our method through two examples: stabilizing the Reaction Wheel Pendulums and controlling the parabolic Allen-Cahn equation.
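The link between the Riccati equation and the stable manifold can be illustrated in the linear-quadratic special case. The sketch below is not the paper's algorithm. It assumes dynamics x' = Ax + Bu and cost (1/2)∫(xᵀQx + uᵀRu)dt, for which the characteristic Hamiltonian system is linear and its stable manifold is the subspace p = Px, with P the stabilizing Riccati solution. The function names, the double-integrator test system, and the RK4 step count are all illustrative assumptions. The second function mimics, in this linear setting only, the idea of extending a local solution by an IVP integrated in reversed time.

```python
import numpy as np

def stable_manifold_lq(A, B, Q, R):
    """Return P such that p = P x is the stable manifold of the LQ Hamiltonian system.

    Assumed convention: H(x, p) = p^T A x - (1/2) p^T G p + (1/2) x^T Q x,
    with G = B R^{-1} B^T, so that x' = A x - G p and p' = -Q x - A^T p.
    """
    n = A.shape[0]
    G = B @ np.linalg.solve(R, B.T)
    ham = np.block([[A, -G], [-Q, -A.T]])
    eigvals, eigvecs = np.linalg.eig(ham)
    stable = eigvecs[:, eigvals.real < 0]      # basis of the stable subspace
    X, Y = stable[:n, :], stable[n:, :]
    P = np.real(Y @ np.linalg.inv(X))          # graph representation p = P x
    return 0.5 * (P + P.T), G                  # symmetrize against round-off

def extend_backward(x0, p0, A, G, Q, T=2.0, steps=200):
    """Integrate the characteristic system in reversed time with classical RK4."""
    n = len(x0)
    ham = np.block([[A, -G], [-Q, -A.T]])
    z = np.concatenate([x0, p0])
    h = T / steps
    f = lambda v: -(ham @ v)                   # minus sign: reversed time
    traj = [z.copy()]
    for _ in range(steps):
        k1 = f(z)
        k2 = f(z + 0.5 * h * k1)
        k3 = f(z + 0.5 * h * k2)
        k4 = f(z + h * k3)
        z = z + (h / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)
        traj.append(z.copy())
    traj = np.array(traj)
    return traj[:, :n], traj[:, n:]

# Illustrative system (an assumption, not from the paper): a double integrator.
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])

P, G = stable_manifold_lq(A, B, Q, R)
riccati_residual = A.T @ P + P @ A - P @ G @ P + Q

# Start close to the equilibrium on the manifold and extend outward.
x0 = 1e-2 * np.array([1.0, -0.5])
xs, ps = extend_backward(x0, P @ x0, A, G, Q)

# Feedback control along the trajectory: u = -R^{-1} B^T p.
us = -np.linalg.solve(R, B.T @ ps.T).T
print("max Riccati residual:", np.abs(riccati_residual).max())
print("max deviation from p = P x:", np.abs(ps - xs @ P).max())
```

In reversed time the stable manifold becomes the expanding direction, so deviations from it are damped rather than amplified; this is one reason backward integration from a neighborhood of the equilibrium is a natural way to generate points on the manifold. For a nonlinear system the manifold is no longer a subspace, and the sampled pairs (x, p) would instead serve as training data for an approximator.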
Comments: The theorem is revised into a more compact form. The loss function is modified to fix the theoretical result. A high-dimensional example is added.
Subjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
Cite as: arXiv:2007.15350 [math.OC]
  (or arXiv:2007.15350v4 [math.OC] for this version)

Submission history

From: Guoyuan Chen
[v1] Thu, 30 Jul 2020 09:58:38 GMT (62kb,D)
[v2] Sun, 20 Sep 2020 07:21:22 GMT (158kb,D)
[v3] Fri, 8 Oct 2021 02:27:04 GMT (281kb,D)
[v4] Sat, 17 Jun 2023 00:37:34 GMT (608kb,D)