Title: Separation of Learning and Control for Cyber-Physical Systems

Abstract: Most cyber-physical systems (CPS) encounter a large volume of dynamic data that is added to the system gradually in real time rather than all together in advance. Therefore, neither traditional supervised (or unsupervised) learning nor typical model-based control approaches can effectively facilitate optimal solutions with performance guarantees. In this article, we provide a theoretical framework that yields optimal control strategies at the intersection of control theory and learning. In the proposed framework, we use the actual CPS, i.e., the "true" CPS that we seek to optimally control online, in parallel with a model of the CPS that we have available. We institute an information state, which is the conditional joint probability distribution of the states of the model and the actual CPS given all data available up until each instant of time. We use this information state along with the CPS model to derive offline separated control strategies. Since the strategies are derived offline, the state of the actual CPS is not known, i.e., the model cannot capture the dynamics of the actual CPS due to the complexity of the system, and thus the optimal strategy of the model is parameterized with respect to the state of the actual CPS. However, the control strategy and the process of estimating the information state are separated. Therefore, we can learn the information state of the system online while we operate the model and the actual CPS. We show that after the information state becomes known online through learning, the separated control strategy of the model derived offline is optimal for the actual CPS. We illustrate the proposed framework in a dynamic system consisting of two subsystems with a delayed sharing information structure.
Comments: 16 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2101.10992
Subjects: Optimization and Control (math.OC)
Cite as: arXiv:2107.06379 [math.OC]
  (or arXiv:2107.06379v2 [math.OC] for this version)

Submission history

From: Andreas Malikopoulos
[v1] Tue, 13 Jul 2021 20:37:19 GMT (4330kb,D)
[v2] Thu, 3 Feb 2022 22:00:44 GMT (5169kb,D)
[v3] Tue, 24 May 2022 20:02:46 GMT (4351kb,D)