We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.SY

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Systems and Control

Title: Scalable Synthesis of Verified Controllers in Deep Reinforcement Learning

Abstract: There has been significant recent interest in devising verification techniques for learning-enabled controllers (LECs) that manage safety-critical systems. Given the opacity and lack of interpretability of the neural policies that govern the behavior of such controllers, many existing approaches enforce safety properties through shield, a dynamic monitoring-and-repairing mechanism that ensures a LEC does not emit actions that would violate desired safety conditions. These methods, however, have been shown to have significant scalability limitations because verification costs grow as problem dimensionality and objective complexity increase. In this paper, we propose a new automated verification pipeline capable of synthesizing high-quality safe controllers even when the problem domain involves hundreds of dimensions, or when the desired objective involves stochastic perturbations, liveness considerations, and other complex non-functional properties. Our key insight involves separating safety verification from neural controller training, and using pre-computed verified safety shields to constrain the training process. Experimental results over a range of high-dimensional benchmarks demonstrate the effectiveness of our approach in a range of stochastic linear time-invariant and time-variant systems.
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG)
Cite as: arXiv:2104.10219 [eess.SY]
  (or arXiv:2104.10219v3 [eess.SY] for this version)

Submission history

From: Zikang Xiong [view email]
[v1] Tue, 20 Apr 2021 19:30:29 GMT (1172kb,D)
[v2] Fri, 29 Oct 2021 02:48:04 GMT (1629kb,D)
[v3] Mon, 10 Oct 2022 18:11:05 GMT (655kb,D)

Link back to: arXiv, form interface, contact.