SABLAS: Learning Safe Control for Black-box Dynamical Systems

Qin, Zengyi; Sun, Dawei; Fan, Chuchu

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2201

Computer Science > Machine Learning

Title: SABLAS: Learning Safe Control for Black-box Dynamical Systems

Authors: Zengyi Qin, Dawei Sun, Chuchu Fan

(Submitted on 6 Jan 2022 (v1), last revised 9 Jan 2022 (this version, v2))

Abstract: Control certificates based on barrier functions have been a powerful tool to generate probably safe control policies for dynamical systems. However, existing methods based on barrier certificates are normally for white-box systems with differentiable dynamics, which makes them inapplicable to many practical applications where the system is a black-box and cannot be accurately modeled. On the other side, model-free reinforcement learning (RL) methods for black-box systems suffer from lack of safety guarantees and low sampling efficiency. In this paper, we propose a novel method that can learn safe control policies and barrier certificates for black-box dynamical systems, without requiring for an accurate system model. Our method re-designs the loss function to back-propagate gradient to the control policy even when the black-box dynamical system is non-differentiable, and we show that the safety certificates hold on the black-box system. Empirical results in simulation show that our method can significantly improve the performance of the learned policies by achieving nearly 100% safety and goal reaching rates using much fewer training samples, compared to state-of-the-art black-box safe control methods. Our learned agents can also generalize to unseen scenarios while keeping the original performance. The source code can be found at this https URL

Comments:	IEEE Robotics and Automation Letters, 2022
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
Cite as:	arXiv:2201.01918 [cs.LG]
	(or arXiv:2201.01918v2 [cs.LG] for this version)

Submission history

From: Zengyi Qin [view email]
[v1] Thu, 6 Jan 2022 04:39:44 GMT (22578kb,D)
[v2] Sun, 9 Jan 2022 04:28:26 GMT (22578kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2201.01918

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: SABLAS: Learning Safe Control for Black-box Dynamical Systems

Submission history