Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control with Scarce Data and Side Information

Djeumou, Franck; Topcu, Ufuk

Full-text links:

Download:

Current browse context:

eess.SY

< prev | next >

new | recent | 2106

Electrical Engineering and Systems Science > Systems and Control

Title: Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control with Scarce Data and Side Information

Authors: Franck Djeumou, Ufuk Topcu

(Submitted on 19 Jun 2021 (v1), last revised 28 Dec 2021 (this version, v4))

Abstract: We develop a learning-based control algorithm for unknown dynamical systems under very severe data limitations. Specifically, the algorithm has access to streaming and noisy data only from a single and ongoing trial. It accomplishes such performance by effectively leveraging various forms of side information on the dynamics to reduce the sample complexity. Such side information typically comes from elementary laws of physics and qualitative properties of the system. More precisely, the algorithm approximately solves an optimal control problem encoding the system's desired behavior. To this end, it constructs and iteratively refines a data-driven differential inclusion that contains the unknown vector field of the dynamics. The differential inclusion, used in an interval Taylor-based method, enables to over-approximate the set of states the system may reach. Theoretically, we establish a bound on the suboptimality of the approximate solution with respect to the optimal control with known dynamics. We show that the longer the trial or the more side information is available, the tighter the bound. Empirically, experiments in a high-fidelity F-16 aircraft simulator and MuJoCo's environments illustrate that, despite the scarcity of data, the algorithm can provide performance comparable to reinforcement learning algorithms trained over millions of environment interactions. Besides, we show that the algorithm outperforms existing techniques combining system identification and model predictive control.

Comments:	Initial submission to L4DC
Subjects:	Systems and Control (eess.SY); Machine Learning (cs.LG); Robotics (cs.RO); Optimization and Control (math.OC)
Cite as:	arXiv:2106.10533 [eess.SY]
	(or arXiv:2106.10533v4 [eess.SY] for this version)

Submission history

From: Franck Djeumou [view email]
[v1] Sat, 19 Jun 2021 17:10:27 GMT (641kb,D)
[v2] Tue, 14 Sep 2021 18:46:54 GMT (5022kb,D)
[v3] Tue, 21 Dec 2021 19:35:48 GMT (4534kb,D)
[v4] Tue, 28 Dec 2021 15:55:45 GMT (4534kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2106.10533

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Systems and Control

Title: Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control with Scarce Data and Side Information

Submission history