Title: Neural Moving Horizon Estimation for Robust Flight Control

Abstract: Estimating and reacting to disturbances is crucial for robust flight control of quadrotors. Existing estimators typically require significant tuning for a specific flight scenario or training with extensive ground-truth disturbance data to achieve satisfactory performance. In this paper, we propose a neural moving horizon estimator (NeuroMHE) that can automatically tune its key parameters modeled by a neural network and adapt to different flight scenarios. We achieve this by deriving the analytical gradients of the MHE estimates with respect to the MHE weighting matrices, which enables a seamless embedding of the MHE as a learnable layer into the neural network for highly effective learning. Interestingly, we show that the gradients can be computed efficiently using a Kalman filter in a recursive form. Moreover, we develop a model-based policy gradient algorithm to train NeuroMHE directly from the quadrotor trajectory tracking error without needing the ground-truth disturbance data. The effectiveness of NeuroMHE is verified extensively via both simulations and physical experiments on quadrotors in various challenging flights. Notably, NeuroMHE outperforms a state-of-the-art neural network-based estimator, reducing force estimation errors by up to 76.7%, while using a portable neural network that has only 7.7% of the learnable parameters of the latter. The proposed method is general and can be applied to robust adaptive control of other robotic systems.
Comments: This paper (not the final version) has been accepted for publication in the IEEE Transactions on Robotics
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
DOI: 10.1109/TRO.2023.3331064
Cite as: arXiv:2206.10397 [cs.RO]
  (or arXiv:2206.10397v13 [cs.RO] for this version)

Submission history

From: Bingheng Wang
[v1] Tue, 21 Jun 2022 13:43:24 GMT (6824kb,D)
[v2] Wed, 22 Jun 2022 14:37:56 GMT (6824kb,D)
[v3] Thu, 23 Jun 2022 04:31:33 GMT (6824kb,D)
[v4] Mon, 27 Jun 2022 13:10:36 GMT (6832kb,D)
[v5] Wed, 29 Jun 2022 08:01:17 GMT (6832kb,D)
[v6] Fri, 1 Jul 2022 14:23:56 GMT (6832kb,D)
[v7] Thu, 7 Jul 2022 09:06:28 GMT (6810kb,D)
[v8] Fri, 8 Jul 2022 10:06:57 GMT (6810kb,D)
[v9] Mon, 11 Jul 2022 17:15:41 GMT (6811kb,D)
[v10] Mon, 11 Sep 2023 16:15:00 GMT (16983kb,D)
[v11] Mon, 18 Sep 2023 13:10:07 GMT (16766kb,D)
[v12] Tue, 10 Oct 2023 00:38:54 GMT (16766kb,D)
[v13] Tue, 14 Nov 2023 13:04:02 GMT (16859kb,D)
