Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: Recurrent Neural Network Training with Convex Loss and Regularization Functions by Extended Kalman Filtering
(Submitted on 4 Nov 2021 (v1), last revised 2 Nov 2022 (this version, v3))
Abstract: This paper investigates the use of extended Kalman filtering to train recurrent neural networks with rather general convex loss functions and regularization terms on the network parameters, including $\ell_1$-regularization. We show that the learning method is competitive with respect to stochastic gradient descent in a nonlinear system identification benchmark and in training a linear system with binary outputs. We also explore the use of the algorithm in data-driven nonlinear model predictive control and its relation with disturbance models for offset-free closed-loop tracking.
Submission history
From: Alberto Bemporad [view email][v1] Thu, 4 Nov 2021 07:49:15 GMT (270kb,D)
[v2] Fri, 10 Jun 2022 15:51:35 GMT (909kb,D)
[v3] Wed, 2 Nov 2022 11:05:04 GMT (88kb,D)
Link back to: arXiv, form interface, contact.