Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems

Pang, Bo; Jiang, Zhong-Ping

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2107

Electrical Engineering and Systems Science > Systems and Control

Title: Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems

Authors: Bo Pang, Zhong-Ping Jiang

(Submitted on 16 Jul 2021 (v1), last revised 5 Dec 2021 (this version, v3))

Abstract: This paper studies the adaptive optimal stationary control of continuous-time linear stochastic systems with both additive and multiplicative noises, using reinforcement learning techniques. Based on policy iteration, a novel off-policy reinforcement learning algorithm, named optimistic least-squares-based policy iteration, is proposed which is able to find iteratively near-optimal policies of the adaptive optimal stationary control problem directly from input/state data without explicitly identifying any system matrices, starting from an initial admissible control policy. The solutions given by the proposed optimistic least-squares-based policy iteration are proved to converge to a small neighborhood of the optimal solution with probability one, under mild conditions. The application of the proposed algorithm to a triple inverted pendulum example validates its feasibility and effectiveness.

Comments:	10 pages, 3 figures
Subjects:	Systems and Control (eess.SY); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2107.07788 [eess.SY]
	(or arXiv:2107.07788v3 [eess.SY] for this version)

Submission history

From: Bo Pang [view email]
[v1] Fri, 16 Jul 2021 09:27:02 GMT (96kb,D)
[v2] Tue, 20 Jul 2021 03:47:32 GMT (99kb,D)
[v3] Sun, 5 Dec 2021 10:07:16 GMT (737kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2107.07788

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Electrical Engineering and Systems Science > Systems and Control

Title: Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems

Submission history