Title: Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems

Abstract: This paper studies the adaptive optimal stationary control of continuous-time linear stochastic systems with both additive and multiplicative noise, using reinforcement learning techniques. Building on policy iteration, it proposes a novel off-policy reinforcement learning algorithm, named optimistic least-squares-based policy iteration. Starting from an initial admissible control policy, the algorithm iteratively finds near-optimal policies for the adaptive optimal stationary control problem directly from input/state data, without explicitly identifying any system matrices. Under mild conditions, the solutions it produces are proved to converge to a small neighborhood of the optimal solution with probability one. An application to a triple inverted pendulum example validates the feasibility and effectiveness of the algorithm.
Comments: 10 pages, 3 figures
Subjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as: arXiv:2107.07788 [eess.SY]
  (or arXiv:2107.07788v3 [eess.SY] for this version)
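
The abstract builds on policy iteration started from an admissible (stabilizing) policy. As a rough illustration of that underlying idea only, the sketch below runs classical model-based policy iteration on a scalar, deterministic linear-quadratic problem. It is not the paper's optimistic least-squares-based algorithm: that method is off-policy, works from input/state data, and handles additive and multiplicative noise, none of which is modeled here. The function names and the parameters a, b, q, r, k0 are hypothetical and chosen for illustration.

```python
import math


def policy_iteration_scalar(a, b, q, r, k0, iters=20):
    """Model-based policy iteration for dx/dt = a*x + b*u with cost
    integral of (q*x**2 + r*u**2) dt and linear feedback u = -k*x.

    Assumption (not from the paper): scalar, noise-free, known a and b.
    Returns the list of (p, k) pairs produced at each iteration.
    """
    if a - b * k0 >= 0:
        raise ValueError("initial gain must be stabilizing: a - b*k0 < 0")
    k = k0
    history = []
    for _ in range(iters):
        # Policy evaluation: solve 2*(a - b*k)*p + q + r*k**2 = 0 for p,
        # the scalar Lyapunov equation for the current gain k.
        p = (q + r * k * k) / (2.0 * (b * k - a))
        # Policy improvement: the new gain minimizes the quadratic cost
        # given the value estimate p.
        k = b * p / r
        history.append((p, k))
    return history


def riccati_scalar(a, b, q, r):
    """Positive root of the scalar Riccati equation
    2*a*p - (b**2 / r) * p**2 + q = 0, used as a reference value."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


# Unstable open-loop system (a > 0) with a stabilizing initial gain.
history = policy_iteration_scalar(a=1.0, b=1.0, q=1.0, r=1.0, k0=3.0)
p_final, k_final = history[-1]
p_star = riccati_scalar(a=1.0, b=1.0, q=1.0, r=1.0)
print(p_final, p_star)
```

In this toy setting the value estimates decrease toward the Riccati solution and every intermediate gain remains stabilizing. A data-driven variant would replace the evaluation step, which here uses a and b directly, with an estimate computed from measured trajectories.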

Submission history

From: Bo Pang
[v1] Fri, 16 Jul 2021 09:27:02 GMT (96kb,D)
[v2] Tue, 20 Jul 2021 03:47:32 GMT (99kb,D)
[v3] Sun, 5 Dec 2021 10:07:16 GMT (737kb,D)