Mathematics > Optimization and Control
Title: A Small Gain Analysis of Single Timescale Actor Critic
(Submitted on 4 Mar 2022 (v1), last revised 25 May 2023 (this version, v4))
Abstract: We consider a version of actor-critic that uses proportional step-sizes and performs only one critic update, with a single sample from the stationary distribution, per actor step. We analyze this method using the small-gain theorem. Specifically, we prove that this method can be used to find a stationary point, and that its sample complexity of $O\left(\mu^{-2} \epsilon^{-2}\right)$ for finding an $\epsilon$-approximate stationary point improves the state of the art for actor-critic methods, where $\mu$ is the condition number associated with the critic.
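As a rough illustration of the update pattern the abstract describes (one critic update from a single stationary-distribution sample per actor step, with the actor step-size proportional to the critic step-size), here is a minimal Python sketch. Everything concrete in it is an assumption made for illustration and is not taken from the paper: the two-state toy MDP (`P`, `R`), the discount factor `GAMMA`, the tabular softmax policy, the tabular TD(0) critic, and the parameter names `beta` and `ratio`.

```python
import math
import random

# Hypothetical toy MDP (an assumption, not from the paper): 2 states, 2 actions.
# P[s][a] is the probability of moving to state 1; R[s][a] is the reward.
P = [[0.1, 0.8], [0.3, 0.9]]
R = [[0.0, 1.0], [0.5, 2.0]]
GAMMA = 0.9  # assumed discount factor


def policy(theta, s):
    """Softmax policy over the two actions in state s."""
    m = max(theta[s])
    e = [math.exp(t - m) for t in theta[s]]
    z = sum(e)
    return [x / z for x in e]


def stationary(theta):
    """Stationary distribution of the 2-state chain induced by the policy."""
    p01 = sum(pa * P[0][a] for a, pa in enumerate(policy(theta, 0)))
    p10 = sum(pa * (1.0 - P[1][a]) for a, pa in enumerate(policy(theta, 1)))
    return [p10 / (p01 + p10), p01 / (p01 + p10)]


def single_timescale_actor_critic(steps=20000, beta=0.05, ratio=0.1, seed=0):
    rng = random.Random(seed)
    theta = [[0.0, 0.0], [0.0, 0.0]]  # actor parameters (softmax logits)
    v = [0.0, 0.0]                    # critic parameters (tabular values)
    alpha = ratio * beta              # actor step-size proportional to critic's
    for _ in range(steps):
        # Draw a single state from the stationary distribution of the
        # current policy, then one action and one transition.
        d = stationary(theta)
        s = 0 if rng.random() < d[0] else 1
        pi = policy(theta, s)
        a = 0 if rng.random() < pi[0] else 1
        s_next = 1 if rng.random() < P[s][a] else 0
        # TD error computed from that one sample.
        delta = R[s][a] + GAMMA * v[s_next] - v[s]
        # Exactly one critic update per actor step.
        v[s] += beta * delta
        # Actor update: TD error times the softmax score function.
        for b in range(2):
            grad_log = (1.0 if b == a else 0.0) - pi[b]
            theta[s][b] += alpha * delta * grad_log
    return theta, v


if __name__ == "__main__":
    theta, v = single_timescale_actor_critic()
    print("policy in state 0:", policy(theta, 0))
    print("policy in state 1:", policy(theta, 1))
    print("critic values:", v)
```

The point of the sketch is that both step-sizes are constants differing only by the fixed factor `ratio`, so there is no inner loop that runs the critic to convergence and no separation of timescales between the two updates.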
Submission history
From: Alexander Olshevsky
[v1] Fri, 4 Mar 2022 22:20:34 GMT (2478kb)
[v2] Tue, 8 Mar 2022 18:08:21 GMT (79kb)
[v3] Thu, 17 Nov 2022 15:11:22 GMT (79kb)
[v4] Thu, 25 May 2023 17:59:20 GMT (80kb)