Quantum Physics

Title: Policy Gradients using Variational Quantum Circuits

Abstract: Variational Quantum Circuits are being used as versatile Quantum Machine Learning models, and some empirical results show an advantage in supervised and generative learning tasks. Less is known about their use in Reinforcement Learning. In this work, we consider a Variational Quantum Circuit composed of a low-depth hardware-efficient ansatz as the parameterized policy of a Reinforcement Learning agent. We show that an $\epsilon$-approximation of the policy gradient can be obtained with a number of samples that is logarithmic in the total number of parameters. We empirically verify that such quantum models perform comparably to, or even outperform, the classical neural networks typically used in standard benchmarking environments and in quantum control, while using only a fraction of the parameters. Moreover, we study the Barren Plateau phenomenon in quantum policy gradients using the spectrum of the Fisher Information Matrix.
Subjects: Quantum Physics (quant-ph); Machine Learning (cs.LG)
Cite as: arXiv:2203.10591 [quant-ph]
  (or arXiv:2203.10591v3 [quant-ph] for this version)

Submission history

From: Andre Sequeira
[v1] Sun, 20 Mar 2022 16:14:49 GMT (7277kb,D)
[v2] Sun, 9 Oct 2022 14:32:00 GMT (25191kb,D)
[v3] Sun, 15 Jan 2023 19:09:48 GMT (3668kb,D)