Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/90180

TítuloPolicy gradients using variational quantum circuits
Autor(es)Sequeira, André
Santos, Luís Paulo
Barbosa, L. S.
Palavras-chaveQuantum machine learning
Reinforcement learning
Policy gradients
Quantum control
DataAbr-2023
EditoraSpringer
RevistaQuantum Machine Intelligence
CitaçãoSequeira, A., Santos, L.P. & Barbosa, L.S. Policy gradients using variational quantum circuits. Quantum Mach. Intell. 5, 18 (2023). https://doi.org/10.1007/s42484-023-00101-8
Resumo(s)Variational quantum circuits are being used as versatile quantum machine learning models. Some empirical results exhibit an advantage in supervised and generative learning tasks. However, when applied to reinforcement learning, less is known. In this work, we considered a variational quantum circuit composed of a low-depth hardware-efficient ansatz as the parameterized policy of a reinforcement learning agent. We show that an epsilon-approximation of the policy gradient can be obtained using a logarithmic number of samples concerning the total number of parameters. We empirically verify that such quantum models behave similarly to typical classical neural networks used in standard benchmarking environments and quantum control, using only a fraction of the parameters. Moreover, we study the barren plateau phenomenon in quantum policy gradients using the Fisher information matrix spectrum.
TipoArtigo
URIhttps://hdl.handle.net/1822/90180
DOI10.1007/s42484-023-00101-8
ISSN2524-4906
e-ISSN2524-4914
Versão da editorahttps://link.springer.com/article/10.1007/s42484-023-00101-8#citeas
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:HASLab - Artigos em revistas internacionais

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
2023-QuantumMachineIntelligence.pdf2,37 MBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID