
Cost-Function-Dependent Barren Plateaus in Shallow Quantum Neural Networks

Abstract

Variational quantum algorithms (VQAs) optimize the parameters $\boldsymbol{\theta}$ of a quantum neural network $V(\boldsymbol{\theta})$ to minimize a cost function $C$. While VQAs may enable practical applications of noisy quantum computers, they are nevertheless heuristic methods with unproven scaling. Here, we rigorously prove two results, assuming $V(\boldsymbol{\theta})$ is a hardware-efficient ansatz composed of blocks forming local 2-designs. Our first result states that defining $C$ in terms of global observables leads to an exponentially vanishing gradient (i.e., a barren plateau) even when $V(\boldsymbol{\theta})$ is shallow. This implies that several VQAs in the literature must revise their proposed cost functions. On the other hand, our second result states that defining $C$ with local observables leads to at worst a polynomially vanishing gradient, so long as the depth of $V(\boldsymbol{\theta})$ is $\mathcal{O}(\log n)$. Taken together, our results establish a connection between locality and trainability. Finally, we illustrate these ideas with large-scale simulations, up to 100 qubits, of a particular VQA known as quantum autoencoders.
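To make the global-versus-local distinction concrete, below is a minimal NumPy sketch, not the paper's exact construction: the entangling layers here are fixed CZ chains rather than blocks forming local 2-designs, and gradients are estimated by finite differences rather than analytically. It contrasts a global cost $C_G = 1 - |\langle 0|^{\otimes n}|\psi(\boldsymbol{\theta})\rangle|^2$ with a local cost $C_L$ built from an average of single-qubit projectors, and estimates the variance of one partial derivative over random parameter draws. All function names are illustrative.

```python
import numpy as np

# Statevector toy model of a hardware-efficient ansatz: layers of
# single-qubit RY rotations followed by a chain of CZ entangling gates.

def apply_single(state, gate, qubit, n):
    """Apply a 2x2 gate to one qubit of an n-qubit statevector."""
    psi = state.reshape([2] * n)
    psi = np.moveaxis(psi, qubit, 0)
    psi = np.tensordot(gate, psi, axes=([1], [0]))
    return np.moveaxis(psi, 0, qubit).reshape(-1)

def ry(theta):
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

def apply_cz(state, q1, q2, n):
    """CZ is diagonal: negate amplitudes where both qubits are |1>."""
    psi = state.reshape([2] * n).copy()
    idx = [slice(None)] * n
    idx[q1], idx[q2] = 1, 1
    psi[tuple(idx)] *= -1
    return psi.reshape(-1)

def ansatz_state(thetas, n, layers):
    """|psi(theta)> for `layers` rounds of RY rotations + CZ chain."""
    state = np.zeros(2 ** n)
    state[0] = 1.0
    t = iter(thetas)
    for _ in range(layers):
        for q in range(n):
            state = apply_single(state, ry(next(t)), q, n)
        for q in range(n - 1):
            state = apply_cz(state, q, q + 1, n)
    return state

def global_cost(state, n):
    """C_G = 1 - |<0...0|psi>|^2 (a single projector on all n qubits)."""
    return 1.0 - abs(state[0]) ** 2

def local_cost(state, n):
    """C_L = 1 - (1/n) sum_j P(qubit j = 0) (average of 1-qubit projectors)."""
    probs = (np.abs(state) ** 2).reshape([2] * n)
    p0 = sum(np.take(probs, 0, axis=j).sum() for j in range(n))
    return 1.0 - p0 / n

def grad_variance(cost_fn, n, layers=2, samples=200, eps=1e-4, seed=0):
    """Variance of dC/d(theta_0) over random parameter draws,
    estimated with central finite differences."""
    rng = np.random.default_rng(seed)
    grads = []
    for _ in range(samples):
        th = rng.uniform(0, 2 * np.pi, size=layers * n)
        tp, tm = th.copy(), th.copy()
        tp[0] += eps
        tm[0] -= eps
        grads.append((cost_fn(ansatz_state(tp, n, layers), n)
                      - cost_fn(ansatz_state(tm, n, layers), n)) / (2 * eps))
    return np.var(grads)

# The global cost's gradient variance should shrink rapidly with n,
# while the local cost's stays comparatively flat.
for n in (2, 4, 6, 8, 10):
    print(f"n={n:2d}  Var[dC_G]={grad_variance(global_cost, n):.2e}"
          f"  Var[dC_L]={grad_variance(local_cost, n):.2e}")
```

On this toy circuit the printed variances typically exhibit the qualitative gap the theorems predict: the global-cost gradient concentrates toward zero as $n$ grows, while the local-cost gradient decays far more slowly.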
