15
0

Quantum Fisher-Preconditioned Reinforcement Learning: From Single-Qubit Control to Rayleigh-Fading Link Adaptation

Main:4 Pages
3 Figures
Bibliography:1 Pages
1 Tables
Abstract

In this letter, we propose Quantum-Preconditioned Policy Gradient (QPPG), a natural gradient-based algorithm for link adaptation that whitens policy updates using the full inverse quantum Fisher information with Tikhonov regularization. QPPG bridges classical and quantum geometry, achieving stable learning even under noise. Evaluated on classical and quantum environments, including noisy single-qubit Gym tasks and Rayleigh-fading channels, QPPG converges 4 times faster than REINFORCE and sustains a 1 dB gain under uncertainty. It reaches a 90 percent return in one hundred episodes with high noise robustness, showcasing the advantages of full QFI-based preconditioning for scalable quantum reinforcement learning.

View on arXiv
@article{giwa2025_2506.15753,
  title={ Quantum Fisher-Preconditioned Reinforcement Learning: From Single-Qubit Control to Rayleigh-Fading Link Adaptation },
  author={ Oluwaseyi Giwa and Muhammad Ahmed Mohsin and Muhammad Ali Jamshed },
  journal={arXiv preprint arXiv:2506.15753},
  year={ 2025 }
}
Comments on this paper