Model-free reinforcement learning with noisy actions for automated experimental control in optics

Setting up and controlling optical systems is often a challenging and tedious task. The large number of degrees of freedom required to control mirrors, lenses, or phases of light makes automatic control difficult, especially when the complexity of the system cannot be adequately modeled due to noise or nonlinearities. Here, we show that reinforcement learning (RL) can overcome these challenges when coupling laser light into an optical fiber, using a model-free approach that trains directly on the experiment without pre-training. Using either of the sample-efficient algorithms Soft Actor-Critic (SAC) or Truncated Quantile Critics (TQC), our agent learns to couple the light with 90% efficiency, comparable to a human expert. We demonstrate that direct training on an experiment can replace extensive system modeling. Our result exemplifies RL's potential to tackle problems in optics, paving the way for more complex applications where full noise modeling is not feasible.
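As an illustration of the kind of training setup the abstract describes, below is a minimal sketch (not the authors' code) of how a model-free agent could be trained directly on a fiber-coupling task, here using the TQC implementation from the open-source sb3-contrib library. The FiberCouplingEnv class is a hypothetical stand-in: its toy Gaussian efficiency model replaces the real photodiode readout, and its action and observation spaces, noise magnitudes, and 90% termination threshold are illustrative assumptions, not the paper's actual interface.

import gymnasium as gym
import numpy as np
from sb3_contrib import TQC  # Truncated Quantile Critics


class FiberCouplingEnv(gym.Env):
    """Hypothetical environment: actions nudge mirror actuators, and the
    observation/reward is the measured fiber-coupling efficiency."""

    def __init__(self):
        # Assumed: four degrees of freedom, e.g. tip/tilt on two mirrors.
        self.action_space = gym.spaces.Box(low=-1.0, high=1.0, shape=(4,))
        self.observation_space = gym.spaces.Box(low=0.0, high=1.0, shape=(1,))
        self._misalignment = np.zeros(4)

    def reset(self, *, seed=None, options=None):
        super().reset(seed=seed)
        # Start each episode from a random misalignment.
        self._misalignment = self.np_random.uniform(-0.5, 0.5, size=4)
        return self._efficiency(), {}

    def step(self, action):
        # Actions are noisy in the real experiment; crudely mimic that here.
        self._misalignment += 0.1 * action + self.np_random.normal(0, 0.01, 4)
        obs = self._efficiency()
        reward = float(obs[0])          # reward = coupling efficiency
        terminated = reward > 0.9       # stop once ~90% coupling is reached
        return obs, reward, terminated, False, {}

    def _efficiency(self):
        # Toy Gaussian overlap model standing in for the photodiode signal.
        return np.array([np.exp(-np.sum(self._misalignment**2))],
                        dtype=np.float32)


env = FiberCouplingEnv()
model = TQC("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=10_000)  # learns directly on the (toy) experiment

In the real setup, step would drive the mirror actuators and read back the measured efficiency; swapping TQC for stable-baselines3's SAC requires only changing the import and constructor.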
@article{richtmann2025_2405.15421,
  title={Model-free reinforcement learning with noisy actions for automated experimental control in optics},
  author={Lea Richtmann and Viktoria-S. Schmiesing and Dennis Wilken and Jan Heine and Aaron Tranter and Avishek Anand and Tobias J. Osborne and Michèle Heurs},
  journal={arXiv preprint arXiv:2405.15421},
  year={2025}
}