45
1

Reinforcement Learning-based Sequential Route Recommendation for System-Optimal Traffic Assignment

Main:6 Pages
7 Figures
Bibliography:1 Pages
5 Tables
Abstract

Modern navigation systems and shared mobility platforms increasingly rely on personalized route recommendations to improve individual travel experience and operational efficiency. However, a key question remains: can such sequential, personalized routing decisions collectively lead to system-optimal (SO) traffic assignment? This paper addresses this question by proposing a learning-based framework that reformulates the static SO traffic assignment problem as a single-agent deep reinforcement learning (RL) task. A central agent sequentially recommends routes to travelers as origin-destination (OD) demands arrive, to minimize total system travel time. To enhance learning efficiency and solution quality, we develop an MSA-guided deep Q-learning algorithm that integrates the iterative structure of traditional traffic assignment methods into the RL training process. The proposed approach is evaluated on both the Braess and Ortuzar-Willumsen (OW) networks. Results show that the RL agent converges to the theoretical SO solution in the Braess network and achieves only a 0.35% deviation in the OW network. Further ablation studies demonstrate that the route action set's design significantly impacts convergence speed and final performance, with SO-informed route sets leading to faster learning and better outcomes. This work provides a theoretically grounded and practically relevant approach to bridging individual routing behavior with system-level efficiency through learning-based sequential assignment.

View on arXiv
@article{wang2025_2505.20889,
  title={ Reinforcement Learning-based Sequential Route Recommendation for System-Optimal Traffic Assignment },
  author={ Leizhen Wang and Peibo Duan and Cheng Lyu and Zhenliang Ma },
  journal={arXiv preprint arXiv:2505.20889},
  year={ 2025 }
}
Comments on this paper