Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs

31 October 2020
Wenhao Yang
Xiang Li
Guangzeng Xie
Zhihua Zhang
arXiv: 2011.00213

Papers citing "Finding the Near Optimal Policy via Adaptive Reduced Regularization in MDPs"

1 paper shown
Softmax Policy Gradient Methods Can Take Exponential Time to Converge
Gen Li
Yuting Wei
Yuejie Chi
Yuxin Chen
22 Feb 2021