
High-Dimensional Continuous Control Using Generalized Advantage Estimation
Papers citing "High-Dimensional Continuous Control Using Generalized Advantage Estimation"
50 / 77 papers shown
Title |
---|
What makes math problems hard for reinforcement learning: a case study. Ali Shehper, A. Medina-Mardones, Lucas Fagan, Angus Gruen, Piotr Kucharski, Zhenghan Wang, Sergei Gukov |
Advantage Alignment Algorithms. Juan Agustin Duque, Milad Aghajohari, Tim Cooijmans, Tianyu Zhang, Rameswar Panda, Gauthier Gidel, Aaron Courville |