Efficiency Separation between RL Methods: Model-Free, Model-Based and
Goal-Conditioned

28 September 2023

Raphaël Jungers

Jean-Charles Delvenne

Papers citing "Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned"

| Title | Authors | Tag | | | | Date |
|---|---|---|---|---|---|---|
| Dichotomy of Control: Separating What You Can Control from What You Cannot | Mengjiao Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum | OffRL | 50 | 43 | 0 | 24 Oct 2022 |
| Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning | Adam R. Villaflor, Zheng Huang, Swapnil Pande, John M. Dolan, J. Schneider | OffRL | 60 | 25 | 0 | 21 Jul 2022 |
| Imitating Past Successes can be Very Suboptimal | Benjamin Eysenbach, Soumith Udatha, Sergey Levine, Ruslan Salakhutdinov | OffRL | 52 | 19 | 0 | 07 Jun 2022 |
| Reward-Conditioned Policies | Aviral Kumar, Xue Bin Peng, Sergey Levine | | 57 | 96 | 0 | 31 Dec 2019 |
| Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to Actions | J. Schmidhuber | | 47 | 131 | 0 | 05 Dec 2019 |
| Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning | Vladimir Feinberg, Alvin Wan, Ion Stoica, Michael I. Jordan, Joseph E. Gonzalez, Sergey Levine | OffRL | 56 | 317 | 0 | 28 Feb 2018 |
| Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm | David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, ..., D. Kumaran, T. Graepel, Timothy Lillicrap, Karen Simonyan, Demis Hassabis | | 119 | 1,768 | 0 | 05 Dec 2017 |
| Proximal Policy Optimization Algorithms | John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov | OffRL | 444 | 18,931 | 0 | 20 Jul 2017 |
| Hindsight Experience Replay | Marcin Andrychowicz, Dwight Crow, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Joshua Tobin, Pieter Abbeel, Wojciech Zaremba | OffRL | 239 | 2,322 | 0 | 05 Jul 2017 |
| Thinking Fast and Slow with Deep Learning and Tree Search | Thomas W. Anthony, Zheng Tian, David Barber | | 87 | 395 | 0 | 23 May 2017 |