2309.16291
Cited By
Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned
28 September 2023
Han Bao
Raphaël Jungers
Jean-Charles Delvenne
OffRL
Papers citing "Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned"
10 / 10 papers shown
Title
Dichotomy of Control: Separating What You Can Control from What You Cannot
Mengjiao Yang
Dale Schuurmans
Pieter Abbeel
Ofir Nachum
OffRL
50
43
0
24 Oct 2022
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
Adam R. Villaflor
Zheng Huang
Swapnil Pande
John M. Dolan
J. Schneider
OffRL
60
25
0
21 Jul 2022
Imitating Past Successes can be Very Suboptimal
Benjamin Eysenbach
Soumith Udatha
Sergey Levine
Ruslan Salakhutdinov
OffRL
52
19
0
07 Jun 2022
Reward-Conditioned Policies
Aviral Kumar
Xue Bin Peng
Sergey Levine
57
96
0
31 Dec 2019
Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to Actions
J. Schmidhuber
47
131
0
05 Dec 2019
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
OffRL
56
317
0
28 Feb 2018
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
David Silver
Thomas Hubert
Julian Schrittwieser
Ioannis Antonoglou
Matthew Lai
...
D. Kumaran
T. Graepel
Timothy Lillicrap
Karen Simonyan
Demis Hassabis
119
1,768
0
05 Dec 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
444
18,931
0
20 Jul 2017
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
239
2,322
0
05 Jul 2017
Thinking Fast and Slow with Deep Learning and Tree Search
Thomas W. Anthony
Zheng Tian
David Barber
87
395
0
23 May 2017