Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2309.01448
Cited By
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
4 September 2023
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
OffRL
OnRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance"
26 / 26 papers shown
Title
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
283
899
0
12 Oct 2021
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
114
816
0
12 Jun 2021
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
88
607
0
16 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
131
1,806
0
08 Jun 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
210
1,359
0
15 Apr 2020
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
Benjamin Eysenbach
Xinyang Geng
Sergey Levine
Ruslan Salakhutdinov
OffRL
37
87
0
25 Feb 2020
Reward-Conditioned Policies
Aviral Kumar
Xue Bin Peng
Sergey Levine
57
96
0
31 Dec 2019
Training Agents using Upside-Down Reinforcement Learning
R. Srivastava
Pranav Shyam
Filipe Wall Mutz
Wojciech Ja'skowski
Jürgen Schmidhuber
OffRL
59
126
0
05 Dec 2019
AlgaeDICE: Policy Gradient from Arbitrary Experience
Ofir Nachum
Bo Dai
Ilya Kostrikov
Yinlam Chow
Lihong Li
Dale Schuurmans
OffRL
145
241
0
04 Dec 2019
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
85
684
0
26 Nov 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Ziyi Wang
Che Wang
Yanqiu Wu
George Andriopoulos
OffRL
69
122
0
27 Oct 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
109
1,054
0
03 Jun 2019
Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation
Lu Wang
Wei Zhang
Xiaofeng He
H. Zha
49
263
0
04 Jul 2018
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
95
656
0
01 Jul 2018
Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning
Andy Zeng
Shuran Song
Stefan Welker
Johnny Lee
Alberto Rodriguez
Thomas Funkhouser
SSL
73
568
0
27 Mar 2018
Learning to Reweight Examples for Robust Deep Learning
Mengye Ren
Wenyuan Zeng
Binh Yang
R. Urtasun
OOD
NoLa
139
1,424
0
24 Mar 2018
Deep Reinforcement Learning for Sepsis Treatment
Aniruddh Raghu
Matthieu Komorowski
Imran Ahmed
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
57
172
0
27 Nov 2017
Learning to Compare: Relation Network for Few-Shot Learning
Flood Sung
Yongxin Yang
Li Zhang
Tao Xiang
Philip Torr
Timothy M. Hospedales
269
4,042
0
16 Nov 2017
End-to-end Driving via Conditional Imitation Learning
Felipe Codevilla
Matthias Muller
Antonio M. López
V. Koltun
Alexey Dosovitskiy
123
1,066
0
06 Oct 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
88
783
0
28 Sep 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
E. Todorov
Sergey Levine
133
1,093
0
28 Sep 2017
Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
YuXuan Liu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
98
380
0
11 Jul 2017
Active Bias: Training More Accurate Neural Networks by Emphasizing High Variance Samples
Haw-Shiuan Chang
Erik Learned-Miller
Andrew McCallum
73
352
0
24 Apr 2017
Deep Reinforcement Learning framework for Autonomous Driving
Ahmad El-Sallab
Mohammed Abdou
E. Perot
S. Yogamani
81
969
0
08 Apr 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
806
11,866
0
09 Mar 2017
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
204
5,073
0
05 Jun 2016
1