ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.01448
  4. Cited By
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with
  Expert Guidance

Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance

4 September 2023
Qisen Yang
Shenzhi Wang
Qihang Zhang
Gao Huang
Shiji Song
    OffRL
    OnRL
ArXivPDFHTML

Papers citing "Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance"

26 / 26 papers shown
Title
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
283
899
0
12 Oct 2021
A Minimalist Approach to Offline Reinforcement Learning
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
114
816
0
12 Jun 2021
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
88
607
0
16 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
131
1,806
0
08 Jun 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
210
1,359
0
15 Apr 2020
Rewriting History with Inverse RL: Hindsight Inference for Policy
  Improvement
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
Benjamin Eysenbach
Xinyang Geng
Sergey Levine
Ruslan Salakhutdinov
OffRL
37
87
0
25 Feb 2020
Reward-Conditioned Policies
Reward-Conditioned Policies
Aviral Kumar
Xue Bin Peng
Sergey Levine
57
96
0
31 Dec 2019
Training Agents using Upside-Down Reinforcement Learning
Training Agents using Upside-Down Reinforcement Learning
R. Srivastava
Pranav Shyam
Filipe Wall Mutz
Wojciech Ja'skowski
Jürgen Schmidhuber
OffRL
59
126
0
05 Dec 2019
AlgaeDICE: Policy Gradient from Arbitrary Experience
AlgaeDICE: Policy Gradient from Arbitrary Experience
Ofir Nachum
Bo Dai
Ilya Kostrikov
Yinlam Chow
Lihong Li
Dale Schuurmans
OffRL
145
241
0
04 Dec 2019
Behavior Regularized Offline Reinforcement Learning
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
85
684
0
26 Nov 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement
  Learning
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Ziyi Wang
Che Wang
Yanqiu Wu
George Andriopoulos
OffRL
69
122
0
27 Oct 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
109
1,054
0
03 Jun 2019
Supervised Reinforcement Learning with Recurrent Neural Network for
  Dynamic Treatment Recommendation
Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation
Lu Wang
Wei Zhang
Xiaofeng He
H. Zha
49
263
0
04 Jul 2018
Learning to Drive in a Day
Learning to Drive in a Day
Alex Kendall
Jeffrey Hawke
David Janz
Przemyslaw Mazur
Daniele Reda
John M. Allen
Vinh-Dieu Lam
Alex Bewley
Amar Shah
95
656
0
01 Jul 2018
Learning Synergies between Pushing and Grasping with Self-supervised
  Deep Reinforcement Learning
Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning
Andy Zeng
Shuran Song
Stefan Welker
Johnny Lee
Alberto Rodriguez
Thomas Funkhouser
SSL
73
568
0
27 Mar 2018
Learning to Reweight Examples for Robust Deep Learning
Learning to Reweight Examples for Robust Deep Learning
Mengye Ren
Wenyuan Zeng
Binh Yang
R. Urtasun
OOD
NoLa
139
1,424
0
24 Mar 2018
Deep Reinforcement Learning for Sepsis Treatment
Deep Reinforcement Learning for Sepsis Treatment
Aniruddh Raghu
Matthieu Komorowski
Imran Ahmed
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
57
172
0
27 Nov 2017
Learning to Compare: Relation Network for Few-Shot Learning
Learning to Compare: Relation Network for Few-Shot Learning
Flood Sung
Yongxin Yang
Li Zhang
Tao Xiang
Philip Torr
Timothy M. Hospedales
269
4,042
0
16 Nov 2017
End-to-end Driving via Conditional Imitation Learning
End-to-end Driving via Conditional Imitation Learning
Felipe Codevilla
Matthias Muller
Antonio M. López
V. Koltun
Alexey Dosovitskiy
123
1,066
0
06 Oct 2017
Overcoming Exploration in Reinforcement Learning with Demonstrations
Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair
Bob McGrew
Marcin Andrychowicz
Wojciech Zaremba
Pieter Abbeel
OffRL
88
783
0
28 Sep 2017
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning
  and Demonstrations
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
E. Todorov
Sergey Levine
133
1,093
0
28 Sep 2017
Imitation from Observation: Learning to Imitate Behaviors from Raw Video
  via Context Translation
Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation
YuXuan Liu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
98
380
0
11 Jul 2017
Active Bias: Training More Accurate Neural Networks by Emphasizing High
  Variance Samples
Active Bias: Training More Accurate Neural Networks by Emphasizing High Variance Samples
Haw-Shiuan Chang
Erik Learned-Miller
Andrew McCallum
73
352
0
24 Apr 2017
Deep Reinforcement Learning framework for Autonomous Driving
Deep Reinforcement Learning framework for Autonomous Driving
Ahmad El-Sallab
Mohammed Abdou
E. Perot
S. Yogamani
81
969
0
08 Apr 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
806
11,866
0
09 Mar 2017
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
204
5,073
0
05 Jun 2016
1