ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.09359
  4. Cited By
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets

AWAC: Accelerating Online Reinforcement Learning with Offline Datasets

16 June 2020
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
    OffRL
    OnRL
ArXivPDFHTML

Papers citing "AWAC: Accelerating Online Reinforcement Learning with Offline Datasets"

50 / 423 papers shown
Title
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive
  Control
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control
Rohan Chitnis
Yingchen Xu
B. Hashemi
Lucas Lehnert
Ürün Dogan
Zheqing Zhu
Olivier Delalleau
OffRL
26
9
0
01 Jun 2023
Efficient Diffusion Policies for Offline Reinforcement Learning
Efficient Diffusion Policies for Offline Reinforcement Learning
Bingyi Kang
Xiao Ma
Chao Du
Tianyu Pang
Shuicheng Yan
OffRL
34
62
0
31 May 2023
Offline Meta Reinforcement Learning with In-Distribution Online
  Adaptation
Offline Meta Reinforcement Learning with In-Distribution Online Adaptation
Jianhao Wang
Jin Zhang
Haozhe Jiang
Junyu Zhang
Liwei Wang
Chongjie Zhang
OffRL
26
9
0
31 May 2023
What is Essential for Unseen Goal Generalization of Offline
  Goal-conditioned RL?
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?
Rui Yang
Yong Lin
Xiaoteng Ma
Haotian Hu
Chongjie Zhang
Tong Zhang
OffRL
29
22
0
30 May 2023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement
  Learning
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Ya-Qin Zhang
OffRL
OnRL
36
19
0
25 May 2023
Matrix Estimation for Offline Reinforcement Learning with Low-Rank
  Structure
Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure
Xumei Xi
Chao Yu
Yudong Chen
OffRL
24
0
0
24 May 2023
Inverse Preference Learning: Preference-based RL without a Reward
  Function
Inverse Preference Learning: Preference-based RL without a Reward Function
Joey Hejna
Dorsa Sadigh
OffRL
29
48
0
24 May 2023
OER: Offline Experience Replay for Continual Offline Reinforcement
  Learning
OER: Offline Experience Replay for Continual Offline Reinforcement Learning
Sibo Gai
Donglin Wang
Li He
CLL
OffRL
45
3
0
23 May 2023
Training Diffusion Models with Reinforcement Learning
Training Diffusion Models with Reinforcement Learning
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
44
316
0
22 May 2023
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid
  Reinforcement Learning
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning
Gen Li
Wenhao Zhan
Jason D. Lee
Yuejie Chi
Yuxin Chen
OffRL
OnRL
73
12
0
17 May 2023
Knowledge Transfer from Teachers to Learners in Growing-Batch
  Reinforcement Learning
Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning
P. Emedom-Nnamdi
A. Friesen
Bobak Shahriari
Nando de Freitas
Matthew W. Hoffman
OffRL
23
0
0
05 May 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
36
12
0
26 Apr 2023
What can online reinforcement learning with function approximation
  benefit from general coverage conditions?
What can online reinforcement learning with function approximation benefit from general coverage conditions?
Fanghui Liu
Luca Viano
V. Cevher
OffRL
32
2
0
25 Apr 2023
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling
  in Offline Reinforcement Learning
Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning
Cheng Lu
Huayu Chen
Jianfei Chen
Hang Su
Chongxuan Li
Jun Zhu
DiffM
OffRL
25
58
0
25 Apr 2023
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion
  Policies
IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Philippe Hansen-Estruch
Ilya Kostrikov
Michael Janner
J. Kuba
Sergey Levine
OffRL
25
130
0
20 Apr 2023
FastRLAP: A System for Learning High-Speed Driving via Deep RL and
  Autonomous Practicing
FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing
Kyle Stachowicz
Dhruv Shah
Arjun Bhorkar
Ilya Kostrikov
Sergey Levine
OffRL
38
26
0
19 Apr 2023
Using Offline Data to Speed-up Reinforcement Learning in Procedurally
  Generated Environments
Using Offline Data to Speed-up Reinforcement Learning in Procedurally Generated Environments
Alain Andres
Lukas Schafer
Esther Villar-Rodriguez
Stefano V. Albrecht
Javier Del Ser
OffRL
OnRL
31
2
0
18 Apr 2023
Learning Robot Manipulation from Cross-Morphology Demonstration
Learning Robot Manipulation from Cross-Morphology Demonstration
G. Salhotra
Isabella Liu
Gaurav Sukhatme
LM&Ro
25
9
0
07 Apr 2023
Enabling A Network AI Gym for Autonomous Cyber Agents
Enabling A Network AI Gym for Autonomous Cyber Agents
Li Li
Jean-Pierre S. El Rami
Adrian Taylor
James Hailing Rao
T. Kunz
OffRL
AI4CE
4
3
0
03 Apr 2023
PyFlyt -- UAV Simulation Environments for Reinforcement Learning
  Research
PyFlyt -- UAV Simulation Environments for Reinforcement Learning Research
Jun Jet Tai
J. Wong
M. Innocente
N. Horri
J. Brusey
S. K. Phang
16
10
0
03 Apr 2023
Unified Emulation-Simulation Training Environment for Autonomous Cyber
  Agents
Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents
Li Li
Jean-Pierre S. El Rami
Adrian Taylor
James Hailing Rao
T. Kunz
13
4
0
03 Apr 2023
Chain-of-Thought Predictive Control
Chain-of-Thought Predictive Control
Zhiwei Jia
Vineet Thumuluri
Fangchen Liu
Ling-Hao Chen
Zhiao Huang
H. Su
LM&Ro
39
20
0
03 Apr 2023
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs
  and Practical Solutions
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions
Yicheng Luo
Jackie Kay
Edward Grefenstette
M. Deisenroth
OffRL
OnRL
13
15
0
30 Mar 2023
Learning Excavation of Rigid Objects with Offline Reinforcement Learning
Learning Excavation of Rigid Objects with Offline Reinforcement Learning
Shiyu Jin
Zhixian Ye
Liangjun Zhang
OffRL
24
7
0
29 Mar 2023
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value
  Regularization
Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization
Haoran Xu
Li Jiang
Jianxiong Li
Zhuoran Yang
Zhaoran Wang
Victor Chan
Xianyuan Zhan
OffRL
36
71
0
28 Mar 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A
  Survey
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
35
1
0
23 Mar 2023
Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale
Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale
Botao Hao
Rahul Jain
Dengwang Tang
Zheng Wen
OffRL
29
3
0
20 Mar 2023
A Survey of Demonstration Learning
A Survey of Demonstration Learning
André Rosa de Sousa Porfírio Correia
Luís A. Alexandre
OffRL
36
17
0
20 Mar 2023
Towards Real-World Applications of Personalized Anesthesia Using Policy
  Constraint Q Learning for Propofol Infusion Control
Towards Real-World Applications of Personalized Anesthesia Using Policy Constraint Q Learning for Propofol Infusion Control
Xiuding Cai
Jiao Chen
Yaoyao Zhu
Beiming Wang
Yu Yao
OffRL
36
5
0
17 Mar 2023
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRL
OnRL
18
21
0
14 Mar 2023
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online
  Fine-Tuning
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Mitsuhiko Nakamoto
Yuexiang Zhai
Anika Singh
Max Sobol Mark
Yi Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
OffRL
OnRL
112
108
0
09 Mar 2023
Learning Exploration Strategies to Solve Real-World Marble Runs
Learning Exploration Strategies to Solve Real-World Marble Runs
Alisa Allaire
C. Atkeson
34
0
0
08 Mar 2023
Foundation Models for Decision Making: Problems, Methods, and
  Opportunities
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&Ro
OffRL
LRM
AI4CE
92
155
0
07 Mar 2023
Environment Transformer and Policy Optimization for Model-Based Offline
  Reinforcement Learning
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning
Pengqin Wang
Meixin Zhu
Shaojie Shen
OffRL
30
1
0
07 Mar 2023
Teach a Robot to FISH: Versatile Imitation from One Minute of
  Demonstrations
Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations
Siddhant Haldar
Jyothish Pari
A. Rai
Lerrel Pinto
24
65
0
02 Mar 2023
The In-Sample Softmax for Offline Reinforcement Learning
The In-Sample Softmax for Offline Reinforcement Learning
Chenjun Xiao
Han Wang
Yangchen Pan
Adam White
Martha White
OffRL
29
26
0
28 Feb 2023
Demonstration-Guided Reinforcement Learning with Efficient Exploration
  for Task Automation of Surgical Robot
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Tao Huang
Kai-xiang Chen
Bin Li
Yunhui Liu
Qingxu Dou
35
23
0
20 Feb 2023
Swapped goal-conditioned offline reinforcement learning
Swapped goal-conditioned offline reinforcement learning
Wenyan Yang
Huiling Wang
Dingding Cai
Joni Pajarinen
Joni-Kristen Kämäräinen
OffRL
OnRL
33
1
0
17 Feb 2023
Pretraining Language Models with Human Preferences
Pretraining Language Models with Human Preferences
Tomasz Korbak
Kejian Shi
Angelica Chen
Rasika Bhalerao
C. L. Buckley
Jason Phang
Sam Bowman
Ethan Perez
ALM
SyDa
36
207
0
16 Feb 2023
Dual RL: Unification and New Methods for Reinforcement and Imitation
  Learning
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
Harshit S. Sikchi
Qinqing Zheng
Amy Zhang
S. Niekum
OffRL
33
19
0
16 Feb 2023
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Constrained Decision Transformer for Offline Safe Reinforcement Learning
Zuxin Liu
Zijian Guo
Yi-Fan Yao
Zhepeng Cen
Wenhao Yu
Tingnan Zhang
Ding Zhao
OffRL
31
46
0
14 Feb 2023
Conservative State Value Estimation for Offline Reinforcement Learning
Conservative State Value Estimation for Offline Reinforcement Learning
Liting Chen
Jie Yan
Zhengdao Shao
Lu Wang
Qingwei Lin
Saravan Rajmohan
Thomas Moscibroda
Dongmei Zhang
OffRL
20
5
0
14 Feb 2023
ALAN: Autonomously Exploring Robotic Agents in the Real World
ALAN: Autonomously Exploring Robotic Agents in the Real World
Russell Mendonca
Shikhar Bahl
Deepak Pathak
LM&Ro
36
20
0
13 Feb 2023
Efficient Online Reinforcement Learning with Offline Data
Efficient Online Reinforcement Learning with Offline Data
Philip J. Ball
Laura M. Smith
Ilya Kostrikov
Sergey Levine
OffRL
OnRL
32
163
0
06 Feb 2023
Two-Stage Constrained Actor-Critic for Short Video Recommendation
Two-Stage Constrained Actor-Critic for Short Video Recommendation
Qingpeng Cai
Zhenghai Xue
Chi Zhang
Wanqi Xue
Shuchang Liu
...
Tianyou Zuo
Wentao Xie
Dong Zheng
Peng Jiang
Kun Gai
OffRL
CML
19
44
0
03 Feb 2023
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning
Haichao Zhang
Weiwen Xu
Haonan Yu
CLL
OffRL
OnRL
40
62
0
02 Feb 2023
Anti-Exploration by Random Network Distillation
Anti-Exploration by Random Network Distillation
Alexander Nikulin
Vladislav Kurenkov
Denis Tarasov
Sergey Kolesnikov
38
24
0
31 Jan 2023
Identifying Expert Behavior in Offline Training Datasets Improves
  Behavioral Cloning of Robotic Manipulation Policies
Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Francisco Roldan Sanchez
Kevin McGuinness
Noel E. O'Connor
S. Redmond
OffRL
25
3
0
30 Jan 2023
Guiding Online Reinforcement Learning with Action-Free Offline
  Pretraining
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining
Deyao Zhu
Yuhui Wang
Jürgen Schmidhuber
Mohamed Elhoseiny
OffRL
OnRL
41
8
0
30 Jan 2023
Importance Weighted Actor-Critic for Optimal Conservative Offline
  Reinforcement Learning
Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning
Hanlin Zhu
Paria Rashidinejad
Jiantao Jiao
OffRL
38
15
0
30 Jan 2023
Previous
123456789
Next