ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.09359
  4. Cited By
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets

AWAC: Accelerating Online Reinforcement Learning with Offline Datasets

16 June 2020
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
    OffRL
    OnRL
ArXivPDFHTML

Papers citing "AWAC: Accelerating Online Reinforcement Learning with Offline Datasets"

50 / 423 papers shown
Title
GTA: Generative Trajectory Augmentation with Guidance for Offline
  Reinforcement Learning
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning
Jaewoo Lee
Sujin Yun
Taeyoung Yun
Jinkyoo Park
46
6
0
27 May 2024
Efficient Multi-agent Reinforcement Learning by Planning
Efficient Multi-agent Reinforcement Learning by Planning
Qihan Liu
Jianing Ye
Xiaoteng Ma
Jun Yang
Bin Liang
Chongjie Zhang
29
4
0
20 May 2024
Ensemble Successor Representations for Task Generalization in
  Offline-to-Online Reinforcement Learning
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
Changhong Wang
Xudong Yu
Chenjia Bai
Qiaosheng Zhang
Zhen Wang
40
1
0
12 May 2024
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement
  Learning
Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning
Dhruva Tirumala
Markus Wulfmeier
Ben Moran
Sandy Huang
Jan Humplik
...
Kushal Patel
Marlon Gwira
Francesco Nori
Martin Riedmiller
N. Heess
38
10
0
03 May 2024
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline
  Reinforcement Learning
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Chenjia Bai
Lingxiao Wang
Jianye Hao
Zhuoran Yang
Bin Zhao
Zhen Wang
Xuelong Li
OffRL
37
9
0
30 Apr 2024
Benchmarking Mobile Device Control Agents across Diverse Configurations
Benchmarking Mobile Device Control Agents across Diverse Configurations
Juyong Lee
Taywon Min
Minyong An
Changyeon Kim
Kimin Lee
41
9
0
25 Apr 2024
Continual Offline Reinforcement Learning via Diffusion-based Dual
  Generative Replay
Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay
Jinmei Liu
Wenbin Li
Xiangyu Yue
Shilin Zhang
Chunlin Chen
Zhi Wang
OffRL
DiffM
36
5
0
16 Apr 2024
Sequential Decision Making with Expert Demonstrations under Unobserved
  Heterogeneity
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity
Vahid Balazadeh Meresht
Keertana Chidambaram
Viet Nguyen
Rahul G. Krishnan
Vasilis Syrgkanis
47
0
0
10 Apr 2024
Diverse Randomized Value Functions: A Provably Pessimistic Approach for
  Offline Reinforcement Learning
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
Xudong Yu
Chenjia Bai
Hongyi Guo
Changhong Wang
Zhen Wang
OffRL
39
0
0
09 Apr 2024
Demonstration Guided Multi-Objective Reinforcement Learning
Demonstration Guided Multi-Objective Reinforcement Learning
Junlin Lu
Patrick Mannion
Karl Mason
27
0
0
05 Apr 2024
Simple Ingredients for Offline Reinforcement Learning
Simple Ingredients for Offline Reinforcement Learning
Edoardo Cetin
Andrea Tirinzoni
Matteo Pirotta
A. Lazaric
Yann Ollivier
Ahmed Touati
OffRL
40
2
0
19 Mar 2024
OPEN TEACH: A Versatile Teleoperation System for Robotic Manipulation
OPEN TEACH: A Versatile Teleoperation System for Robotic Manipulation
Aadhithya Iyer
Zhuoran Peng
Yinlong Dai
Irmak Güzey
Siddhant Haldar
Soumith Chintala
Lerrel Pinto
39
48
0
12 Mar 2024
A2PO: Towards Effective Offline Reinforcement Learning from an
  Advantage-aware Perspective
A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective
Yunpeng Qing
Shunyu Liu
Jingyuan Cong
Kaixuan Chen
Yihe Zhou
Mingli Song
OffRL
37
1
0
12 Mar 2024
In-context Exploration-Exploitation for Reinforcement Learning
In-context Exploration-Exploitation for Reinforcement Learning
Zhenwen Dai
Federico Tomasi
Sina Ghiassian
OffRL
OnRL
38
3
0
11 Mar 2024
Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach
  for Robust Manipulation
Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation
M. Torné
Anthony Simeonov
Zechu Li
April Chan
Tao Chen
Abhishek Gupta
Pulkit Agrawal
47
57
0
06 Mar 2024
Snapshot Reinforcement Learning: Leveraging Prior Trajectories for
  Efficiency
Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
Yanxiao Zhao
Yangge Qian
Tianyi Wang
Jingyang Shan
Xiaolin Qin
21
0
0
01 Mar 2024
A Model-Based Approach for Improving Reinforcement Learning Efficiency
  Leveraging Expert Observations
A Model-Based Approach for Improving Reinforcement Learning Efficiency Leveraging Expert Observations
E. C. Ozcan
Vittorio Giammarino
James Queeney
I. Paschalidis
OffRL
39
0
0
29 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
31
3
0
21 Feb 2024
Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via
  Metric Learning
Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via Metric Learning
Alfredo Reichlin
Miguel Vasco
Hang Yin
Danica Kragic
OffRL
24
0
0
16 Feb 2024
SPO: Sequential Monte Carlo Policy Optimisation
SPO: Sequential Monte Carlo Policy Optimisation
Matthew Macfarlane
Edan Toledo
Donal Byrne
Paul Duckworth
Alexandre Laterre
30
1
0
12 Feb 2024
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy
  Optimization
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization
Talha Bozkus
Urbashi Mitra
OffRL
24
5
0
08 Feb 2024
The Virtues of Pessimism in Inverse Reinforcement Learning
David Wu
Gokul Swamy
J. Andrew Bagnell
Zhiwei Steven Wu
Sanjiban Choudhury
33
0
0
04 Feb 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via
  Orthogonal-gradient Update
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
34
10
0
01 Feb 2024
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Jianlan Luo
Zheyuan Hu
Charles Xu
You Liang Tan
Jacob Berg
Archit Sharma
S. Schaal
Chelsea Finn
Abhishek Gupta
Sergey Levine
OffRL
OnRL
34
41
0
29 Jan 2024
RESPRECT: Speeding-up Multi-fingered Grasping with Residual
  Reinforcement Learning
RESPRECT: Speeding-up Multi-fingered Grasping with Residual Reinforcement Learning
Federico Ceola
Lorenzo Rosasco
Lorenzo Natale
37
5
0
26 Jan 2024
Learning from Sparse Offline Datasets via Conservative Density
  Estimation
Learning from Sparse Offline Datasets via Conservative Density Estimation
Zhepeng Cen
Zuxin Liu
Zitong Wang
Yi-Fan Yao
Henry Lam
Ding Zhao
OffRL
28
7
0
16 Jan 2024
Solving Continual Offline Reinforcement Learning with Decision
  Transformer
Solving Continual Offline Reinforcement Learning with Decision Transformer
Kaixin Huang
Li Shen
Chen Zhao
Chun Yuan
Dacheng Tao
CLL
OffRL
24
5
0
16 Jan 2024
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot
  Learning
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
OffRL
OnRL
22
9
0
06 Jan 2024
Diffusion Reward: Learning Rewards via Conditional Video Diffusion
Diffusion Reward: Learning Rewards via Conditional Video Diffusion
Tao Huang
Guangqi Jiang
Yanjie Ze
Huazhe Xu
VGen
39
22
0
21 Dec 2023
Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline
  Pre-Training with Model Based Augmentation
Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation
Girolamo Macaluso
Alessandro Sestini
Andrew D. Bagdanov
OffRL
OnRL
27
3
0
15 Dec 2023
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement
  Learning
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Yinmin Zhang
Jie Liu
Chuming Li
Yazhe Niu
Yaodong Yang
Yu Liu
Wanli Ouyang
OffRL
OnRL
46
11
0
12 Dec 2023
The Generalization Gap in Offline Reinforcement Learning
The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
84
10
0
10 Dec 2023
Guaranteed Trust Region Optimization via Two-Phase KL Penalization
Guaranteed Trust Region Optimization via Two-Phase KL Penalization
K.R. Zentner
Ujjwal Puri
Zhehui Huang
Gaurav Sukhatme
OffRL
21
0
0
08 Dec 2023
Dexterous Functional Grasping
Dexterous Functional Grasping
Ananye Agarwal
Shagun Uppal
Kenneth Shaw
Deepak Pathak
36
34
0
05 Dec 2023
Replay across Experiments: A Natural Extension of Off-Policy RL
Replay across Experiments: A Natural Extension of Off-Policy RL
Dhruva Tirumala
Thomas Lampe
José Enrique Chen
Tuomas Haarnoja
Sandy Huang
...
Tim Hertweck
Leonard Hasenclever
Martin Riedmiller
N. Heess
Markus Wulfmeier
OffRL
37
8
0
27 Nov 2023
Supported Trust Region Optimization for Offline Reinforcement Learning
Supported Trust Region Optimization for Offline Reinforcement Learning
Yongyi Mao
Hongchang Zhang
Cheng Chen
Yi Tian Xu
Xiangyang Ji
OffRL
34
14
0
15 Nov 2023
Accelerating Exploration with Unlabeled Prior Data
Accelerating Exploration with Unlabeled Prior Data
Qiyang Li
Jason Zhang
Dibya Ghosh
Amy Zhang
Sergey Levine
OffRL
OnRL
31
9
0
09 Nov 2023
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with
  Multi-Step On-Policy Optimization
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization
Kun Lei
Zhengmao He
Chenhao Lu
Kaizhe Hu
Yang Gao
Huazhe Xu
OffRL
OnRL
58
13
0
06 Nov 2023
Imitation Bootstrapped Reinforcement Learning
Imitation Bootstrapped Reinforcement Learning
Hengyuan Hu
Suvir Mirchandani
Dorsa Sadigh
41
24
0
03 Nov 2023
Learning Realistic Traffic Agents in Closed-loop
Learning Realistic Traffic Agents in Closed-loop
Chris Zhang
James Tu
Lunjun Zhang
Kelvin Wong
Simon Suo
R. Urtasun
31
18
0
02 Nov 2023
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with
  Learned Models
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
Mianchu Wang
Rui Yang
Xi Chen
Hao Sun
Meng Fang
Giovanni Montana
OffRL
36
9
0
30 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
32
6
0
28 Oct 2023
Guided Data Augmentation for Offline Reinforcement Learning and
  Imitation Learning
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
Nicholas Corrado
Yu-Tao Qu
John U. Balis
Adam Labiosa
Josiah P. Hanna
OffRL
35
2
0
27 Oct 2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online
  Reinforcement Learning
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
32
13
0
27 Oct 2023
Finetuning Offline World Models in the Real World
Finetuning Offline World Models in the Real World
Yunhai Feng
Nicklas Hansen
Ziyan Xiong
Chandramouli Rajagopalan
Xiaolong Wang
OffRL
OnRL
22
20
0
24 Oct 2023
Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for
  Autonomous Real-World Reinforcement Learning
Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning
Jingyun Yang
Max Sobol Mark
Brandon Vu
Archit Sharma
Jeannette Bohg
Chelsea Finn
OffRL
OnRL
32
21
0
23 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data
  Corruption
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
41
15
0
19 Oct 2023
Plan-Guided Reinforcement Learning for Whole-Body Manipulation
Plan-Guided Reinforcement Learning for Whole-Body Manipulation
Mengchao Zhang
Jose Barreiros
Aykut Özgün Önol
35
4
0
18 Oct 2023
Action-Quantized Offline Reinforcement Learning for Robotic Skill
  Learning
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Jianlan Luo
Perry Dong
Jeffrey Wu
Aviral Kumar
Xinyang Geng
Sergey Levine
OffRL
36
18
0
18 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate
  Exploration Bias
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
31
1
0
12 Oct 2023
Previous
123456789
Next