ResearchTrend.AI

AWAC: Accelerating Online Reinforcement Learning with Offline Datasets

16 June 2020
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
    OffRL
    OnRL

Papers citing "AWAC: Accelerating Online Reinforcement Learning with Offline Datasets"

50 / 423 papers shown
Title
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization
The Viet Bui
Thanh Hong Nguyen
Tien Mai
OffRL
30
0
0
02 Oct 2024
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Jie Cheng
Ruixi Qiao
Gang Xiong
Binhua Li
Yingwei Ma
Binhua Li
Yongbin Li
Yisheng Lv
OffRL
OnRL
LM&Ro
50
3
0
01 Oct 2024
FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning
Jiaheng Hu
Rose Hendrix
Ali Farhadi
Aniruddha Kembhavi
Roberto Martín-Martín
Peter Stone
Kuo-Hao Zeng
Kiana Ehsani
40
7
0
25 Sep 2024
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Buhua Liu
Shitong Shao
Bao Li
Lichen Bai
Zhiqiang Xu
Haoyi Xiong
James Kwok
Sumi Helal
Zeke Xie
45
12
0
11 Sep 2024
Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention
Wenhao Zhao
Qiushui Xu
Linjie Xu
Lei Song
Jinyu Wang
Chunlai Zhou
Jiang Bian
34
0
0
11 Sep 2024
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn
Hongyao Tang
Glen Berseth
OffRL
40
1
0
07 Sep 2024
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance
Renming Huang
Shaochong Liu
Yunqiang Pei
Peng Wang
Guoqing Wang
Yang Yang
Hengtao Shen
OffRL
37
0
0
06 Sep 2024
Surgical Task Automation Using Actor-Critic Frameworks and Self-Supervised Imitation Learning
Jingshuai Liu
Alain Andres
Yonghang Jiang
Xichun Luo
Wenmiao Shu
Sotirios A. Tsaftaris
39
0
0
04 Sep 2024
Diffusion Policy Policy Optimization
Allen Z. Ren
Justin Lidard
Lars L. Ankile
Anthony Simeonov
Pulkit Agrawal
Anirudha Majumdar
Benjamin Burchfiel
Hongkai Dai
Max Simchowitz
45
36
0
01 Sep 2024
Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning
Vanshaj Khattar
Ming Jin
OffRL
21
0
0
27 Aug 2024
Unsupervised-to-Online Reinforcement Learning
Junsu Kim
Seohong Park
Sergey Levine
OnRL
56
3
0
27 Aug 2024
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning
Rafael Rafailov
Kyle Hatch
Anikait Singh
Laura Smith
Aviral Kumar
...
Victor Kolev
Philip J. Ball
Jiajun Wu
Chelsea Finn
Sergey Levine
OffRL
34
3
0
15 Aug 2024
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs
Kevin Tan
Wei Fan
Yuting Wei
OffRL
77
2
0
08 Aug 2024
F1tenth Autonomous Racing With Offline Reinforcement Learning Methods
Prajwal Koirala
Cody Fleming
OffRL
34
1
0
08 Aug 2024
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
Shirong Liu
Chenjia Bai
Zixian Guo
Hao Zhang
Gaurav Sharma
Yang Liu
OffRL
35
2
0
04 Aug 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
41
5
0
29 Jul 2024
From Imitation to Refinement -- Residual RL for Precise Visual Assembly
Lars Ankile
Anthony Simeonov
Idan Shenfeld
M. Torné
Pulkit Agrawal
OffRL
36
7
0
23 Jul 2024
Offline Imitation Learning Through Graph Search and Retrieval
Zhao-Heng Yin
Pieter Abbeel
OffRL
48
3
0
22 Jul 2024
Rocket Landing Control with Random Annealing Jump Start Reinforcement Learning
Yuxuan Jiang
Yujie Yang
Zhiqian Lan
Guojian Zhan
Shengbo Eben Li
Qi Sun
Jian Ma
Tianwen Yu
Changwu Zhang
33
1
0
21 Jul 2024
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
Xu-Hui Liu
Tian-Shuo Liu
Shengyi Jiang
Ruifeng Chen
Zhilong Zhang
Xinwei Chen
Yang Yu
OffRL
OnRL
34
2
0
17 Jul 2024
Affordance-Guided Reinforcement Learning via Visual Prompting
Olivia Y. Lee
Annie Xie
Kuan Fang
Karl Pertsch
Chelsea Finn
OffRL
LM&Ro
74
7
0
14 Jul 2024
A Benchmark Environment for Offline Reinforcement Learning in Racing Games
Girolamo Macaluso
Alessandro Sestini
Andrew D. Bagdanov
OffRL
29
0
0
12 Jul 2024
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Huayu Chen
Kaiwen Zheng
Hang Su
Jun Zhu
51
1
0
12 Jul 2024
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark
Nikita Chernyadev
Nicholas Backshall
Xiao Ma
Yunfan Lu
Younggyo Seo
Stephen James
22
11
0
10 Jul 2024
Preference-Guided Reinforcement Learning for Efficient Exploration
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Xuyang Chen
Lin Zhao
40
0
0
09 Jul 2024
AI Safety in Generative AI Large Language Models: A Survey
Jaymari Chua
Yun Yvonna Li
Shiyi Yang
Chen Wang
Lina Yao
LM&MA
39
12
0
06 Jul 2024
FOSP: Fine-tuning Offline Safe Policy through World Models
Chenyang Cao
Yucheng Xin
Silang Wu
Longxiang He
Zichen Yan
Junbo Tan
Xueqian Wang
OffRL
61
0
0
06 Jul 2024
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Chen-Xiao Gao
Shengjun Fang
Chenjun Xiao
Yang Yu
Zongzhang Zhang
OffRL
32
0
0
05 Jul 2024
Residual-MPPI: Online Policy Customization for Continuous Control
Pengcheng Wang
Chenran Li
Catherine Weaver
Kenta Kawamoto
Masayoshi Tomizuka
Chen Tang
Wei Zhan
OffRL
37
3
0
01 Jul 2024
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators
Ori Linial
Guy Tennenholtz
Uri Shalit
OffRL
46
1
0
30 Jun 2024
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making
Vivek Myers
Chongyi Zheng
Anca Dragan
Sergey Levine
Benjamin Eysenbach
OffRL
45
7
0
24 Jun 2024
Equivariant Offline Reinforcement Learning
Arsh Tangri
Ondrej Biza
Dian Wang
David M. Klee
Owen Howell
Robert Platt
OffRL
39
3
0
20 Jun 2024
Offline Imitation Learning with Model-based Reverse Augmentation
Jie-Jing Shao
Hao-Sen Shi
Lan-Zhe Guo
Yu-Feng Li
OffRL
40
5
0
18 Jun 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
51
8
0
13 Jun 2024
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
39
4
0
13 Jun 2024
Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning
Mohammadreza Nakhaei
Aidan Scannell
Joni Pajarinen
OffRL
49
1
0
12 Jun 2024
Hybrid Reinforcement Learning from Offline Observation Alone
Yuda Song
J. Andrew Bagnell
Aarti Singh
OffRL
84
2
0
11 Jun 2024
Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning
Takayuki Osa
Tatsuya Harada
OffRL
40
2
0
10 Jun 2024
Strategically Conservative Q-Learning
Yutaka Shimizu
Joey Hong
Sergey Levine
Masayoshi Tomizuka
OffRL
OnRL
45
0
0
06 Jun 2024
ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories
Qianlan Yang
Yu-Xiong Wang
OnRL
42
1
0
06 Jun 2024
DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays
Bo Xia
Yilun Kong
Yongzhe Chang
Bo Yuan
Zhiheng Li
Xueqian Wang
Bin Liang
OffRL
45
3
0
05 Jun 2024
"Give Me an Example Like This": Episodic Active Reinforcement Learning
  from Demonstrations
Muhan Hou
Koen V. Hindriks
A. E. Eiben
Kim Baraka
OffRL
28
3
0
05 Jun 2024
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
Linjiajie Fang
Ruoxue Liu
Jing Zhang
Wenjia Wang
Bing-Yi Jing
OffRL
56
1
0
31 May 2024
Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
Tianyu Chen
Zhendong Wang
Mingyuan Zhou
OffRL
32
5
0
30 May 2024
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
Zhanhui Zhou
Zhixuan Liu
Jie Liu
Zhichen Dong
Chao Yang
Yu Qiao
ALM
44
20
0
29 May 2024
Multi-objective Cross-task Learning via Goal-conditioned GPT-based Decision Transformers for Surgical Robot Task Automation
Jiawei Fu
Yonghao Long
Kai-xiang Chen
Wang Wei
Qi Dou
MedIm
37
4
0
29 May 2024
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
Tianle Zhang
Jiayi Guan
Lin Zhao
Yihang Li
Dongjiang Li
...
Lei Sun
Yue Chen
Xuelong Wei
Lusong Li
Xiaodong He
43
1
0
29 May 2024
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation
Fengshuo Bai
Rui Zhao
Hongming Zhang
Sijia Cui
Ying Wen
Yaodong Yang
Bo Xu
Lei Han
OffRL
29
6
0
29 May 2024
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization
Longxiang He
Li Shen
Junbo Tan
Xueqian Wang
49
1
0
28 May 2024
Leveraging Offline Data in Linear Latent Bandits
Chinmaya Kausik
Kevin Tan
Ambuj Tewari
OffRL
34
2
0
27 May 2024