Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.09359
Cited By
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
16 June 2020
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"AWAC: Accelerating Online Reinforcement Learning with Offline Datasets"
50 / 423 papers shown
Title
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM
Thang Duong
Minglai Yang
Chicheng Zhang
OffRL
14
0
0
16 May 2025
What Matters for Batch Online Reinforcement Learning in Robotics?
Perry Dong
Suvir Mirchandani
Dorsa Sadigh
Chelsea Finn
OffRL
31
0
0
12 May 2025
VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making
Jake Grigsby
Yuke Zhu
Michael S Ryoo
Juan Carlos Niebles
OffRL
VLM
41
0
0
06 May 2025
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
56
0
0
01 May 2025
Fine-Tuning without Performance Degradation
Han Wang
Adam White
Martha White
OnRL
163
0
0
01 May 2025
Dynamic Action Interpolation: A Universal Approach for Accelerating Reinforcement Learning with Expert Guidance
Wenjun Cao
52
0
0
26 Apr 2025
Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control
Haoran Wang
Zhiwei Shi
Chengxi Zhu
Yafei Qiao
Cheng Zhang
Fan Yang
Pengjie Ren
Lan Lu
D. Xuan
64
1
0
24 Apr 2025
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
27
0
0
17 Apr 2025
Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing
Vinal Asodia
Zhenhua Feng
Saber Fallah
OffRL
37
0
0
11 Apr 2025
Human-Level Competitive Pokémon via Scalable Offline Reinforcement Learning with Transformers
Jake Grigsby
Yuqi Xie
Justin Sasek
Steven Zheng
Yuke Zhu
OffRL
26
0
0
06 Apr 2025
Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning
Younghwan Lee
Tung M. Luu
Donghoon Lee
Chang D. Yoo
3DV
VLM
OffRL
41
0
0
03 Apr 2025
COLSON: Controllable Learning-Based Social Navigation via Diffusion-Based Reinforcement Learning
Yuki Tomita
Kohei Matsumoto
Yuki Hyodo
Ryo Kurazume
66
0
0
18 Mar 2025
Robot Policy Transfer with Online Demonstrations: An Active Reinforcement Learning Approach
Muhan Hou
Koen V. Hindriks
A. E. Eiben
Kim Baraka
OffRL
46
0
0
17 Mar 2025
Evaluation-Time Policy Switching for Offline Reinforcement Learning
Natinael Solomon Neggatu
Jeremie Houssineau
Giovanni Montana
OffRL
OnRL
72
0
0
15 Mar 2025
Active Robot Curriculum Learning from Online Human Demonstrations
Muhan Hou
Koen V. Hindriks
A. E. Eiben
Kim Baraka
67
0
0
04 Mar 2025
A2Perf: Real-World Autonomous Agents Benchmark
Ikechukwu Uchendu
Jason J. Jabbour
Korneel Van den Berghe
Joel Runevic
Matthew P. Stewart
...
S. Guadarrama
Jie Tan
Jordan K. Terry
Aleksandra Faust
Vijay Janapa Reddi
36
0
0
04 Mar 2025
Diffusion Stabilizer Policy for Automated Surgical Robot Manipulations
Chonlam Ho
Jianshu Hu
Haoran Wang
Qi Dou
Yutong Ban
MedIm
73
1
0
03 Mar 2025
Behavior Preference Regression for Offline Reinforcement Learning
Padmanaba Srinivasan
William J. Knottenbelt
OffRL
35
0
0
02 Mar 2025
SFO: Piloting VLM Feedback for Offline RL
Jacob Beck
OffRL
39
0
0
02 Mar 2025
Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation
Siddhant Haldar
Lerrel Pinto
3DPC
66
2
0
27 Feb 2025
Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset
Yiqin Yang
Quanwei Wang
Chenghao Li
Hao Hu
Chengjie Wu
...
Dianyu Zhong
Ziyou Zhang
Qianchuan Zhao
Chongjie Zhang
Xu Bo
OffRL
47
0
0
26 Feb 2025
Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data
Yi Zhao
Aidan Scannell
Wenshuai Zhao
Yuxin Hou
Tianyu Cui
Le Chen
Dieter Büchler
Arno Solin
Juho Kannala
Joni Pajarinen
OffRL
OnRL
96
1
0
26 Feb 2025
Yes, Q-learning Helps Offline In-Context RL
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Andrei Polubarov
Nikita Lyubaykin
Alexander Derevyagin
Igor Kiselev
Vladislav Kurenkov
OffRL
OnRL
175
0
0
24 Feb 2025
Autonomous Vehicles Using Multi-Agent Reinforcement Learning for Routing Decisions Can Harm Urban Traffic
Anastasia Psarou
Ahmet Onur Akman
Łukasz Gorczyca
Michał Hoffmann
Zoltán György Varga
Grzegorz Jamróz
Rafał Kucharski
62
0
0
20 Feb 2025
Privacy-Preserving Dataset Combination
Keren Fuentes
Mimee Xu
Irene Chen
38
0
0
09 Feb 2025
Skill Expansion and Composition in Parameter Space
Tenglong Liu
J. Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
58
4
0
09 Feb 2025
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
Yuhui Chen
Shuai Tian
Shugao Liu
Yingting Zhou
Haoran Li
Dongbin Zhao
OffRL
106
1
0
08 Feb 2025
Rapidly Adapting Policies to the Real World via Simulation-Guided Fine-Tuning
Patrick Yin
Tyler Westenbroek
Simran Bagaria
Kevin Huang
Ching-an Cheng
Andrey Kobolov
Abhishek Gupta
80
2
0
04 Feb 2025
Search-Based Adversarial Estimates for Improving Sample Efficiency in Off-Policy Reinforcement Learning
Federico Malato
Ville Hautamaki
39
0
0
03 Feb 2025
Preference-Based Multi-Agent Reinforcement Learning: Data Coverage and Algorithmic Techniques
Natalia Zhang
X. Wang
Qiwen Cui
Runlong Zhou
Sham Kakade
Simon S. Du
OffRL
48
0
0
10 Jan 2025
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy
Keru Chen
Honghao Wei
Zhigang Deng
Sen Lin
OffRL
OnRL
94
0
0
31 Dec 2024
Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation
Fei Zhao
Xueliang Zhang
36
0
0
25 Dec 2024
ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning
Kun Wu
Yinuo Zhao
Zhihao Xu
Zhengping Che
Chengxiang Yin
C. Liu
Qinru Qiu
Feiferi Feng
OffRL
100
1
0
22 Dec 2024
Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
Xiu Yuan
Tongzhou Mu
Stone Tao
Yunhao Fang
Mengke Zhang
H. Su
OffRL
66
3
0
18 Dec 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Max Sobol Mark
Tian Gao
Georgia Gabriela Sampaio
Mohan Kumar Srirama
Archit Sharma
Chelsea Finn
Aviral Kumar
OffRL
OnRL
95
4
0
09 Dec 2024
Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed
Jingxin Liu
Xiang Gao
Yisha Li
Xin Li
Haiyang Lu
Ben Wang
OffRL
72
0
0
28 Nov 2024
Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards
A. Ahmad
Mehdi Kermanshah
Kevin J. Leahy
Zachary Serlin
H. Siu
Makai Mann
C. Vasile
Roberto Tron
C. Belta
OffRL
66
0
0
26 Nov 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
Jake Grigsby
Justin Sasek
Samyak Parajuli
Daniel Adebi
Amy Zhang
Yuke Zhu
OffRL
26
3
0
17 Nov 2024
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
Kai Yan
A. Schwing
Yu-xiong Wang
OffRL
OnRL
36
0
0
31 Oct 2024
A Non-Monolithic Policy Approach of Offline-to-Online Reinforcement Learning
JaeYoon Kim
Junyu Xuan
Christy Jie Liang
F. Hussain
OffRL
OnRL
36
0
0
31 Oct 2024
SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving
Minh Tri Huynh
Duc Dung Nguyen
OffRL
31
0
0
30 Oct 2024
Robot Policy Learning with Temporal Optimal Transport Reward
Yuwei Fu
Haichao Zhang
Di Wu
Wei-ping Xu
Benoit Boulet
OffRL
42
1
0
29 Oct 2024
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
Jing Zhang
Linjiajie Fang
Kexin Shi
Wenjia Wang
Bing-Yi Jing
OffRL
36
0
0
27 Oct 2024
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
Yixiu Mao
Qi Wang
Chen Chen
Yun Qu
Xiangyang Ji
OffRL
48
6
0
25 Oct 2024
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces
Jifeng Hu
Sili Huang
Li Shen
Zhejian Yang
Shengchao Hu
Shisong Tang
H. Chen
Yi-Ju Chang
Dacheng Tao
Lichao Sun
OffRL
39
0
0
21 Oct 2024
Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Mitsuhiko Nakamoto
Oier Mees
Aviral Kumar
Sergey Levine
OffRL
79
13
0
17 Oct 2024
Incremental Learning for Robot Shared Autonomy
Yiran Tao
Guixiu Qiao
Dan Ding
Zackory Erickson
CLL
35
0
0
08 Oct 2024
Unpacking Failure Modes of Generative Policies: Runtime Monitoring of Consistency and Progress
Christopher Agia
Rohan Sinha
Jingyun Yang
Zi-ang Cao
Rika Antonova
Marco Pavone
Jeannette Bohg
28
7
0
06 Oct 2024
Robust Offline Imitation Learning from Diverse Auxiliary Data
Udita Ghosh
Dripta S. Raychaudhuri
Jiachen Li
Konstantinos Karydis
A. Roy-Chowdhury
OffRL
24
1
0
04 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
46
4
0
03 Oct 2024
1
2
3
4
5
6
7
8
9
Next