Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2106.06860
Cited By
A Minimalist Approach to Offline Reinforcement Learning
12 June 2021
Scott Fujimoto
S. Gu
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Minimalist Approach to Offline Reinforcement Learning"
50 / 522 papers shown
Title
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
21
7
0
02 Feb 2024
Distilling LLMs' Decomposition Abilities into Compact Language Models
Denis Tarasov
Kumar Shridhar
SyDa
OffRL
LRM
45
2
0
02 Feb 2024
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
34
10
0
01 Feb 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
Songyang Gao
Qiming Ge
Wei Shen
Shihan Dou
Junjie Ye
...
Yicheng Zou
Zhi Chen
Hang Yan
Qi Zhang
Dahua Lin
57
10
0
21 Jan 2024
MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning
Mao Hong
Zhiyue Zhang
Yue Wu
Yan Xu
OffRL
48
0
0
21 Jan 2024
Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model
Yinan Zheng
Jianxiong Li
Dongjie Yu
Yujie Yang
Shengbo Eben Li
Xianyuan Zhan
Jingjing Liu
OffRL
36
24
0
19 Jan 2024
Exploration and Anti-Exploration with Distributional Random Network Distillation
Kai Yang
Jian Tao
Jiafei Lyu
Xiu Li
37
15
0
18 Jan 2024
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning
Sabariswaran Mani
Sreyas Venkataraman
Abhranil Chandra
Adyan Rizvi
Yash Sirvi
Soumojit Bhattacharya
Aritra Hazra
OffRL
26
1
0
17 Jan 2024
Learning from Sparse Offline Datasets via Conservative Density Estimation
Zhepeng Cen
Zuxin Liu
Zitong Wang
Yi-Fan Yao
Henry Lam
Ding Zhao
OffRL
25
7
0
16 Jan 2024
Solving Continual Offline Reinforcement Learning with Decision Transformer
Kaixin Huang
Li Shen
Chen Zhao
Chun Yuan
Dacheng Tao
CLL
OffRL
21
5
0
16 Jan 2024
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Yuanzhao Zhai
Yiying Li
Zijian Gao
Xudong Gong
Kele Xu
Dawei Feng
Bo Ding
Huaimin Wang
OffRL
43
2
0
11 Jan 2024
Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
J. Kuba
Masatoshi Uehara
Pieter Abbeel
Sergey Levine
AI4CE
21
4
0
08 Jan 2024
DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement
Jiaqi Liu
Peng Hang
Xiaocong Zhao
Jianqiang Wang
Jian Sun
54
10
0
08 Jan 2024
Policy-regularized Offline Multi-objective Reinforcement Learning
Qian Lin
Chao Yu
Zongkai Liu
Zifan Wu
OffRL
13
6
0
04 Jan 2024
Efficient Reinforcement Learning via Decoupling Exploration and Utilization
Jingpu Yang
Helin Wang
Qirui Zhao
Zhecheng Shi
Zirui Song
Miao Fang
24
0
0
26 Dec 2023
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Renzhe Zhou
Chenxiao Gao
Zongzhang Zhang
Yang Yu
OffRL
45
11
0
26 Dec 2023
Parameterized Decision-making with Multi-modal Perception for Autonomous Driving
Yuyang Xia
Shuncheng Liu
Quanlin Yu
Liwei Deng
You Zhang
Han Su
Kai Zheng
27
49
0
19 Dec 2023
Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation
Girolamo Macaluso
Alessandro Sestini
Andrew D. Bagdanov
OffRL
OnRL
27
3
0
15 Dec 2023
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Yinmin Zhang
Jie Liu
Chuming Li
Yazhe Niu
Yaodong Yang
Yu Liu
Wanli Ouyang
OffRL
OnRL
44
11
0
12 Dec 2023
The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
84
10
0
10 Dec 2023
Pearl: A Production-ready Reinforcement Learning Agent
Zheqing Zhu
Rodrigo de Salvo Braz
Jalaj Bhandari
Daniel Jiang
Yi Wan
...
D. Korenkevych
Ürün Dogan
Frank Cheng
Zheng Wu
Wanqiao Xu
VLM
OffRL
OnRL
39
6
0
06 Dec 2023
Diffused Task-Agnostic Milestone Planner
Mineui Hong
Minjae Kang
Songhwai Oh
21
6
0
06 Dec 2023
H-GAP: Humanoid Control with a Generalist Planner
Zhengyao Jiang
Yingchen Xu
Nolan Wagener
Yicheng Luo
Michael Janner
Edward Grefenstette
Tim Rocktaschel
Yuandong Tian
AI4CE
27
5
0
05 Dec 2023
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
Vincent Liu
P. Nagarajan
Andrew Patterson
Martha White
OffRL
26
2
0
04 Dec 2023
Supported Trust Region Optimization for Offline Reinforcement Learning
Yongyi Mao
Hongchang Zhang
Cheng Chen
Yi Tian Xu
Xiangyang Ji
OffRL
31
14
0
15 Nov 2023
Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization
Kun Lei
Zhengmao He
Chenhao Lu
Kaizhe Hu
Yang Gao
Huazhe Xu
OffRL
OnRL
54
13
0
06 Nov 2023
Learning Realistic Traffic Agents in Closed-loop
Chris Zhang
James Tu
Lunjun Zhang
Kelvin Wong
Simon Suo
R. Urtasun
31
18
0
02 Nov 2023
Diffusion Models for Reinforcement Learning: A Survey
Zhengbang Zhu
Hanye Zhao
Haoran He
Yichao Zhong
Shenyu Zhang
Haoquan Guo
Tingting Chen
Weinan Zhang
41
60
0
02 Nov 2023
Rethinking Decision Transformer via Hierarchical Reinforcement Learning
Yi Ma
Chenjun Xiao
Hebin Liang
Jianye Hao
OffRL
19
6
0
01 Nov 2023
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Ruizhe Shi
Yuyao Liu
Yanjie Ze
Simon S. Du
Huazhe Xu
OffRL
RALM
31
18
0
31 Oct 2023
Contrastive Difference Predictive Coding
Chongyi Zheng
Ruslan Salakhutdinov
Benjamin Eysenbach
AI4TS
OffRL
28
11
0
31 Oct 2023
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
Mianchu Wang
Rui Yang
Xi Chen
Hao Sun
Meng Fang
Giovanni Montana
OffRL
36
9
0
30 Oct 2023
Robust Offline Reinforcement learning with Heavy-Tailed Rewards
Jin Zhu
Runzhe Wan
Zhengling Qi
S. Luo
C. Shi
OffRL
37
0
0
28 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
32
6
0
28 Oct 2023
Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage
Kishan Panaganti
Zaiyan Xu
D. Kalathil
Mohammad Ghavamzadeh
OOD
OffRL
34
6
0
27 Oct 2023
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
Nicholas Corrado
Yu-Tao Qu
John U. Balis
Adam Labiosa
Josiah P. Hanna
OffRL
35
2
0
27 Oct 2023
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
Shenzhi Wang
Qisen Yang
Jiawei Gao
Matthieu Lin
Hao Chen
Liwei Wu
Ning Jia
Shiji Song
Gao Huang
OffRL
32
13
0
27 Oct 2023
CROP: Conservative Reward for Model-based Offline Policy Optimization
Hao Li
Xiaohu Zhou
Xiaoliang Xie
Shiqi Liu
Zhen-Qiu Feng
...
Mei-Jiang Gui
Tian-Yu Xiang
De-Xing Huang
Bo-Xian Yao
Zeng-Guang Hou
OffRL
32
2
0
26 Oct 2023
Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning
Hongyu Zang
Xin-hui Li
Leiji Zhang
Yang Liu
Baigui Sun
Riashat Islam
Rémi Tachet des Combes
Romain Laroche
OffRL
29
5
0
26 Oct 2023
Finetuning Offline World Models in the Real World
Yunhai Feng
Nicklas Hansen
Ziyan Xiong
Chandramouli Rajagopalan
Xiaolong Wang
OffRL
OnRL
19
20
0
24 Oct 2023
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption
Rui Yang
Han Zhong
Jiawei Xu
Amy Zhang
Chong Zhang
Lei Han
Tong Zhang
OffRL
OnRL
41
15
0
19 Oct 2023
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Jianlan Luo
Perry Dong
Jeffrey Wu
Aviral Kumar
Xinyang Geng
Sergey Levine
OffRL
33
18
0
18 Oct 2023
End-to-end Offline Reinforcement Learning for Glycemia Control
Tristan Beolet
Alice Adenis
E. Huneker
Maxime Louis
OffRL
32
1
0
16 Oct 2023
Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Hao Sun
Alihan Huyuk
Daniel Jarrett
M. Schaar
OffRL
36
6
0
11 Oct 2023
Score Regularized Policy Optimization through Diffusion Behavior
Huayu Chen
Cheng Lu
Zhengyi Wang
Hang Su
Jun Zhu
28
20
0
11 Oct 2023
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
41
20
0
10 Oct 2023
Bi-Level Offline Policy Optimization with Limited Exploration
Wenzhuo Zhou
OffRL
36
4
0
10 Oct 2023
Memory-Consistent Neural Networks for Imitation Learning
Kaustubh Sridhar
Souradeep Dutta
Dinesh Jayaraman
James Weimer
Insup Lee
41
8
0
09 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
20
6
0
09 Oct 2023
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Xiong-Hui Chen
Junyin Ye
Hang Zhao
Yi-Chen Li
Haoran Shi
...
Si-Hang Yang
Anqi Huang
Kai Xu
Zongzhang Zhang
Yang Yu
22
0
0
09 Oct 2023
Previous
1
2
3
4
5
6
...
9
10
11
Next