Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.12021
Cited By
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
22 March 2021
Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart J. Russell
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism"
23 / 73 papers shown
Title
Hindsight Learning for MDPs with Exogenous Inputs
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
18
23
0
13 Jul 2022
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
Fan Chen
Junyu Zhang
Zaiwen Wen
OffRL
39
8
0
13 Jul 2022
Offline Equilibrium Finding
Shuxin Li
Xinrun Wang
Youzhi Zhang
Jakub Cerny
Pengdeng Li
Hau Chan
Bo An
OffRL
46
2
0
12 Jul 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
Yunfei Li
Tian Gao
Jiaqi Yang
Huazhe Xu
Yi Wu
OffRL
31
22
0
24 Jun 2022
Offline Stochastic Shortest Path: Learning, Evaluation and Towards Optimality
Ming Yin
Wenjing Chen
Mengdi Wang
Yu Wang
OffRL
30
4
0
10 Jun 2022
On the Role of Discount Factor in Offline Reinforcement Learning
Haotian Hu
Yiqin Yang
Qianchuan Zhao
Chongjie Zhang
OffRL
34
18
0
07 Jun 2022
Byzantine-Robust Online and Offline Distributed Reinforcement Learning
Yiding Chen
Xuezhou Zhang
Kaipeng Zhang
Mengdi Wang
Xiaojin Zhu
OffRL
26
16
0
01 Jun 2022
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes
Miao Lu
Yifei Min
Zhaoran Wang
Zhuoran Yang
OffRL
57
22
0
26 May 2022
Settling the Sample Complexity of Model-Based Offline Reinforcement Learning
Gen Li
Laixi Shi
Yuxin Chen
Yuejie Chi
Yuting Wei
OffRL
54
83
0
11 Apr 2022
Jump-Start Reinforcement Learning
Ikechukwu Uchendu
Ted Xiao
Yao Lu
Banghua Zhu
Mengyuan Yan
...
Chuyuan Fu
Cong Ma
Jiantao Jiao
Sergey Levine
Karol Hausman
OffRL
OnRL
44
109
0
05 Apr 2022
Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps
Jinglin Chen
Nan Jiang
OffRL
23
34
0
25 Mar 2022
The Efficacy of Pessimism in Asynchronous Q-Learning
Yuling Yan
Gen Li
Yuxin Chen
Jianqing Fan
OffRL
78
40
0
14 Mar 2022
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism
Ming Yin
Yaqi Duan
Mengdi Wang
Yu Wang
OffRL
34
66
0
11 Mar 2022
When is Offline Two-Player Zero-Sum Markov Game Solvable?
Qiwen Cui
S. Du
OffRL
39
29
0
10 Jan 2022
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
Han Zhong
Zhuoran Yang
Zhaoran Wang
Michael I. Jordan
31
30
0
27 Dec 2021
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
Thanh Nguyen-Tang
Sunil R. Gupta
A. Nguyen
Svetha Venkatesh
OffRL
31
29
0
27 Nov 2021
Representation Learning for Online and Offline RL in Low-rank MDPs
Masatoshi Uehara
Xuezhou Zhang
Wen Sun
OffRL
62
127
0
09 Oct 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
29
115
0
19 Aug 2021
Offline Reinforcement Learning as Anti-Exploration
Shideh Rezaeifar
Robert Dadashi
Nino Vieillard
Léonard Hussenot
Olivier Bachem
Olivier Pietquin
M. Geist
OffRL
54
51
0
11 Jun 2021
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings
Ming Yin
Yu Wang
OffRL
32
19
0
13 May 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
222
419
0
16 Feb 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
214
119
0
21 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
343
1,968
0
04 May 2020
Previous
1
2