Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.13239
Cited By
MOPO: Model-based Offline Policy Optimization
27 May 2020
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"MOPO: Model-based Offline Policy Optimization"
50 / 212 papers shown
Title
UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning
Christopher P. Diehl
Timo Sievernich
Martin Krüger
F. Hoffmann
Torsten Bertram
OffRL
34
26
0
22 Nov 2021
Curriculum Offline Imitation Learning
Minghuan Liu
Hanye Zhao
Zhengyu Yang
Jian Shen
Weinan Zhang
Li Zhao
Tie-Yan Liu
OffRL
29
1
0
03 Nov 2021
The Difficulty of Passive Learning in Deep Reinforcement Learning
Georg Ostrovski
Pablo Samuel Castro
Will Dabney
OffRL
24
57
0
26 Oct 2021
False Correlation Reduction for Offline Reinforcement Learning
Arvindkumar Krishnakumar
Zuyue Fu
Lingxiao Wang
Zhuoran Yang
Chenjia Bai
Tianyi Zhou
Judy Hoffman
Jing Jiang
OffRL
39
9
0
24 Oct 2021
Policy Search using Dynamic Mirror Descent MPC for Model Free Off Policy RL
Aarush Gupta
27
0
0
23 Oct 2021
Efficient Robotic Manipulation Through Offline-to-Online Reinforcement Learning and Goal-Aware State Information
Jin Li
Xianyuan Zhan
Zixu Xiao
Guyue Zhou
OffRL
OnRL
29
2
0
21 Oct 2021
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets
J. E. Grigsby
Yanjun Qi
OffRL
31
5
0
10 Oct 2021
Representation Learning for Online and Offline RL in Low-rank MDPs
Masatoshi Uehara
Xuezhou Zhang
Wen Sun
OffRL
64
127
0
09 Oct 2021
Evaluating model-based planning and planner amortization for continuous control
Arunkumar Byravan
Leonard Hasenclever
Piotr Trochim
M. Berk Mirza
Alessandro Davide Ialongo
...
Jost Tobias Springenberg
A. Abdolmaleki
N. Heess
J. Merel
Martin Riedmiller
55
17
0
07 Oct 2021
NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL
Khaled Nakhleh
Santosh Ganji
Ping-Chun Hsieh
I.-Hong Hou
S. Shakkottai
61
38
0
05 Oct 2021
Dropout Q-Functions for Doubly Efficient Reinforcement Learning
Takuya Hiraoka
Takahisa Imagawa
Taisei Hashimoto
Takashi Onishi
Yoshimasa Tsuruoka
13
105
0
05 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
105
265
0
04 Oct 2021
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
143
85
0
22 Sep 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
35
77
0
16 Sep 2021
Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning
Andrea Zanette
Martin J. Wainwright
Emma Brunskill
OffRL
29
115
0
19 Aug 2021
Offline Decentralized Multi-Agent Reinforcement Learning
Jiechuan Jiang
Zongqing Lu
OffRL
30
37
0
04 Aug 2021
Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings
Shengpu Tang
Jenna Wiens
OffRL
26
78
0
23 Jul 2021
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Haoran Xu
Xianyuan Zhan
Xiangyu Zhu
OffRL
16
86
0
19 Jul 2021
Visual Adversarial Imitation Learning using Variational Models
Rafael Rafailov
Tianhe Yu
Aravind Rajeswaran
Chelsea Finn
SSL
28
49
0
16 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
73
79
0
12 Jul 2021
Offline reinforcement learning with uncertainty for treatment strategies in sepsis
Ran Liu
J. Greenstein
J. Fackler
Jules Bergmann
M. Bembea
R. Winslow
OffRL
14
7
0
09 Jul 2021
Evaluating the progress of Deep Reinforcement Learning in the real world: aligning domain-agnostic and domain-specific research
J. Luis
E. Crawley
B. Cameron
OffRL
25
6
0
07 Jul 2021
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Jongmin Lee
Wonseok Jeon
Byung-Jun Lee
J. Pineau
Kee-Eung Kim
OffRL
37
91
0
21 Jun 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
32
29
0
16 Jun 2021
Bellman-consistent Pessimism for Offline Reinforcement Learning
Tengyang Xie
Ching-An Cheng
Nan Jiang
Paul Mineiro
Alekh Agarwal
OffRL
LRM
27
271
0
13 Jun 2021
Offline Reinforcement Learning as Anti-Exploration
Shideh Rezaeifar
Robert Dadashi
Nino Vieillard
Léonard Hussenot
Olivier Bachem
Olivier Pietquin
M. Geist
OffRL
54
51
0
11 Jun 2021
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
Tengyang Xie
Nan Jiang
Huan Wang
Caiming Xiong
Yu Bai
OffRL
OnRL
44
162
0
09 Jun 2021
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
Evgenii Nikishin
Romina Abachi
Rishabh Agarwal
Pierre-Luc Bacon
OffRL
52
35
0
06 Jun 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
71
651
0
03 Jun 2021
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Yue Wu
Shuangfei Zhai
Nitish Srivastava
J. Susskind
Jian Zhang
Ruslan Salakhutdinov
Hanlin Goh
EDL
OffRL
OnRL
21
184
0
17 May 2021
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings
Ming Yin
Yu Wang
OffRL
32
19
0
13 May 2021
Identity Concealment Games: How I Learned to Stop Revealing and Love the Coincidences
Mustafa O. Karabag
Melkior Ornik
Ufuk Topcu
42
3
0
12 May 2021
Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective
Florin Gogianu
Tudor Berariu
Mihaela Rosca
Claudia Clopath
L. Buşoniu
Razvan Pascanu
24
54
0
11 May 2021
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
16
45
0
28 Apr 2021
Safe Chance Constrained Reinforcement Learning for Batch Process Control
M. Mowbray
Panagiotis Petsagkourakis
Ehecatl Antonio del Rio Chanona
Dongda Zhang
OffRL
37
34
0
23 Apr 2021
Online and Offline Reinforcement Learning by Planning with a Learned Model
Julian Schrittwieser
Thomas Hubert
Amol Mandhane
M. Barekatain
Ioannis Antonoglou
David Silver
OffRL
31
114
0
13 Apr 2021
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
31
37
0
17 Mar 2021
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning
Samarth Sinha
Ajay Mandlekar
Animesh Garg
OffRL
26
107
0
10 Mar 2021
Instabilities of Offline RL with Pre-Trained Neural Representation
Ruosong Wang
Yifan Wu
Ruslan Salakhutdinov
Sham Kakade
OffRL
22
42
0
08 Mar 2021
Offline Reinforcement Learning with Pseudometric Learning
Robert Dadashi
Shideh Rezaeifar
Nino Vieillard
Léonard Hussenot
Olivier Pietquin
M. Geist
OffRL
39
40
0
02 Mar 2021
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning
Xianyuan Zhan
Haoran Xu
Yueying Zhang
Xiangyu Zhu
Honglei Yin
Yu Zheng
OffRL
AI4CE
40
68
0
23 Feb 2021
Dealing with Non-Stationarity in MARL via Trust-Region Decomposition
Wenhao Li
Xiangfeng Wang
Bo Jin
Junjie Sheng
H. Zha
31
7
0
21 Feb 2021
Model-Invariant State Abstractions for Model-Based Reinforcement Learning
Manan Tomar
Amy Zhang
Roberto Calandra
Matthew E. Taylor
Joelle Pineau
24
24
0
19 Feb 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
222
419
0
16 Feb 2021
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators
Anish Agarwal
Abdullah Alomar
Varkey Alumootil
Devavrat Shah
Dennis Shen
Zhi Xu
Cindy Yang
OffRL
18
18
0
13 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
140
80
0
01 Feb 2021
Is Pessimism Provably Efficient for Offline RL?
Ying Jin
Zhuoran Yang
Zhaoran Wang
OffRL
27
350
0
30 Dec 2020
SemiNLL: A Framework of Noisy-Label Learning by Semi-Supervised Learning
Zhuowei Wang
Jing Jiang
Bo Han
Lei Feng
Bo An
Gang Niu
Guodong Long
NoLa
33
17
0
02 Dec 2020
PLAS: Latent Action Space for Offline Reinforcement Learning
Wenxuan Zhou
Sujay Bajracharya
David Held
OffRL
38
158
0
14 Nov 2020
Reliable Off-policy Evaluation for Reinforcement Learning
Jie Wang
Rui Gao
H. Zha
OffRL
22
11
0
08 Nov 2020
Previous
1
2
3
4
5
Next