1910.01708
Cited By
Benchmarking Batch Deep Reinforcement Learning Algorithms
3 October 2019
Scott Fujimoto
Edoardo Conti
Mohammad Ghavamzadeh
Joelle Pineau
OffRL
Papers citing
"Benchmarking Batch Deep Reinforcement Learning Algorithms"
50 / 105 papers shown
Title
The Difficulty of Passive Learning in Deep Reinforcement Learning
Georg Ostrovski
Pablo Samuel Castro
Will Dabney
OffRL
19
57
0
26 Oct 2021
Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning
Siyuan Zhang
Nan Jiang
OffRL
19
39
0
26 Oct 2021
False Correlation Reduction for Offline Reinforcement Learning
Arvindkumar Krishnakumar
Zuyue Fu
Lingxiao Wang
Zhuoran Yang
Chenjia Bai
Tianyi Zhou
Judy Hoffman
Jing Jiang
OffRL
39
9
0
24 Oct 2021
Offline Reinforcement Learning for Autonomous Driving with Safety and Exploration Enhancement
Tianyu Shi
Dong Chen
Kaian Chen
Zhaojian Li
OffRL
34
31
0
13 Oct 2021
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters
Vladislav Kurenkov
Sergey Kolesnikov
OffRL
32
24
0
08 Oct 2021
Greedy UnMixing for Q-Learning in Multi-Agent Reinforcement Learning
Chapman Siu
Jason M. Traish
R. Xu
33
2
0
19 Sep 2021
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation
Haruka Kiyohara
K. Kawakami
Yuta Saito
OffRL
29
12
0
17 Sep 2021
DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning
Daniel Seita
Abhinav Gopal
Zhao Mandi
John F. Canny
OffRL
OnRL
24
0
0
15 Sep 2021
Eden: A Unified Environment Framework for Booming Reinforcement Learning Algorithms
Ruizhi Chen
Xiaoyu Wu
Yansong Pan
Kaizhao Yuan
Ling Li
...
Shaohui Peng
Xishan Zhang
Zidong Du
Qi Guo
Yunji Chen
OffRL
39
3
0
04 Sep 2021
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning
Tianchi Cai
Wenpeng Zhang
Lihong Gu
Xiaodong Zeng
Jinjie Gu
11
0
0
29 Aug 2021
Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings
Shengpu Tang
Jenna Wiens
OffRL
26
78
0
23 Jul 2021
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
34
66
0
08 Jul 2021
Supervised Off-Policy Ranking
Yue Jin
Yue Zhang
Tao Qin
Xudong Zhang
Jian Yuan
Houqiang Li
Tie-Yan Liu
OffRL
32
5
0
03 Jul 2021
Offline RL Without Off-Policy Evaluation
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
OffRL
42
162
0
16 Jun 2021
A Minimalist Approach to Offline Reinforcement Learning
Scott Fujimoto
S. Gu
OffRL
58
785
0
12 Jun 2021
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
Scott Fujimoto
David Meger
Doina Precup
8
16
0
12 Jun 2021
An Offline Risk-aware Policy Selection Method for Bayesian Markov Decision Processes
Giorgio Angelotti
Nicolas Drougard
Caroline Ponzoni Carvalho Chanel
OffRL
21
0
0
27 May 2021
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective
Chenyang Xi
Bo Tang
Jiajun Shen
Xinfu Liu
Zhiyu Li
Xueying Li
OffRL
24
1
0
12 May 2021
Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning
Leandro M. de Lima
R. Krohling
OffRL
40
4
0
20 Apr 2021
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Paria Rashidinejad
Banghua Zhu
Cong Ma
Jiantao Jiao
Stuart J. Russell
OffRL
39
277
0
22 Mar 2021
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
31
37
0
17 Mar 2021
Learning robust driving policies without online exploration
D. Graves
Nhat M. Nguyen
Kimia Hassanzadeh
Jun Jin
Jun Luo
OffRL
11
2
0
15 Mar 2021
RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads
James Gleeson
Srivatsan Krishnan
Moshe Gabel
Vijay Janapa Reddi
Eyal de Lara
Gennady Pekhimenko
OffRL
12
11
0
08 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
140
79
0
01 Feb 2021
POPO: Pessimistic Offline Policy Optimization
Qiang He
Xinwen Hou
OffRL
35
10
0
26 Dec 2020
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation
Diksha Garg
Priyanka Gupta
Pankaj Malhotra
L. Vig
Gautam M. Shroff
OffRL
14
8
0
16 Dec 2020
Offline Reinforcement Learning Hands-On
L. Monier
Jakub Kmec
Alexandre Laterre
Thomas Pierrot
Valentin Courgeau
Olivier Sigaud
Karim Beguir
OffRL
19
9
0
29 Nov 2020
An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Taylor W. Killian
Haoran Zhang
Jayakumar Subramanian
Mehdi Fatemi
Marzyeh Ghassemi
OffRL
22
37
0
23 Nov 2020
Conservative Safety Critics for Exploration
Homanga Bharadhwaj
Aviral Kumar
Nicholas Rhinehart
Sergey Levine
Florian Shkurti
Animesh Garg
OffRL
20
137
0
27 Oct 2020
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs
Aayam Shrestha
Stefan Lee
Prasad Tadepalli
Alan Fern
OffRL
55
23
0
18 Oct 2020
Constrained Model-based Reinforcement Learning with Robust Cross-Entropy Method
Zuxin Liu
Hongyi Zhou
Baiming Chen
Sicheng Zhong
M. Hebert
Ding Zhao
14
11
0
15 Oct 2020
Balancing Constraints and Rewards with Meta-Gradient D4PG
D. A. Calian
D. Mankowitz
Tom Zahavy
Zhongwen Xu
Junhyuk Oh
Nir Levine
Timothy A. Mann
31
25
0
13 Oct 2020
Offline Learning for Planning: A Summary
Giorgio Angelotti
Nicolas Drougard
Caroline Ponzoni Carvalho Chanel
OffRL
14
4
0
05 Oct 2020
Multimodal Safety-Critical Scenarios Generation for Decision-Making Algorithms Evaluation
Wenhao Ding
Baiming Chen
Bo-wen Li
Kim Ji Eun
Ding Zhao
AAML
16
100
0
16 Sep 2020
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
21
83
0
12 Aug 2020
QPLEX: Duplex Dueling Multi-Agent Q-Learning
Jianhao Wang
Zhizhou Ren
Terry Liu
Yang Yu
Chongjie Zhang
OffRL
51
437
0
03 Aug 2020
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay
Scott Fujimoto
David Meger
Doina Precup
21
57
0
12 Jul 2020
Critic Regularized Regression
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
...
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
OffRL
36
319
0
26 Jun 2020
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning
Çağlar Gülçehre
Ziyun Wang
Alexander Novikov
T. Paine
Sergio Gomez Colmenarejo
...
Matthew W. Hoffman
Ofir Nachum
George Tucker
N. Heess
Nando de Freitas
OffRL
27
71
0
24 Jun 2020
Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation
Aaron Sonabend-W
Junwei Lu
Leo Anthony Celi
Tianxi Cai
Peter Szolovits
OffRL
14
24
0
23 Jun 2020
Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies
Tsung-Yen Yang
Justinian P. Rosca
Karthik Narasimhan
Peter J. Ramadge
29
18
0
20 Jun 2020
WD3: Taming the Estimation Bias in Deep Reinforcement Learning
Qiang He
Xinwen Hou
OffRL
10
28
0
18 Jun 2020
Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed Environments
H. Tatavarti
Prashant Doshi
Layton Hayes
TPM
13
2
0
12 Jun 2020
Avoiding Side Effects in Complex Environments
Alexander Matt Turner
Neale Ratzlaff
Prasad Tadepalli
30
34
0
11 Jun 2020
Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework
Chuheng Zhang
Yuanying Cai
Longbo Huang
Jian Li
OffRL
8
1
0
11 Jun 2020
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
T. Matsushima
Hiroki Furuta
Y. Matsuo
Ofir Nachum
S. Gu
OffRL
25
147
0
05 Jun 2020
Causality and Batch Reinforcement Learning: Complementary Approaches To Planning In Unknown Domains
James Bannon
Bradford T. Windsor
Wenbo Song
Tao Li
CML
OOD
OffRL
26
20
0
03 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Playing Minecraft with Behavioural Cloning
Anssi Kanervisto
Janne Karttunen
Ville Hautamaki
28
12
0
07 May 2020
Benchmarking End-to-End Behavioural Cloning on Video Games
Anssi Kanervisto
J. Pussinen
Ville Hautamaki
OffRL
22
24
0
02 Apr 2020