arXiv:2107.11003
Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings
23 July 2021
Shengpu Tang
Jenna Wiens
OffRL
Github (9★)
Papers citing
"Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings"
Dual Alignment Maximin Optimization for Offline Model-based RL
Chi Zhou
Wang Luo
Haoran Li
Congying Han
Tiande Guo
Zicheng Zhang
OffRL
122
0
0
02 Feb 2025
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
Bernhard Schölkopf
Gunnar Rätsch
Giorgia Ramponi
OffRL
104
1
0
26 Jun 2024
Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate
Yifan Lin
Yuhao Wang
Enlu Zhou
108
0
0
01 Mar 2024
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
78
103
0
30 Mar 2021
An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Taylor W. Killian
Haoran Zhang
Jayakumar Subramanian
Mehdi Fatemi
Marzyeh Ghassemi
OffRL
91
39
0
23 Nov 2020
Batch Value-function Approximation with Only Realizability
Tengyang Xie
Nan Jiang
OffRL
382
121
0
11 Aug 2020
Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies
Shengpu Tang
Aditya Modi
Michael Sjoding
Jenna Wiens
OffRL
52
26
0
24 Jul 2020
Hyperparameter Selection for Offline Reinforcement Learning
T. Paine
Cosmin Paduraru
Andrea Michi
Çağlar Gülçehre
Konrad Zolna
Alexander Novikov
Ziyun Wang
Nando de Freitas
GP
OffRL
179
148
0
17 Jul 2020
Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting
Ilja Kuzborskij
Claire Vernade
András György
Csaba Szepesvári
OffRL
55
47
0
18 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
140
1,831
0
08 Jun 2020
MOPO: Model-based Offline Policy Optimization
Tianhe Yu
G. Thomas
Lantao Yu
Stefano Ermon
James Zou
Sergey Levine
Chelsea Finn
Tengyu Ma
OffRL
76
772
0
27 May 2020
MOReL: Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
96
676
0
12 May 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
561
2,040
0
04 May 2020
A Game Theoretic Framework for Model Based Reinforcement Learning
Aravind Rajeswaran
Igor Mordatch
Vikash Kumar
OffRL
49
127
0
16 Apr 2020
GenDICE: Generalized Offline Estimation of Stationary Values
Ruiyi Zhang
Bo Dai
Lihong Li
Dale Schuurmans
OffRL
192
174
0
21 Feb 2020
POPCORN: Partially Observed Prediction COnstrained ReiNforcement Learning
Joseph D. Futoma
M. C. Hughes
Finale Doshi-Velez
OffRL
71
50
0
13 Jan 2020
Identifying Distinct, Effective Treatments for Acute Hypotension with SODA-RL: Safely Optimized Diverse Accurate Reinforcement Learning
Joseph D. Futoma
M. A. Masood
Finale Doshi-Velez
OffRL
OOD
69
12
0
09 Jan 2020
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning
Cameron Voloshin
Hoang Minh Le
Nan Jiang
Yisong Yue
OffRL
63
154
0
15 Nov 2019
Minimax Weight and Q-Function Learning for Off-Policy Evaluation
Masatoshi Uehara
Jiawei Huang
Nan Jiang
OffRL
151
187
0
28 Oct 2019
Infinite-horizon Off-Policy Policy Evaluation with Multiple Behavior Policies
Xinyun Chen
Lu Wang
Yizhe Hang
Heng Ge
H. Zha
OffRL
88
5
0
10 Oct 2019
Benchmarking Batch Deep Reinforcement Learning Algorithms
Shih-Han Chou
Wen-Yen Chang
W. Hsu
Jianlong Fu
OffRL
63
185
0
03 Oct 2019
Reinforcement Learning in Healthcare: A Survey
Chao Yu
Jiming Liu
S. Nemati
LM&MA
OffRL
183
570
0
22 Aug 2019
Off-Policy Evaluation via Off-Policy Classification
A. Irpan
Kanishka Rao
Konstantinos Bousmalis
Chris Harris
Julian Ibarz
Sergey Levine
OffRL
53
50
0
04 Jun 2019
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar
Justin Fu
George Tucker
Sergey Levine
OffRL
OnRL
134
1,066
0
03 Jun 2019
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models
Michael Oberst
David Sontag
CML
OffRL
71
173
0
14 May 2019
Batch Policy Learning under Constraints
Hoang Minh Le
Cameron Voloshin
Yisong Yue
OffRL
60
333
0
20 Mar 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
234
1,624
0
07 Dec 2018
Breaking the Curse of Horizon: Infinite-Horizon Off-Policy Estimation
Qiang Liu
Lihong Li
Ziyang Tang
Dengyong Zhou
OffRL
158
356
0
29 Oct 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
...
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
128
1,470
0
27 Jun 2018
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Josiah P. Hanna
S. Niekum
Peter Stone
OffRL
46
68
0
04 Jun 2018
Evaluating Reinforcement Learning Algorithms in Observational Health Settings
Omer Gottesman
Fredrik D. Johansson
Joshua Meier
Jack Dent
Donghun Lee
...
Matthieu Komorowski
A. Faisal
Leo Anthony Celi
David Sontag
Finale Doshi-Velez
OOD
OffRL
44
134
0
31 May 2018
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
113
730
0
04 May 2018
Safe Policy Improvement with Baseline Bootstrapping
Romain Laroche
P. Trichelair
Rémi Tachet des Combes
OffRL
64
201
0
19 Dec 2017
Deep Reinforcement Learning for Sepsis Treatment
Aniruddh Raghu
Matthieu Komorowski
Imran Ahmed
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
57
172
0
27 Nov 2017
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
121
1,961
0
19 Sep 2017
Continuous State-Space Models for Optimal Sepsis Treatment - a Deep Reinforcement Learning Approach
Aniruddh Raghu
Matthieu Komorowski
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
40
193
0
23 May 2017
A Reinforcement Learning Approach to Weaning of Mechanical Ventilation in Intensive Care Units
Niranjani Prasad
Li-Fang Cheng
C. Chivers
Michael Draugelis
Barbara E. Engelhardt
OffRL
58
168
0
20 Apr 2017
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
432
576
0
04 Apr 2016
Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization
Lisha Li
Kevin Jamieson
Giulia DeSalvo
Afshin Rostamizadeh
Ameet Talwalkar
227
2,333
0
21 Mar 2016
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
214
624
0
11 Nov 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.0K
150,260
0
22 Dec 2014
Practical Bayesian Optimization of Machine Learning Algorithms
Jasper Snoek
Hugo Larochelle
Ryan P. Adams
362
7,954
0
13 Jun 2012
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
471
2,954
0
28 Feb 2010