Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2005.01643
Cited By
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
4 May 2020
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems"
20 / 120 papers shown
Title
Deep Reinforcement Learning for Sepsis Treatment
Aniruddh Raghu
Matthieu Komorowski
Imran Ahmed
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
57
172
0
27 Nov 2017
End-to-end Driving via Conditional Imitation Learning
Felipe Codevilla
Matthias Muller
Antonio M. López
V. Koltun
Alexey Dosovitskiy
123
1,066
0
06 Oct 2017
Deep Reinforcement Learning framework for Autonomous Driving
Ahmad El-Sallab
Mohammed Abdou
E. Perot
S. Yogamani
79
969
0
08 Apr 2017
What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision?
Alex Kendall
Y. Gal
BDL
OOD
UD
UQCV
PER
340
4,700
0
15 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
95
1,339
0
27 Feb 2017
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE
OCL
PINN
GNN
514
1,407
0
01 Dec 2016
CAD2RL: Real Single-Image Flight without a Single Real Image
Fereshteh Sadeghi
Sergey Levine
SSL
300
814
0
13 Nov 2016
Learning to Act by Predicting the Future
Alexey Dosovitskiy
V. Koltun
138
281
0
06 Nov 2016
Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
Ian Osband
Benjamin Van Roy
BDL
76
259
0
01 Jul 2016
f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization
Sebastian Nowozin
Botond Cseke
Ryota Tomioka
GAN
133
1,654
0
02 Jun 2016
End to End Learning for Self-Driving Cars
Mariusz Bojarski
D. Testa
Daniel Dworakowski
Bernhard Firner
B. Flepp
...
Urs Muller
Jiakai Zhang
Xin Zhang
Jake Zhao
Karol Zieba
SSL
85
4,163
0
25 Apr 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
381
576
0
04 Apr 2016
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
73
653
0
09 Feb 2016
Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis
Assaf Hallak
Aviv Tamar
Rémi Munos
Shie Mannor
OffRL
91
56
0
17 Sep 2015
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
308
13,214
0
09 Sep 2015
On Convergence of Emphatic Temporal-Difference Learning
Huizhen Yu
OffRL
56
73
0
08 Jun 2015
An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning
R. Sutton
A. R. Mahmood
Martha White
82
269
0
14 Mar 2015
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
170
285
0
10 Mar 2015
Semi-Supervised Learning with Deep Generative Models
Diederik P. Kingma
Danilo Jimenez Rezende
S. Mohamed
Max Welling
GAN
SSL
BDL
83
2,738
0
20 Jun 2014
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
114
12,201
0
19 Dec 2013
Previous
1
2
3