ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2005.01643
  4. Cited By
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

4 May 2020
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
    OffRL
    GP
ArXivPDFHTML

Papers citing "Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems"

20 / 120 papers shown
Title
Deep Reinforcement Learning for Sepsis Treatment
Deep Reinforcement Learning for Sepsis Treatment
Aniruddh Raghu
Matthieu Komorowski
Imran Ahmed
Leo Anthony Celi
Peter Szolovits
Marzyeh Ghassemi
OffRL
57
172
0
27 Nov 2017
End-to-end Driving via Conditional Imitation Learning
End-to-end Driving via Conditional Imitation Learning
Felipe Codevilla
Matthias Muller
Antonio M. López
V. Koltun
Alexey Dosovitskiy
123
1,066
0
06 Oct 2017
Deep Reinforcement Learning framework for Autonomous Driving
Deep Reinforcement Learning framework for Autonomous Driving
Ahmad El-Sallab
Mohammed Abdou
E. Perot
S. Yogamani
79
969
0
08 Apr 2017
What Uncertainties Do We Need in Bayesian Deep Learning for Computer
  Vision?
What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision?
Alex Kendall
Y. Gal
BDL
OOD
UD
UQCV
PER
340
4,700
0
15 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
95
1,339
0
27 Feb 2017
Interaction Networks for Learning about Objects, Relations and Physics
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE
OCL
PINN
GNN
514
1,407
0
01 Dec 2016
CAD2RL: Real Single-Image Flight without a Single Real Image
CAD2RL: Real Single-Image Flight without a Single Real Image
Fereshteh Sadeghi
Sergey Levine
SSL
300
814
0
13 Nov 2016
Learning to Act by Predicting the Future
Learning to Act by Predicting the Future
Alexey Dosovitskiy
V. Koltun
138
281
0
06 Nov 2016
Why is Posterior Sampling Better than Optimism for Reinforcement
  Learning?
Why is Posterior Sampling Better than Optimism for Reinforcement Learning?
Ian Osband
Benjamin Van Roy
BDL
76
259
0
01 Jul 2016
f-GAN: Training Generative Neural Samplers using Variational Divergence
  Minimization
f-GAN: Training Generative Neural Samplers using Variational Divergence Minimization
Sebastian Nowozin
Botond Cseke
Ryota Tomioka
GAN
133
1,654
0
02 Jun 2016
End to End Learning for Self-Driving Cars
End to End Learning for Self-Driving Cars
Mariusz Bojarski
D. Testa
Daniel Dworakowski
Bernhard Firner
B. Flepp
...
Urs Muller
Jiakai Zhang
Xin Zhang
Jake Zhao
Karol Zieba
SSL
85
4,163
0
25 Apr 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
381
576
0
04 Apr 2016
Value Iteration Networks
Value Iteration Networks
Aviv Tamar
Yi Wu
G. Thomas
Sergey Levine
Pieter Abbeel
73
653
0
09 Feb 2016
Generalized Emphatic Temporal Difference Learning: Bias-Variance
  Analysis
Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis
Assaf Hallak
Aviv Tamar
Rémi Munos
Shie Mannor
OffRL
91
56
0
17 Sep 2015
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
308
13,214
0
09 Sep 2015
On Convergence of Emphatic Temporal-Difference Learning
On Convergence of Emphatic Temporal-Difference Learning
Huizhen Yu
OffRL
56
73
0
08 Jun 2015
An Emphatic Approach to the Problem of Off-policy Temporal-Difference
  Learning
An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning
R. Sutton
A. R. Mahmood
Martha White
82
269
0
14 Mar 2015
Doubly Robust Policy Evaluation and Optimization
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
170
285
0
10 Mar 2015
Semi-Supervised Learning with Deep Generative Models
Semi-Supervised Learning with Deep Generative Models
Diederik P. Kingma
Danilo Jimenez Rezende
S. Mohamed
Max Welling
GAN
SSL
BDL
83
2,738
0
20 Jun 2014
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
114
12,201
0
19 Dec 2013
Previous
123