ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.13888
  4. Cited By
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning

RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning

24 June 2020
Çağlar Gülçehre
Ziyun Wang
Alexander Novikov
T. Paine
Sergio Gomez Colmenarejo
Konrad Zolna
Rishabh Agarwal
J. Merel
D. Mankowitz
Cosmin Paduraru
Gabriel Dulac-Arnold
Jingkai Li
Mohammad Norouzi
Matthew W. Hoffman
Ofir Nachum
George Tucker
N. Heess
Nando de Freitas
    OffRL
ArXivPDFHTML

Papers citing "RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning"

28 / 28 papers shown
Title
Transductive Reward Inference on Graph
Transductive Reward Inference on Graph
B. Qu
Xiaofeng Cao
Qing Guo
Yi Chang
Ivor W. Tsang
Chengqi Zhang
OffRL
38
0
0
06 Feb 2024
Robotic Manipulation Datasets for Offline Compositional Reinforcement
  Learning
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
Marcel Hussing
Jorge Armando Mendez Mendez
Anisha Singrodia
Cassandra Kent
Eric Eaton
OffRL
35
5
0
13 Jul 2023
Offline Q-Learning on Diverse Multi-Task Data Both Scales And
  Generalizes
Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Aviral Kumar
Rishabh Agarwal
Xinyang Geng
George Tucker
Sergey Levine
OffRL
44
48
0
28 Nov 2022
Federated Offline Reinforcement Learning
Federated Offline Reinforcement Learning
D. Zhou
Yufeng Zhang
Aaron Sonabend-W
Zhaoran Wang
Junwei Lu
Tianxi Cai
OffRL
40
13
0
11 Jun 2022
Challenges and Opportunities in Offline Reinforcement Learning from
  Visual Observations
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
Cong Lu
Philip J. Ball
Tim G. J. Rudner
Jack Parker-Holder
Michael A. Osborne
Yee Whye Teh
OffRL
32
52
0
09 Jun 2022
Dataset Distillation by Matching Training Trajectories
Dataset Distillation by Matching Training Trajectories
George Cazenavette
Tongzhou Wang
Antonio Torralba
Alexei A. Efros
Jun-Yan Zhu
FedML
DD
84
368
0
22 Mar 2022
DR3: Value-Based Deep Reinforcement Learning Requires Explicit
  Regularization
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Aviral Kumar
Rishabh Agarwal
Tengyu Ma
Aaron Courville
George Tucker
Sergey Levine
OffRL
31
65
0
09 Dec 2021
The Surprising Effectiveness of Representation Learning for Visual
  Imitation
The Surprising Effectiveness of Representation Learning for Visual Imitation
Jyothish Pari
Nur Muhammad (Mahi) Shafiullah
Sridhar Pandian Arunachalam
Lerrel Pinto
SSL
25
161
0
02 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
39
4
0
29 Nov 2021
Measuring Data Quality for Dataset Selection in Offline Reinforcement
  Learning
Measuring Data Quality for Dataset Selection in Offline Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
27
6
0
26 Nov 2021
The Impact of Data Distribution on Q-learning with Function
  Approximation
The Impact of Data Distribution on Q-learning with Function Approximation
Pedro P. Santos
Diogo S. Carvalho
Alberto Sardinha
Francisco S. Melo
OffRL
19
2
0
23 Nov 2021
d3rlpy: An Offline Deep Reinforcement Learning Library
d3rlpy: An Offline Deep Reinforcement Learning Library
Takuma Seno
M. Imai
OffRL
GP
65
100
0
06 Nov 2021
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement
  Learning
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Sabela Ramos
Sertan Girgin
Léonard Hussenot
Damien Vincent
Hanna Yakubovich
...
Piotr Stańczyk
Raphaël Marinier
Jeremiah Harmsen
Olivier Pietquin
Nikola Momchev
OffRL
38
24
0
04 Nov 2021
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
TRAIL: Near-Optimal Imitation Learning with Suboptimal Data
Mengjiao Yang
Sergey Levine
Ofir Nachum
OffRL
41
42
0
27 Oct 2021
Continuous-Time Fitted Value Iteration for Robust Policies
Continuous-Time Fitted Value Iteration for Robust Policies
M. Lutter
Boris Belousov
Shie Mannor
Dieter Fox
Animesh Garg
Jan Peters
10
9
0
05 Oct 2021
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Sergey Levine
Chelsea Finn
OffRL
35
77
0
16 Sep 2021
Implicit Behavioral Cloning
Implicit Behavioral Cloning
Peter R. Florence
Corey Lynch
Andy Zeng
Oscar Ramirez
Ayzaan Wahid
Laura Downs
Adrian S. Wong
Johnny Lee
Igor Mordatch
Jonathan Tompson
OffRL
74
372
0
01 Sep 2021
Offline RL Without Off-Policy Evaluation
Offline RL Without Off-Policy Evaluation
David Brandfonbrener
William F. Whitney
Rajesh Ranganath
Joan Bruna
OffRL
42
162
0
16 Jun 2021
Autoregressive Dynamics Models for Offline Policy Evaluation and
  Optimization
Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Michael Ruogu Zhang
T. Paine
Ofir Nachum
Cosmin Paduraru
George Tucker
Ziyun Wang
Mohammad Norouzi
OffRL
14
45
0
28 Apr 2021
Online and Offline Reinforcement Learning by Planning with a Learned
  Model
Online and Offline Reinforcement Learning by Planning with a Learned Model
Julian Schrittwieser
Thomas Hubert
Amol Mandhane
M. Barekatain
Ioannis Antonoglou
David Silver
OffRL
31
114
0
13 Apr 2021
Benchmarks for Deep Off-Policy Evaluation
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
35
100
0
30 Mar 2021
Regularized Behavior Value Estimation
Regularized Behavior Value Estimation
Çağlar Gülçehre
Sergio Gomez Colmenarejo
Ziyun Wang
Jakub Sygnowski
T. Paine
Konrad Zolna
Yutian Chen
Matthew W. Hoffman
Razvan Pascanu
Nando de Freitas
OffRL
31
37
0
17 Mar 2021
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement
  Learning
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning
Samarth Sinha
Ajay Mandlekar
Animesh Garg
OffRL
26
106
0
10 Mar 2021
Offline Learning from Demonstrations and Unlabeled Experience
Offline Learning from Demonstrations and Unlabeled Experience
Konrad Zolna
Alexander Novikov
Ksenia Konyushkova
Çağlar Gülçehre
Ziyun Wang
Y. Aytar
Misha Denil
Nando de Freitas
Scott E. Reed
SSL
OffRL
32
66
0
27 Nov 2020
REALab: An Embedded Perspective on Tampering
REALab: An Embedded Perspective on Tampering
Ramana Kumar
J. Uesato
Richard Ngo
Tom Everitt
Victoria Krakovna
Shane Legg
30
10
0
17 Nov 2020
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs
Aayam Shrestha
Stefan Lee
Prasad Tadepalli
Alan Fern
OffRL
55
23
0
18 Oct 2020
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
143
928
0
07 Jul 2017
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in
  Natural Images
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images
Andreas Veit
Tomas Matera
Lukás Neumann
Jirí Matas
Serge J. Belongie
191
515
0
26 Jan 2016
1