ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.12016
  4. Cited By
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels
v1v2 (latest)

Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels

24 September 2022
Sai Rajeswar
Pietro Mazzaglia
Tim Verbelen
Alexandre Piché
Bart Dhoedt
Rameswar Panda
Alexandre Lacoste
    SSL
ArXiv (abs)PDFHTML

Papers citing "Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels"

30 / 30 papers shown
Title
Does Zero-Shot Reinforcement Learning Exist?
Does Zero-Shot Reinforcement Learning Exist?
Ahmed Touati
Jérémy Rapin
Yann Ollivier
OffRL
99
46
0
29 Sep 2022
DayDreamer: World Models for Physical Robot Learning
DayDreamer: World Models for Physical Robot Learning
Philipp Wu
Alejandro Escontrela
Danijar Hafner
Ken Goldberg
Pieter Abbeel
115
301
0
28 Jun 2022
Multi-Source Transfer Learning for Deep Model-Based Reinforcement
  Learning
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning
Remo Sasso
M. Sabatelli
M. Wiering
88
9
0
28 May 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
195
1,988
0
04 Apr 2022
Temporal Difference Learning for Model Predictive Control
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen
Xiaolong Wang
H. Su
PINNMU
93
254
0
09 Mar 2022
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at
  Scale
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale
Yao Lu
Karol Hausman
Yevgen Chebotar
Mengyuan Yan
Eric Jang
...
Ted Xiao
A. Irpan
Mohi Khansari
Dmitry Kalashnikov
Sergey Levine
OffRL
183
60
0
09 Nov 2021
URLB: Unsupervised Reinforcement Learning Benchmark
URLB: Unsupervised Reinforcement Learning Benchmark
Michael Laskin
Denis Yarats
Hao Liu
Kimin Lee
Albert Zhan
Kevin Lu
Catherine Cang
Lerrel Pinto
Pieter Abbeel
SSLOffRL
82
139
0
28 Oct 2021
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with
  Prototypical Representations
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations
Fei Deng
Ingook Jang
Sungjin Ahn
VLM
71
62
0
27 Oct 2021
APS: Active Pretraining with Successor Features
APS: Active Pretraining with Successor Features
Hao Liu
Pieter Abbeel
100
122
0
31 Aug 2021
Mastering Visual Continuous Control: Improved Data-Augmented
  Reinforcement Learning
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
OffRL
106
351
0
20 Jul 2021
Accelerating Reinforcement Learning with Learned Skill Priors
Accelerating Reinforcement Learning with Learned Skill Priors
Karl Pertsch
Youngwoon Lee
Joseph J. Lim
OffRLOnRL
105
239
0
22 Oct 2020
Mastering Atari with Discrete World Models
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
125
873
0
05 Oct 2020
Learning Off-Policy with Online Planning
Learning Off-Policy with Online Planning
Harshit S. Sikchi
Wenxuan Zhou
David Held
OffRL
115
50
0
23 Aug 2020
Model-Based Offline Planning
Model-Based Offline Planning
Arthur Argenson
Gabriel Dulac-Arnold
OffRL
70
155
0
12 Aug 2020
Dream to Control: Learning Behaviors by Latent Imagination
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
135
1,374
0
03 Dec 2019
Solving Rubik's Cube with a Robot Hand
Solving Rubik's Cube with a Robot Hand
OpenAI
Ilge Akkaya
Marcin Andrychowicz
Maciek Chociej
Ma-teusz Litwin
...
Peter Welinder
Lilian Weng
Qiming Yuan
Wojciech Zaremba
Lei Zhang
ODL
121
1,232
0
16 Oct 2019
When to Trust Your Model: Model-Based Policy Optimization
When to Trust Your Model: Model-Based Policy Optimization
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
OffRL
113
957
0
19 Jun 2019
Self-Supervised Exploration via Disagreement
Self-Supervised Exploration via Disagreement
Deepak Pathak
Dhiraj Gandhi
Abhinav Gupta
SSL
83
384
0
10 Jun 2019
Learning Latent Dynamics for Planning from Pixels
Learning Latent Dynamics for Planning from Pixels
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
BDL
92
1,448
0
12 Nov 2018
Plan Online, Learn Offline: Efficient Learning and Exploration via
  Model-Based Control
Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control
Kendall Lowrey
Aravind Rajeswaran
Sham Kakade
G. Haro
Igor Mordatch
OffRL
73
229
0
05 Nov 2018
Generalization and Regularization in DQN
Generalization and Regularization in DQN
Jesse Farebrother
Marlos C. Machado
Michael Bowling
102
207
0
29 Sep 2018
Learning the Reward Function for a Misspecified Model
Learning the Reward Function for a Misspecified Model
Erik Talvitie
69
10
0
29 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement
  Learning with a Stochastic Actor
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
317
8,420
0
04 Jan 2018
DeepMind Control Suite
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELMLM&RoBDL
150
1,144
0
02 Jan 2018
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRMSSL
125
2,453
0
15 May 2017
Unifying Count-Based Exploration and Intrinsic Motivation
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
184
1,484
0
06 Jun 2016
Continuous control with deep reinforcement learning
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
330
13,289
0
09 Sep 2015
Model Predictive Path Integral Control using Covariance Variable
  Importance Sampling
Model Predictive Path Integral Control using Covariance Variable Importance Sampling
Grady Williams
Andrew Aldrich
Evangelos Theodorou
88
152
0
03 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage
  Estimation
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
133
3,439
0
08 Jun 2015
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence
  Modeling
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
607
12,745
0
11 Dec 2014
1