Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels

24 September 2022

Papers citing "Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels"

Does Zero-Shot Reinforcement Learning Exist? Ahmed Touati Jérémy Rapin Yann Ollivier OffRL 99 46 0 29 Sep 2022
DayDreamer: World Models for Physical Robot Learning Philipp Wu Alejandro Escontrela Danijar Hafner Ken Goldberg Pieter Abbeel 115 301 0 28 Jun 2022
Multi-Source Transfer Learning for Deep Model-Based Reinforcement Learning Remo Sasso M. Sabatelli M. Wiering 88 9 0 28 May 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances Michael Ahn Anthony Brohan Noah Brown Yevgen Chebotar Omar Cortes ... Ted Xiao Peng Xu Sichun Xu Mengyuan Yan Andy Zeng LM&Ro 195 1,988 0 04 Apr 2022
Temporal Difference Learning for Model Predictive Control Nicklas Hansen Xiaolong Wang H. Su PINN MU 93 254 0 09 Mar 2022
AW-Opt: Learning Robotic Skills with Imitation and Reinforcement at Scale Yao Lu Karol Hausman Yevgen Chebotar Mengyuan Yan Eric Jang ... Ted Xiao A. Irpan Mohi Khansari Dmitry Kalashnikov Sergey Levine OffRL 183 60 0 09 Nov 2021
URLB: Unsupervised Reinforcement Learning Benchmark Michael Laskin Denis Yarats Hao Liu Kimin Lee Albert Zhan Kevin Lu Catherine Cang Lerrel Pinto Pieter Abbeel SSL OffRL 82 139 0 28 Oct 2021
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations Fei Deng Ingook Jang Sungjin Ahn VLM 71 62 0 27 Oct 2021
APS: Active Pretraining with Successor Features Hao Liu Pieter Abbeel 100 122 0 31 Aug 2021
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning Denis Yarats Rob Fergus A. Lazaric Lerrel Pinto OffRL 106 351 0 20 Jul 2021
Accelerating Reinforcement Learning with Learned Skill Priors Karl Pertsch Youngwoon Lee Joseph J. Lim OffRL OnRL 105 239 0 22 Oct 2020
Mastering Atari with Discrete World Models Danijar Hafner Timothy Lillicrap Mohammad Norouzi Jimmy Ba DRL 125 873 0 05 Oct 2020
Learning Off-Policy with Online Planning Harshit S. Sikchi Wenxuan Zhou David Held OffRL 115 50 0 23 Aug 2020
Model-Based Offline Planning Arthur Argenson Gabriel Dulac-Arnold OffRL 70 155 0 12 Aug 2020
Dream to Control: Learning Behaviors by Latent Imagination Danijar Hafner Timothy Lillicrap Jimmy Ba Mohammad Norouzi VLM 135 1,374 0 03 Dec 2019
Solving Rubik's Cube with a Robot Hand OpenAI Ilge Akkaya Marcin Andrychowicz Maciek Chociej Mateusz Litwin ... Peter Welinder Lilian Weng Qiming Yuan Wojciech Zaremba Lei Zhang ODL 121 1,232 0 16 Oct 2019
When to Trust Your Model: Model-Based Policy Optimization Michael Janner Justin Fu Marvin Zhang Sergey Levine OffRL 113 957 0 19 Jun 2019
Self-Supervised Exploration via Disagreement Deepak Pathak Dhiraj Gandhi Abhinav Gupta SSL 83 384 0 10 Jun 2019
Learning Latent Dynamics for Planning from Pixels Danijar Hafner Timothy Lillicrap Ian S. Fischer Ruben Villegas David R Ha Honglak Lee James Davidson BDL 92 1,448 0 12 Nov 2018
Plan Online, Learn Offline: Efficient Learning and Exploration via Model-Based Control Kendall Lowrey Aravind Rajeswaran Sham Kakade G. Haro Igor Mordatch OffRL 73 229 0 05 Nov 2018
Generalization and Regularization in DQN Jesse Farebrother Marlos C. Machado Michael Bowling 102 207 0 29 Sep 2018
Learning the Reward Function for a Misspecified Model Erik Talvitie 69 10 0 29 Jan 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Tuomas Haarnoja Aurick Zhou Pieter Abbeel Sergey Levine 317 8,420 0 04 Jan 2018
DeepMind Control Suite Yuval Tassa Yotam Doron Alistair Muldal Tom Erez Yazhe Li ... A. Abdolmaleki J. Merel Andrew Lefrancq Timothy Lillicrap Martin Riedmiller ELM LM&Ro BDL 150 1,144 0 02 Jan 2018
Curiosity-driven Exploration by Self-supervised Prediction Deepak Pathak Pulkit Agrawal Alexei A. Efros Trevor Darrell LRM SSL 125 2,453 0 15 May 2017
Unifying Count-Based Exploration and Intrinsic Motivation Marc G. Bellemare S. Srinivasan Georg Ostrovski Tom Schaul D. Saxton Rémi Munos 184 1,484 0 06 Jun 2016
Continuous control with deep reinforcement learning Timothy Lillicrap Jonathan J. Hunt Alexander Pritzel N. Heess Tom Erez Yuval Tassa David Silver Daan Wierstra 330 13,289 0 09 Sep 2015
Model Predictive Path Integral Control using Covariance Variable Importance Sampling Grady Williams Andrew Aldrich Evangelos Theodorou 88 152 0 03 Sep 2015
High-Dimensional Continuous Control Using Generalized Advantage Estimation John Schulman Philipp Moritz Sergey Levine Michael I. Jordan Pieter Abbeel OffRL 133 3,439 0 08 Jun 2015
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling Junyoung Chung Çağlar Gülçehre Kyunghyun Cho Yoshua Bengio 607 12,745 0 11 Dec 2014