Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.00690
Cited By
DeepMind Control Suite
2 January 2018
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DeepMind Control Suite"
50 / 791 papers shown
Title
Representation Matters: Improving Perception and Exploration for Robotics
Markus Wulfmeier
Arunkumar Byravan
Tim Hertweck
I. Higgins
Ankush Gupta
...
Malcolm Reynolds
Denis Teplyashin
Roland Hafner
Thomas Lampe
Martin Riedmiller
42
15
0
03 Nov 2020
Sim-to-Real Learning of All Common Bipedal Gaits via Periodic Reward Composition
J. Siekmann
Yesh Godse
Alan Fern
J. Hurst
21
153
0
02 Nov 2020
Observation Space Matters: Benchmark and Optimization Algorithm
J. Kim
Sehoon Ha
OOD
OffRL
24
11
0
02 Nov 2020
Deep Reactive Planning in Dynamic Environments
Keita Ota
Devesh K. Jha
T. Onishi
Asako Kanezaki
Yusuke Yoshiyasu
Y. Sasaki
T. Mariyama
D. Nikovski
33
6
0
31 Oct 2020
Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles
Tim Seyde
Wilko Schwarting
S. Karaman
Daniela Rus
31
14
0
27 Oct 2020
Behavior Priors for Efficient Reinforcement Learning
Dhruva Tirumala
Alexandre Galashov
Hyeonwoo Noh
Leonard Hasenclever
Razvan Pascanu
...
Guillaume Desjardins
Wojciech M. Czarnecki
Arun Ahuja
Yee Whye Teh
N. Heess
37
39
0
27 Oct 2020
Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls
Jeongho Kim
Jaeuk Shin
Insoon Yang
15
33
0
27 Oct 2020
How to Make Deep RL Work in Practice
Nirnai Rao
Elie Aljalbout
Axel Sauer
Sami Haddadin
OffRL
29
11
0
25 Oct 2020
CLOUD: Contrastive Learning of Unsupervised Dynamics
Jianren Wang
Yujie Lu
Hang Zhao
28
5
0
23 Oct 2020
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning
Guangxiang Zhu
Minghao Zhang
Honglak Lee
Chongjie Zhang
OffRL
73
17
0
23 Oct 2020
Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing
A. Delarue
Ross Anderson
Christian Tjandraatmadja
35
94
0
22 Oct 2020
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
D. A. Calian
Rae Jeong
Cosmin Paduraru
N. Heess
Sumanth Dathathri
Martin Riedmiller
Timothy A. Mann
26
12
0
20 Oct 2020
Measuring Visual Generalization in Continuous Control from Pixels
J. E. Grigsby
Yanjun Qi
14
25
0
13 Oct 2020
Balancing Constraints and Rewards with Meta-Gradient D4PG
D. A. Calian
D. Mankowitz
Tom Zahavy
Zhongwen Xu
Junhyuk Oh
Nir Levine
Timothy A. Mann
39
25
0
13 Oct 2020
Local Search for Policy Iteration in Continuous Control
Jost Tobias Springenberg
N. Heess
D. Mankowitz
J. Merel
Arunkumar Byravan
...
Julian Schrittwieser
Yuval Tassa
J. Buchli
Dan Belov
Martin Riedmiller
OffRL
22
15
0
12 Oct 2020
CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning
Ossama Ahmed
Frederik Trauble
Anirudh Goyal
Alexander Neitz
Yoshua Bengio
Bernhard Schölkopf
M. Wuthrich
Stefan Bauer
CML
39
120
0
08 Oct 2020
Agent Environment Cycle Games
J. K. Terry
Nathaniel Grammel
Benjamin Black
Ananth Hari
Caroline Horsch
L. Santos
29
7
0
28 Sep 2020
Decoupling Representation Learning from Reinforcement Learning
Adam Stooke
Kimin Lee
Pieter Abbeel
Michael Laskin
SSL
DRL
288
341
0
14 Sep 2020
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning
M. Berk Mirza
Andrew Jaegle
Jonathan J. Hunt
A. Guez
S. Tunyasuvunakool
...
Peter Karkus
S. Racanière
Lars Buesing
Timothy Lillicrap
N. Heess
AI4CE
31
12
0
11 Sep 2020
AllenAct: A Framework for Embodied AI Research
Luca Weihs
Jordi Salvador
Klemen Kotar
Unnat Jain
Kuo-Hao Zeng
Roozbeh Mottaghi
Aniruddha Kembhavi
LM&Ro
AI4CE
27
71
0
28 Aug 2020
Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation
Jeevan Devaranjan
Amlan Kar
Sanja Fidler
28
88
0
20 Aug 2020
Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey
Aske Plaat
W. Kosters
Mike Preuss
BDL
OffRL
21
17
0
11 Aug 2020
TriFinger: An Open-Source Robot for Learning Dexterity
Manuel Wüthrich
Felix Widmaier
F. Grimminger
J. Akpo
S. Joshi
...
Julian Viereck
M. Naveau
Ludovic Righetti
Bernhard Schölkopf
Stefan Bauer
29
72
0
08 Aug 2020
Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard
Alex Church
John Lloyd
R. Hadsell
Nathan Lepora
26
31
0
06 Aug 2020
Contrastive Variational Reinforcement Learning for Complex Observations
Xiao Ma
Siwei Chen
David Hsu
W. Lee
OffRL
27
23
0
06 Aug 2020
Learning to Drive (L2D) as a Low-Cost Benchmark for Real-World Reinforcement Learning
A. Viitala
Rinu Boney
Yi Zhao
Alexander Ilin
Arno Solin
OffRL
22
7
0
03 Aug 2020
Dreaming: Model-based Reinforcement Learning by Latent Imagination without Reconstruction
Masashi Okada
T. Taniguchi
OffRL
39
84
0
29 Jul 2020
Weak Human Preference Supervision For Deep Reinforcement Learning
Zehong Cao
Kaichiu Wong
Chin-Teng Lin
25
5
0
25 Jul 2020
Maximum Mutation Reinforcement Learning for Scalable Control
Karush Suri
Xiaolong Shi
Konstantinos N. Plataniotis
Y. Lawryshyn
25
4
0
24 Jul 2020
Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill
Florent Altché
Yunhao Tang
Thomas Hubert
Michal Valko
Ioannis Antonoglou
Rémi Munos
27
73
0
24 Jul 2020
Predictive Information Accelerates Learning in RL
Kuang-Huei Lee
Ian S. Fischer
Anthony Z. Liu
Yijie Guo
Honglak Lee
John F. Canny
S. Guadarrama
23
72
0
24 Jul 2020
Probabilistic Active Meta-Learning
Jean Kaddour
Steindór Sæmundsson
M. Deisenroth
27
34
0
17 Jul 2020
Learning Robust State Abstractions for Hidden-Parameter Block MDPs
Amy Zhang
Shagun Sodhani
Khimya Khetarpal
Joelle Pineau
31
5
0
14 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
41
312
0
12 Jul 2020
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Kimin Lee
Michael Laskin
A. Srinivas
Pieter Abbeel
OffRL
25
199
0
09 Jul 2020
Self-Supervised Policy Adaptation during Deployment
Nicklas Hansen
Rishabh Jangir
Yu Sun
Guillem Alenyà
Pieter Abbeel
Alexei A. Efros
Lerrel Pinto
Xiaolong Wang
41
159
0
08 Jul 2020
robo-gym -- An Open Source Toolkit for Distributed Deep Reinforcement Learning on Real and Simulated Robots
M. Lucchi
Friedemann Zindler
Stephan Mühlbacher-Karrer
Horst Pichler
OffRL
30
29
0
06 Jul 2020
Debiased Contrastive Learning
Ching-Yao Chuang
Joshua Robinson
Yen-Chen Lin
Antonio Torralba
Stefanie Jegelka
SSL
19
553
0
01 Jul 2020
Distributed Uplink Beamforming in Cell-Free Networks Using Deep Reinforcement Learning
Firas Fredj
Yasser F. Al-Eryani
S. Maghsudi
Mohamed Akrout
Ekram Hossain
OffRL
11
29
0
26 Jun 2020
Critic Regularized Regression
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
...
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
OffRL
36
319
0
26 Jun 2020
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning
Çağlar Gülçehre
Ziyun Wang
Alexander Novikov
T. Paine
Sergio Gomez Colmenarejo
...
Matthew W. Hoffman
Ofir Nachum
George Tucker
N. Heess
Nando de Freitas
OffRL
35
71
0
24 Jun 2020
dm_control: Software and Tasks for Continuous Control
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
...
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
42
397
0
22 Jun 2020
Learning Invariant Representations for Reinforcement Learning without Reconstruction
Amy Zhang
R. McAllister
Roberto Calandra
Y. Gal
Sergey Levine
OOD
SSL
60
464
0
18 Jun 2020
Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control
Rika Antonova
Maksim Maydanskiy
Danica Kragic
Sam Devlin
Katja Hofmann
24
8
0
15 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
33
82
0
15 Jun 2020
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Yunhao Tang
K. Choromanski
OffRL
11
13
0
13 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
Adaptive Checkpoint Adjoint Method for Gradient Estimation in Neural ODE
Juntang Zhuang
Nicha Dvornek
Xiaoxiao Li
S. Tatikonda
X. Papademetris
James Duncan
BDL
66
110
0
03 Jun 2020
Temporally-Extended ε-Greedy Exploration
Will Dabney
Georg Ostrovski
André Barreto
22
34
0
02 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
65
225
0
01 Jun 2020
Previous
1
2
3
...
12
13
14
15
16
Next