Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.00690
Cited By
DeepMind Control Suite
2 January 2018
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DeepMind Control Suite"
50 / 791 papers shown
Title
Mutual Information Maximization for Robust Plannable Representations
Yiming Ding
I. Clavera
Pieter Abbeel
19
15
0
16 May 2020
A Distributional View on Multi-Objective Policy Optimization
A. Abdolmaleki
Sandy H. Huang
Leonard Hasenclever
Michael Neunert
H. F. Song
Martina Zambelli
M. Martins
N. Heess
R. Hadsell
Martin Riedmiller
26
74
0
15 May 2020
Planning to Explore via Self-Supervised World Models
Ramanan Sekar
Oleh Rybkin
Kostas Daniilidis
Pieter Abbeel
Danijar Hafner
Deepak Pathak
SSL
33
399
0
12 May 2020
Improving Robustness via Risk Averse Distributional Reinforcement Learning
Rahul Singh
Qinsheng Zhang
Yongxin Chen
OOD
15
43
0
01 May 2020
Reinforcement Learning with Augmented Data
Michael Laskin
Kimin Lee
Adam Stooke
Lerrel Pinto
Pieter Abbeel
A. Srinivas
OffRL
20
648
0
30 Apr 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDL
DRL
SSL
43
140
0
30 Apr 2020
Actor-Critic Reinforcement Learning for Control with Stability Guarantee
Minghao Han
Lixian Zhang
Jun Wang
Wei Pan
16
106
0
29 Apr 2020
Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
Ilya Kostrikov
Denis Yarats
Rob Fergus
OffRL
45
774
0
28 Apr 2020
PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning
Guillaume Matheron
Nicolas Perrin
Olivier Sigaud
15
18
0
24 Apr 2020
Model-Predictive Control via Cross-Entropy and Gradient-Based Optimization
Homanga Bharadhwaj
Kevin Xie
Florian Shkurti
21
49
0
19 Apr 2020
Thinking While Moving: Deep Reinforcement Learning with Concurrent Control
Ted Xiao
Eric Jang
Dmitry Kalashnikov
Sergey Levine
Julian Ibarz
Karol Hausman
Alexander Herzog
23
37
0
13 Apr 2020
Energy Shaping Control of a CyberOctopus Soft Arm
Heng-Sheng Chang
Udit Halder
Chia-Hsien Shih
Arman Tekinalp
Tejaswin Parthasarathy
Ekaterina D. Gribkova
Girish Chowdhary
R. Gillette
M. Gazzola
P. Mehta
13
28
0
13 Apr 2020
CURL: Contrastive Unsupervised Representations for Reinforcement Learning
A. Srinivas
Michael Laskin
Pieter Abbeel
SSL
DRL
OffRL
49
1,063
0
08 Apr 2020
Model-based actor-critic: GAN (model generator) + DRL (actor-critic) => AGI
Aras R. Dargazany
OffRL
AI4CE
11
1
0
04 Apr 2020
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
121
0
24 Mar 2020
SAPIEN: A SimulAted Part-based Interactive ENvironment
Fanbo Xiang
Yuzhe Qin
Kaichun Mo
Yikuan Xia
Hao Zhu
...
He Wang
Li Yi
Angel X. Chang
Leonidas J. Guibas
Hao Su
223
488
0
19 Mar 2020
Invariant Causal Prediction for Block MDPs
Amy Zhang
Clare Lyle
Shagun Sodhani
Angelos Filos
Marta Z. Kwiatkowska
Joelle Pineau
Y. Gal
Doina Precup
OffRL
AI4CE
OOD
37
139
0
12 Mar 2020
Learning Predictive Representations for Deformable Objects Using Contrastive Estimation
Wilson Yan
Ashwin Vangipuram
Pieter Abbeel
Lerrel Pinto
37
188
0
11 Mar 2020
SQUIRL: Robust and Efficient Learning from Video Demonstration of Long-Horizon Robotic Manipulation Tasks
Bohan Wu
Feng Xu
Zhanpeng He
Abhi Gupta
Peter K. Allen
OffRL
23
13
0
10 Mar 2020
Hierarchically Decoupled Imitation for Morphological Transfer
D. Hejna
Pieter Abbeel
Lerrel Pinto
LM&Ro
25
41
0
03 Mar 2020
Out-of-Distribution Generalization via Risk Extrapolation (REx)
David M. Krueger
Ethan Caballero
J. Jacobsen
Amy Zhang
Jonathan Binas
Dinghuai Zhang
Rémi Le Priol
Aaron Courville
OOD
215
908
0
02 Mar 2020
PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference
Masashi Okada
Norio Kosaka
T. Taniguchi
8
43
0
01 Mar 2020
Human-like Planning for Reaching in Cluttered Environments
Mohamed Hasan
Matthew Warburton
Wisdom C. Agboh
M. Dogar
Matteo Leonetti
He Wang
F. Mushtaq
M. Mon-Williams
Anthony G. Cohn
16
14
0
28 Feb 2020
Acceleration of Actor-Critic Deep Reinforcement Learning for Visual Grasping in Clutter by State Representation Learning Based on Disentanglement of a Raw Input Image
Tae Won Kim
Yeseong Park
Youngbin Park
I. Suh
DRL
OffRL
6
9
0
27 Feb 2020
Using a thousand optimization tasks to learn hyperparameter search strategies
Luke Metz
Niru Maheswaranathan
Ruoxi Sun
C. Freeman
Ben Poole
Jascha Narain Sohl-Dickstein
20
46
0
27 Feb 2020
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement
Benjamin Eysenbach
Xinyang Geng
Sergey Levine
Ruslan Salakhutdinov
OffRL
18
86
0
25 Feb 2020
Modeling Continuous Stochastic Processes with Dynamic Normalizing Flows
Ruizhi Deng
B. Chang
Marcus A. Brubaker
Greg Mori
Andreas M. Lehrmann
25
50
0
24 Feb 2020
On the Search for Feedback in Reinforcement Learning
Ran A. Wang
Karthikeya S. Parunandi
Aayushman Sharma
R. Goyal
S. Chakravorty
16
9
0
21 Feb 2020
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Noah Y. Siegel
Jost Tobias Springenberg
Felix Berkenkamp
A. Abdolmaleki
Michael Neunert
Thomas Lampe
Roland Hafner
Nicolas Heess
Martin Riedmiller
OffRL
22
282
0
19 Feb 2020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Yujing Hu
...
Changjie Fan
Weixun Wang
Wulong Liu
Zhaodong Wang
J. Peng
OffRL
22
12
0
19 Feb 2020
Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Sammy Christen
Lukás Jendele
Emre Aksan
Otmar Hilliges
OffRL
30
25
0
14 Feb 2020
Domain-Adversarial and Conditional State Space Model for Imitation Learning
Ryogo Okumura
Masashi Okada
T. Taniguchi
21
11
0
31 Jan 2020
Q-Learning in enormous action spaces via amortized approximate maximization
T. Wiele
David Warde-Farley
A. Mnih
Volodymyr Mnih
29
60
0
22 Jan 2020
Automatic Differentiation and Continuous Sensitivity Analysis of Rigid Body Dynamics
David Millard
Eric Heiden
Shubham Agrawal
Gaurav Sukhatme
AI4CE
27
13
0
22 Jan 2020
Lyceum: An efficient and scalable ecosystem for robot learning
Colin Summers
Kendall Lowrey
Aravind Rajeswaran
S. Srinivasa
E. Todorov
24
18
0
21 Jan 2020
Sample-based Distributional Policy Gradient
Rahul Singh
Keuntaek Lee
Yongxin Chen
23
19
0
08 Jan 2020
Blue River Controls: A toolkit for Reinforcement Learning Control Systems on Hardware
Kirill Polzounov
R. Sundar
L. Redden
6
10
0
07 Jan 2020
MushroomRL: Simplifying Reinforcement Learning Research
Carlo DÉramo
Davide Tateo
Andrea Bonarini
Marcello Restelli
Jan Peters
OffRL
14
84
0
04 Jan 2020
Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Michael Neunert
A. Abdolmaleki
Markus Wulfmeier
Thomas Lampe
Jost Tobias Springenberg
Roland Hafner
Francesco Romano
J. Buchli
N. Heess
Martin Riedmiller
21
91
0
02 Jan 2020
Information Theoretic Model Predictive Q-Learning
M. Bhardwaj
Ankur Handa
Dieter Fox
Byron Boots
35
23
0
31 Dec 2019
Learning to grow: control of material self-assembly using evolutionary reinforcement learning
S. Whitelam
Isaac Tamblyn
11
33
0
18 Dec 2019
Parareal with a Learned Coarse Model for Robotic Manipulation
Wisdom C. Agboh
Oliver Grainger
Daniel Ruprecht
M. Dogar
19
11
0
12 Dec 2019
Learning Latent State Spaces for Planning through Reward Prediction
Aaron J. Havens
Ouyang Yi
P. Nagarajan
Yasuhiro Fujita
16
6
0
09 Dec 2019
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
37
1,313
0
03 Dec 2019
IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks
Michael Luo
Jiahao Yao
Richard Liaw
Eric Liang
Ion Stoica
24
15
0
30 Nov 2019
Attention-Privileged Reinforcement Learning
Sasha Salter
Dushyant Rao
Markus Wulfmeier
R. Hadsell
Ingmar Posner
23
8
0
19 Nov 2019
IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
Youngwoon Lee
E. Hu
Zhengyu Yang
Alexander Yin
Joseph J. Lim
33
122
0
17 Nov 2019
Improved Exploration through Latent Trajectory Optimization in Deep Deterministic Policy Gradient
K. Luck
Mel Vecerík
Simon Stepputtis
H. B. Amor
Jonathan Scholz
14
9
0
15 Nov 2019
Catch & Carry: Reusable Neural Controllers for Vision-Guided Whole-Body Tasks
J. Merel
S. Tunyasuvunakool
Arun Ahuja
Yuval Tassa
Leonard Hasenclever
Vu Pham
Tom Erez
Greg Wayne
N. Heess
31
9
0
15 Nov 2019
Quinoa: a Q-function You Infer Normalized Over Actions
Jonas Degrave
A. Abdolmaleki
Jost Tobias Springenberg
N. Heess
Martin Riedmiller
15
5
0
05 Nov 2019
Previous
1
2
3
...
13
14
15
16
Next