ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.00690
  4. Cited By
DeepMind Control Suite

DeepMind Control Suite

2 January 2018
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
    ELM
    LM&Ro
    BDL
ArXivPDFHTML

Papers citing "DeepMind Control Suite"

41 / 791 papers shown
Title
Model Primitive Hierarchical Lifelong Reinforcement Learning
Model Primitive Hierarchical Lifelong Reinforcement Learning
Bohan Wu
Jayesh K. Gupta
Mykel J. Kochenderfer
OffRL
14
10
0
04 Mar 2019
Verification of Non-Linear Specifications for Neural Networks
Verification of Non-Linear Specifications for Neural Networks
Chongli Qin
Krishnamurthy Dvijotham
Dvijotham
Brendan O'Donoghue
Rudy Bunel
Robert Stanforth
Sven Gowal
J. Uesato
G. Swirszcz
Pushmeet Kohli
AAML
16
43
0
25 Feb 2019
Emergent Coordination Through Competition
Emergent Coordination Through Competition
Siqi Liu
Guy Lever
J. Merel
S. Tunyasuvunakool
N. Heess
T. Graepel
47
149
0
19 Feb 2019
Investigating Generalisation in Continuous Deep Reinforcement Learning
Investigating Generalisation in Continuous Deep Reinforcement Learning
Chenyang Zhao
Olivier Sigaud
F. Stulp
Timothy M. Hospedales
OffRL
22
48
0
19 Feb 2019
Sufficiently Accurate Model Learning
Sufficiently Accurate Model Learning
Clark Zhang
Arbaaz Khan
Santiago Paternain
Alejandro Ribeiro
31
3
0
19 Feb 2019
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater
  Sample Efficiency and Simplicity
CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity
Aditya Bhatt
Daniel Palenicek
Boris Belousov
Max Argus
Artemij Amiranashvili
Thomas Brox
Jan Peters
35
45
0
14 Feb 2019
Value constrained model-free continuous control
Value constrained model-free continuous control
Steven Bohez
A. Abdolmaleki
Michael Neunert
J. Buchli
N. Heess
R. Hadsell
24
62
0
12 Feb 2019
TF-Replicator: Distributed Machine Learning for Researchers
TF-Replicator: Distributed Machine Learning for Researchers
P. Buchlovsky
David Budden
Dominik Grewe
Chris Jones
John Aslanides
...
Aidan Clark
Sergio Gomez Colmenarejo
Aedan Pope
Fabio Viola
Dan Belov
GNN
OffRL
AI4CE
37
20
0
01 Feb 2019
Motion Perception in Reinforcement Learning with Dynamic Objects
Motion Perception in Reinforcement Learning with Dynamic Objects
Artemij Amiranashvili
Alexey Dosovitskiy
V. Koltun
Thomas Brox
14
35
0
10 Jan 2019
Dopamine: A Research Framework for Deep Reinforcement Learning
Dopamine: A Research Framework for Deep Reinforcement Learning
Pablo Samuel Castro
Subhodeep Moitra
Carles Gelada
Saurabh Kumar
Marc G. Bellemare
OffRL
28
276
0
14 Dec 2018
Relative Entropy Regularized Policy Iteration
Relative Entropy Regularized Policy Iteration
A. Abdolmaleki
Jost Tobias Springenberg
Jonas Degrave
Steven Bohez
Yuval Tassa
Dan Belov
N. Heess
Martin Riedmiller
27
72
0
05 Dec 2018
Composing Entropic Policies using Divergence Correction
Composing Entropic Policies using Divergence Correction
Jonathan J. Hunt
André Barreto
Timothy Lillicrap
N. Heess
24
2
0
05 Dec 2018
Rigorous Agent Evaluation: An Adversarial Approach to Uncover
  Catastrophic Failures
Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures
Junhui Yin
Jiayan Qiu
Csaba Szepesvári
Siqing Zhang
Avraham Ruderman
Jiyang Xie
Krishnamurthy Dvijotham
Zhanyu Ma
N. Heess
Pushmeet Kohli
AAML
15
80
0
04 Dec 2018
CompILE: Compositional Imitation Learning and Execution
CompILE: Compositional Imitation Learning and Execution
Thomas Kipf
Yujia Li
H. Dai
V. Zambaldi
Alvaro Sanchez-Gonzalez
Edward Grefenstette
Pushmeet Kohli
Peter W. Battaglia
VLM
30
13
0
04 Dec 2018
Adversarial Domain Randomization
Adversarial Domain Randomization
Rawal Khirodkar
Kris Kitani
21
5
0
03 Dec 2018
Unsupervised Control Through Non-Parametric Discriminative Rewards
Unsupervised Control Through Non-Parametric Discriminative Rewards
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
Steven Hansen
Volodymyr Mnih
DRL
OffRL
SSL
41
173
0
28 Nov 2018
Hierarchical visuomotor control of humanoids
Hierarchical visuomotor control of humanoids
J. Merel
Arun Ahuja
Vu Pham
S. Tunyasuvunakool
Siqi Liu
Dhruva Tirumala
N. Heess
Greg Wayne
42
97
0
23 Nov 2018
Learning Latent Dynamics for Planning from Pixels
Learning Latent Dynamics for Planning from Pixels
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
BDL
42
1,407
0
12 Nov 2018
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search
Gary Cheng
Kannan Ramchandran
L. Ghaoui
14
23
0
06 Nov 2018
Deep Reinforcement Learning
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
28
144
0
15 Oct 2018
GPU-Accelerated Robotic Simulation for Distributed Reinforcement
  Learning
GPU-Accelerated Robotic Simulation for Distributed Reinforcement Learning
Jacky Liang
Viktor Makoviychuk
Ankur Handa
N. Chentanez
Miles Macklin
Dieter Fox
AI4CE
27
182
0
12 Oct 2018
Benchmarking Reinforcement Learning Algorithms on Real-World Robots
Benchmarking Reinforcement Learning Algorithms on Real-World Robots
A. R. Mahmood
D. Korenkevych
Gautham Vasan
W. Ma
James Bergstra
OffRL
14
155
0
20 Sep 2018
Deterministic Implementations for Reproducibility in Deep Reinforcement
  Learning
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning
P. Nagarajan
Garrett A. Warnell
Peter Stone
22
51
0
15 Sep 2018
Unity: A General Platform for Intelligent Agents
Unity: A General Platform for Intelligent Agents
Arthur Juliani
Vincent-Pierre Berges
Esh Vckay
Andrew Cohen
Jonathan Harper
...
Chris Goy
Yuan Gao
Hunter Henry
Marwan Mattar
Danny Lange
39
808
0
07 Sep 2018
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience
  Replay
ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay
Sameera Lanka
Tianfu Wu
28
30
0
06 Sep 2018
Texar: A Modularized, Versatile, and Extensible Toolkit for Text
  Generation
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation
Zhiting Hu
Haoran Shi
Bowen Tan
Wentao Wang
Zichao Yang
...
Zhengzhong Liu
Xiaodan Liang
Wangrong Zhu
Devendra Singh Sachan
Eric Xing
VLM
22
56
0
04 Sep 2018
Learning Actionable Representations from Visual Observations
Learning Actionable Representations from Visual Observations
Debidatta Dwibedi
Jonathan Tompson
Corey Lynch
P. Sermanet
SSL
22
80
0
02 Aug 2018
Remember and Forget for Experience Replay
Remember and Forget for Experience Replay
G. Novati
Petros Koumoutsakos
OffRL
35
90
0
16 Jul 2018
Adaptive Path-Integral Autoencoder: Representation Learning and Planning
  for Dynamical Systems
Adaptive Path-Integral Autoencoder: Representation Learning and Planning for Dynamical Systems
Jung-Su Ha
Young-Jin Park
Hyeok-Joo Chae
Soon-Seo Park
Han-Lim Choi
BDL
17
26
0
05 Jul 2018
Maximum a Posteriori Policy Optimisation
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
48
471
0
14 Jun 2018
Randomized Prior Functions for Deep Reinforcement Learning
Randomized Prior Functions for Deep Reinforcement Learning
Ian Osband
John Aslanides
Albin Cassirer
UQCV
BDL
21
372
0
08 Jun 2018
Continuous-time Value Function Approximation in Reproducing Kernel
  Hilbert Spaces
Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces
Motoya Ohnishi
M. Yukawa
M. Johansson
Masashi Sugiyama
8
3
0
08 Jun 2018
Graph networks as learnable physics engines for inference and control
Graph networks as learnable physics engines for inference and control
Alvaro Sanchez-Gonzalez
N. Heess
Jost Tobias Springenberg
J. Merel
Martin Riedmiller
R. Hadsell
Peter W. Battaglia
GNN
AI4CE
PINN
OCL
42
595
0
04 Jun 2018
Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Maria Dimakopoulou
Ian Osband
Benjamin Van Roy
OffRL
16
23
0
23 May 2018
Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
Sim-to-Real: Learning Agile Locomotion For Quadruped Robots
Jie Tan
Tingnan Zhang
Erwin Coumans
Atil Iscen
Yunfei Bai
Danijar Hafner
Steven Bohez
Vincent Vanhoucke
25
790
0
27 Apr 2018
Distributed Distributional Deterministic Policy Gradients
Distributed Distributional Deterministic Policy Gradients
Gabriel Barth-Maron
Matthew W. Hoffman
David Budden
Will Dabney
Dan Horgan
TB Dhruva
Alistair Muldal
N. Heess
Timothy Lillicrap
OffRL
52
477
0
23 Apr 2018
Terrain RL Simulator
Terrain RL Simulator
Glen Berseth
Xue Bin Peng
M. van de Panne
AI4CE
27
5
0
17 Apr 2018
Distributed Prioritized Experience Replay
Distributed Prioritized Experience Replay
Dan Horgan
John Quan
David Budden
Gabriel Barth-Maron
Matteo Hessel
H. V. Hasselt
David Silver
113
731
0
02 Mar 2018
Clipped Action Policy Gradient
Clipped Action Policy Gradient
Yasuhiro Fujita
S. Maeda
OffRL
34
37
0
21 Feb 2018
State Representation Learning for Control: An Overview
State Representation Learning for Control: An Overview
Timothée Lesort
Natalia Díaz Rodríguez
Jean-François Goudou
David Filliat
OffRL
30
319
0
12 Feb 2018
A Deeper Look at Experience Replay
A Deeper Look at Experience Replay
Shangtong Zhang
R. Sutton
OffRL
VLM
41
269
0
04 Dec 2017
Previous
123...141516