ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1801.00690
  4. Cited By
DeepMind Control Suite

DeepMind Control Suite

2 January 2018
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
    ELMLM&RoBDL
ArXiv (abs)PDFHTMLGithub (4082★)

Papers citing "DeepMind Control Suite"

50 / 821 papers shown
Title
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
89
45
0
13 Oct 2023
PolyTask: Learning Unified Policies through Behavior Distillation
PolyTask: Learning Unified Policies through Behavior Distillation
Siddhant Haldar
Lerrel Pinto
75
9
0
12 Oct 2023
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules
  and Training Stages
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Guozheng Ma
Lu Li
Sen Zhang
Zixuan Liu
Zhen Wang
Yixin Chen
Li Shen
Xueqian Wang
Dacheng Tao
OffRL
97
21
0
11 Oct 2023
RoboHive: A Unified Framework for Robot Learning
RoboHive: A Unified Framework for Robot Learning
Vikash Kumar
Rutav Shah
Gaoyue Zhou
Vincent Moens
Vittorio Caggiano
Jay Vakil
Abhishek Gupta
Aravind Rajeswaran
69
25
0
10 Oct 2023
A Unified View on Solving Objective Mismatch in Model-Based
  Reinforcement Learning
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
109
7
0
10 Oct 2023
Learning Interactive Real-World Simulators
Learning Interactive Real-World Simulators
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&RoPINN
90
215
0
09 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement
  Learning
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRLOnRL
78
6
0
09 Oct 2023
Multi-timestep models for Model-based Reinforcement Learning
Multi-timestep models for Model-based Reinforcement Learning
Abdelhakim Benechehab
Giuseppe Paolo
Albert Thomas
Maurizio Filippone
Balázs Kégl
OffRL
78
0
0
09 Oct 2023
Hieros: Hierarchical Imagination on Structured State Space Sequence
  World Models
Hieros: Hierarchical Imagination on Structured State Space Sequence World Models
Paul Mattes
Rainer Schlosser
R. Herbrich
73
5
0
08 Oct 2023
Learning Generalizable Agents via Saliency-Guided Features Decorrelation
Learning Generalizable Agents via Saliency-Guided Features Decorrelation
Sili Huang
Yanchao Sun
Jifeng Hu
Siyuan Guo
Hechang Chen
Yi-Ju Chang
Lichao Sun
Bo Yang
80
6
0
08 Oct 2023
Offline Imitation Learning with Variational Counterfactual Reasoning
Offline Imitation Learning with Variational Counterfactual Reasoning
Bowei He
Zexu Sun
Jinxin Liu
Shuai Zhang
Xu Chen
Chen Ma
OffRL
81
9
0
07 Oct 2023
A Kernel Perspective on Behavioural Metrics for Markov Decision
  Processes
A Kernel Perspective on Behavioural Metrics for Markov Decision Processes
Pablo Samuel Castro
Tyler Kastner
Prakash Panangaden
Mark Rowland
85
5
0
05 Oct 2023
Differentially Encoded Observation Spaces for Perceptive Reinforcement
  Learning
Differentially Encoded Observation Spaces for Perceptive Reinforcement Learning
Lev Grossman
Brian Plancher
OffRL
49
0
0
03 Oct 2023
Blending Imitation and Reinforcement Learning for Robust Policy
  Improvement
Blending Imitation and Reinforcement Learning for Robust Policy Improvement
Xuefeng Liu
Takuma Yoneda
Rick L. Stevens
Matthew R. Walter
Yuxin Chen
98
11
0
03 Oct 2023
Controlling Neural Style Transfer with Deep Reinforcement Learning
Controlling Neural Style Transfer with Deep Reinforcement Learning
Chengming Feng
Jing Hu
Xin Wang
Shu Hu
Bin Zhu
Xi Wu
Hongtu Zhu
Siwei Lyu
53
3
0
30 Sep 2023
HarmonyDream: Task Harmonization Inside World Models
HarmonyDream: Task Harmonization Inside World Models
Haoyu Ma
Jialong Wu
Ningya Feng
Chenjun Xiao
Dong Li
Jianye Hao
Jianmin Wang
Mingsheng Long
80
8
0
30 Sep 2023
Adversarial Imitation Learning from Visual Observations using Latent
  Information
Adversarial Imitation Learning from Visual Observations using Latent Information
Vittorio Giammarino
Tomas Landelius
I. Paschalidis
97
7
0
29 Sep 2023
ComSD: Balancing Behavioral Quality and Diversity in Unsupervised Skill Discovery
ComSD: Balancing Behavioral Quality and Diversity in Unsupervised Skill Discovery
Xin Liu
Yaran Chen
Dong Zhao
70
0
0
29 Sep 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
RLLTE: Long-Term Evolution Project of Reinforcement Learning
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
77
1
0
28 Sep 2023
Task-Oriented Koopman-Based Control with Contrastive Encoder
Task-Oriented Koopman-Based Control with Contrastive Encoder
Xubo Lyu
Hanyang Hu
Seth Siriya
Ye Pu
Mo Chen
88
8
0
28 Sep 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRLOnRL
92
4
0
26 Sep 2023
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in
  Continuous Control
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Nate Rahn
P. DÓro
Harley Wiltzer
Pierre-Luc Bacon
Marc G. Bellemare
98
3
0
26 Sep 2023
Diagnosing and exploiting the computational demands of videos games for
  deep reinforcement learning
Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
L. Govindarajan
Rex G Liu
Drew Linsley
A. Ashok
Max Reuter
M. Frank
Thomas Serre
OffRL
63
0
0
22 Sep 2023
Sequential Action-Induced Invariant Representation for Reinforcement
  Learning
Sequential Action-Induced Invariant Representation for Reinforcement Learning
Dayang Liang
Qihang Chen
Yunlong Liu
90
4
0
22 Sep 2023
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
Botian Xu
Feng Gao
Chao Yu
Chao Yu
Yi Wu
Yu Wang
112
30
0
22 Sep 2023
Efficient RL via Disentangled Environment and Agent Representations
Efficient RL via Disentangled Environment and Agent Representations
Kevin Gmelin
Shikhar Bahl
Russell Mendonca
Deepak Pathak
DRL
73
9
0
05 Sep 2023
A Survey on Physics Informed Reinforcement Learning: Review and Open
  Problems
A Survey on Physics Informed Reinforcement Learning: Review and Open Problems
C. Banerjee
Kien Nguyen
Clinton Fookes
M. Raissi
PINNAI4CE
111
10
0
05 Sep 2023
RePo: Resilient Model-Based Reinforcement Learning by Regularizing
  Posterior Predictability
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability
Chuning Zhu
Max Simchowitz
Siri Gadipudi
Abhishek Gupta
107
14
0
31 Aug 2023
Policy composition in reinforcement learning via multi-objective policy
  optimization
Policy composition in reinforcement learning via multi-objective policy optimization
Shruti Mishra
Ankit Anand
Jordan Hoffmann
N. Heess
Martin Riedmiller
A. Abdolmaleki
Doina Precup
106
0
0
29 Aug 2023
${\rm E}(3)$-Equivariant Actor-Critic Methods for Cooperative
  Multi-Agent Reinforcement Learning
E(3){\rm E}(3)E(3)-Equivariant Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning
Dingyang Chen
Qi Zhang
130
4
0
23 Aug 2023
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning
  from Human Feedback
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Dinesh Manocha
Huazheng Wang
Mengdi Wang
Furong Huang
112
27
0
03 Aug 2023
Improving Generalization in Visual Reinforcement Learning via
  Conflict-aware Gradient Agreement Augmentation
Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation
Siao Liu
Zhaoyu Chen
Yang Liu
Yuzheng Wang
Dingkang Yang
...
Ziqing Zhou
Xie Yi
Wei Li
Wenqiang Zhang
Zhongxue Gan
118
24
0
02 Aug 2023
Shrink-Perturb Improves Architecture Mixing during Population Based
  Training for Neural Architecture Search
Shrink-Perturb Improves Architecture Mixing during Population Based Training for Neural Architecture Search
A. Chebykin
A. Dushatskiy
Tanja Alderliesten
Peter A. N. Bosman
91
1
0
28 Jul 2023
Worrisome Properties of Neural Network Controllers and Their Symbolic
  Representations
Worrisome Properties of Neural Network Controllers and Their Symbolic Representations
J. Cyranka
Kevin E. M. Church
J. Lessard
73
0
0
28 Jul 2023
Approximate Model-Based Shielding for Safe Reinforcement Learning
Approximate Model-Based Shielding for Safe Reinforcement Learning
Alexander W. Goodall
Francesco Belardinelli
56
0
0
27 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement
  Learning
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
122
2
0
21 Jul 2023
STRAPPER: Preference-based Reinforcement Learning via Self-training
  Augmentation and Peer Regularization
STRAPPER: Preference-based Reinforcement Learning via Self-training Augmentation and Peer Regularization
Yachen Kang
Li He
Jinxin Liu
Zifeng Zhuang
Donglin Wang
86
1
0
19 Jul 2023
Can Euclidean Symmetry be Leveraged in Reinforcement Learning and
  Planning?
Can Euclidean Symmetry be Leveraged in Reinforcement Learning and Planning?
Linfeng Zhao
Owen Howell
Jung Yeon Park
Xu Zhu
Robin Walters
Lawson L. S. Wong
85
1
0
17 Jul 2023
Is Imitation All You Need? Generalized Decision-Making with Dual-Phase
  Training
Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training
Yao Wei
Yanchao Sun
Ruijie Zheng
Sai H. Vemprala
Rogerio Bonatti
Shuhang Chen
Ratnesh Madaan
Zhongjie Ba
Ashish Kapoor
Shuang Ma
OffRL
73
17
0
16 Jul 2023
Policy Contrastive Imitation Learning
Policy Contrastive Imitation Learning
Jialei Huang
Zhao-Heng Yin
Yingdong Hu
Yang Gao
90
3
0
06 Jul 2023
MoVie: Visual Model-Based Policy Adaptation for View Generalization
MoVie: Visual Model-Based Policy Adaptation for View Generalization
Sizhe Yang
Yanjie Ze
Huazhe Xu
173
13
0
03 Jul 2023
Identifying Important Sensory Feedback for Learning Locomotion Skills
Identifying Important Sensory Feedback for Learning Locomotion Skills
Wanming Yu
Chuanyu Yang
C. McGreavy
Eleftherios Triantafyllidis
Guillaume Bellegarda
M. Shafiee
A. Ijspeert
Zhibin Li
85
16
0
29 Jun 2023
SARC: Soft Actor Retrospective Critic
SARC: Soft Actor Retrospective Critic
Sukriti Verma
Ayush Chopra
J. Subramanian
Mausoom Sarkar
Nikaash Puri
Piyush B. Gupta
Balaji Krishnamurthy
48
0
0
28 Jun 2023
Curious Replay for Model-based Adaptation
Curious Replay for Model-based Adaptation
Isaac Kauvar
Christopher Doyle
Linqi Zhou
Nick Haber
68
12
0
28 Jun 2023
Learning to Modulate pre-trained Models in RL
Learning to Modulate pre-trained Models in RL
Thomas Schmied
M. Hofmarcher
Fabian Paischer
Razvan Pascanu
Sepp Hochreiter
CLLOffRL
109
18
0
26 Jun 2023
Correcting discount-factor mismatch in on-policy policy gradient methods
Correcting discount-factor mismatch in on-policy policy gradient methods
Fengdi Che
Gautham Vasan
A. R. Mahmood
OffRL
62
9
0
23 Jun 2023
Optimistic Active Exploration of Dynamical Systems
Optimistic Active Exploration of Dynamical Systems
Bhavya Sukhija
Lenart Treven
Cansu Sancaktar
Sebastian Blaes
Stelian Coros
Andreas Krause
124
18
0
21 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for
  Search Engine Marketing Optimization
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
90
1
0
21 Jun 2023
Efficient Dynamics Modeling in Interactive Environments with Koopman
  Theory
Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
Arnab Kumar Mondal
Siba Smarak Panigrahi
Sai Rajeswar
K. Siddiqi
Siamak Ravanbakhsh
99
3
0
20 Jun 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL
Informed POMDP: Leveraging Additional Information in Model-Based RL
Gaspard Lambrechts
Adrien Bolland
D. Ernst
72
8
0
20 Jun 2023
Previous
123...567...151617
Next