arXiv:1801.00690
DeepMind Control Suite

2 January 2018
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
    ELM
    LM&Ro
    BDL

Papers citing "DeepMind Control Suite"

50 / 791 papers shown
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback
Souradip Chakraborty
Amrit Singh Bedi
Alec Koppel
Dinesh Manocha
Huazheng Wang
Mengdi Wang
Furong Huang
31
26
0
03 Aug 2023
Improving Generalization in Visual Reinforcement Learning via Conflict-aware Gradient Agreement Augmentation
Siao Liu
Zhaoyu Chen
Yang Liu
Yuzheng Wang
Dingkang Yang
...
Ziqing Zhou
Xie Yi
Wei Li
Wenqiang Zhang
Zhongxue Gan
41
22
0
02 Aug 2023
Shrink-Perturb Improves Architecture Mixing during Population Based Training for Neural Architecture Search
A. Chebykin
A. Dushatskiy
Tanja Alderliesten
Peter A. N. Bosman
42
0
0
28 Jul 2023
Worrisome Properties of Neural Network Controllers and Their Symbolic Representations
J. Cyranka
Kevin E. M. Church
J. Lessard
42
0
0
28 Jul 2023
Approximate Model-Based Shielding for Safe Reinforcement Learning
Alexander W. Goodall
Francesco Belardinelli
18
0
0
27 Jul 2023
JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning
Kaiwen Wang
Junxiong Wang
Yueying Li
Nathan Kallus
Immanuel Trummer
Wen Sun
GP
52
2
0
21 Jul 2023
STRAPPER: Preference-based Reinforcement Learning via Self-training Augmentation and Peer Regularization
Yachen Kang
Li He
Jinxin Liu
Zifeng Zhuang
Donglin Wang
39
0
0
19 Jul 2023
Can Euclidean Symmetry be Leveraged in Reinforcement Learning and Planning?
Linfeng Zhao
Owen Howell
Jung Yeon Park
Xu Zhu
Robin Walters
Lawson L. S. Wong
38
1
0
17 Jul 2023
Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training
Yao Wei
Yanchao Sun
Ruijie Zheng
Sai H. Vemprala
Rogerio Bonatti
Shuhang Chen
Ratnesh Madaan
Zhongjie Ba
Ashish Kapoor
Shuang Ma
OffRL
30
15
0
16 Jul 2023
Policy Contrastive Imitation Learning
Jialei Huang
Zhao-Heng Yin
Yingdong Hu
Yang Gao
32
3
0
06 Jul 2023
MoVie: Visual Model-Based Policy Adaptation for View Generalization
Sizhe Yang
Yanjie Ze
Huazhe Xu
52
12
0
03 Jul 2023
Identifying Important Sensory Feedback for Learning Locomotion Skills
Wanming Yu
Chuanyu Yang
C. McGreavy
Eleftherios Triantafyllidis
Guillaume Bellegarda
M. Shafiee
A. Ijspeert
Zhibin Li
24
16
0
29 Jun 2023
SARC: Soft Actor Retrospective Critic
Sukriti Verma
Ayush Chopra
J. Subramanian
Mausoom Sarkar
Nikaash Puri
Piyush B. Gupta
Balaji Krishnamurthy
15
0
0
28 Jun 2023
Curious Replay for Model-based Adaptation
Isaac Kauvar
Christopher Doyle
Linqi Zhou
Nick Haber
25
11
0
28 Jun 2023
Learning to Modulate pre-trained Models in RL
Thomas Schmied
M. Hofmarcher
Fabian Paischer
Razvan Pascanu
Sepp Hochreiter
CLL
OffRL
34
14
0
26 Jun 2023
Correcting discount-factor mismatch in on-policy policy gradient methods
Fengdi Che
Gautham Vasan
A. R. Mahmood
OffRL
20
9
0
23 Jun 2023
Optimistic Active Exploration of Dynamical Systems
Bhavya Sukhija
Lenart Treven
Cansu Sancaktar
Sebastian Blaes
Stelian Coros
Andreas Krause
29
17
0
21 Jun 2023
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization
Maziar Gomrokchi
Owen Levin
Jeffrey Roach
Jonah White
OffRL
32
1
0
21 Jun 2023
Efficient Dynamics Modeling in Interactive Environments with Koopman Theory
Arnab Kumar Mondal
Siba Smarak Panigrahi
Sai Rajeswar
K. Siddiqi
Siamak Ravanbakhsh
36
3
0
20 Jun 2023
Informed POMDP: Leveraging Additional Information in Model-Based RL
Gaspard Lambrechts
Adrien Bolland
D. Ernst
31
7
0
20 Jun 2023
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning
Hojoon Lee
Hanseul Cho
Hyunseung Kim
Daehoon Gwak
Joonkee Kim
Jaegul Choo
Se-Young Yun
Chulhee Yun
OffRL
82
26
0
19 Jun 2023
SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models
Shenghua Wan
Yucen Wang
Minghao Shao
Ruying Chen
De-Chuan Zhan
61
7
0
19 Jun 2023
Active Policy Improvement from Multiple Black-box Oracles
Xuefeng Liu
Takuma Yoneda
Chaoqi Wang
Matthew R. Walter
Yuxin Chen
39
9
0
17 Jun 2023
Genes in Intelligent Agents
Fu Feng
Jing Wang
Xu Yang
Xin Geng
AI4CE
32
7
0
17 Jun 2023
Reward-Free Curricula for Training Robust World Models
Marc Rigter
Minqi Jiang
Ingmar Posner
VLM
OffRL
39
6
0
15 Jun 2023
Probabilistic Learning of Multivariate Time Series with Temporal Irregularity
Yijun Li
Cheuk Hang Leung
Qi Wu
AI4TS
26
1
0
15 Jun 2023
VIBR: Learning View-Invariant Value Functions for Robust Visual Control
Tom Dupuis
Jaonary Rabarisoa
Q. C. Pham
David Filliat
42
0
0
14 Jun 2023
On the Efficacy of 3D Point Cloud Reinforcement Learning
Z. Ling
Yuan Yao
Xuanlin Li
H. Su
3DPC
34
13
0
11 Jun 2023
Learning World Models with Identifiable Factorization
Yu-Ren Liu
Erdun Gao
Zhengmao Zhu
Hong Tian
Biwei Huang
Yang Yu
Kun Zhang
CML
OffRL
42
12
0
11 Jun 2023
Approximate information state based convergence analysis of recurrent Q-learning
Erfan Seyedsalehi
N. Akbarzadeh
Amit Sinha
Aditya Mahajan
27
6
0
09 Jun 2023
On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning
Hojoon Lee
Ko-tik Lee
Dongyoon Hwang
Hyunho Lee
ByungKun Lee
Jaegul Choo
SSL
OOD
31
5
0
09 Jun 2023
Generalization Across Observation Shifts in Reinforcement Learning
Anuj Mahajan
Amy Zhang
OOD
OffRL
19
0
0
07 Jun 2023
Model-Based Reinforcement Learning with Multi-Task Offline Pretraining
Minting Pan
Yitao Zheng
Yunbo Wang
Xiaokang Yang
OffRL
32
0
0
06 Jun 2023
Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation
Wanpeng Zhang
Yilin Li
Boyu Yang
Zongqing Lu
CML
26
0
0
05 Jun 2023
Improving and Benchmarking Offline Reinforcement Learning Algorithms
Bingyi Kang
Xiao Ma
Yi-Ren Wang
Yang Yue
Shuicheng Yan
OffRL
16
9
0
01 Jun 2023
Normalization Enhances Generalization in Visual Reinforcement Learning
Lu Li
Jiafei Lyu
Guozheng Ma
Zilin Wang
Zhen Yang
Xiu Li
Zhiheng Li
OOD
27
8
0
01 Jun 2023
NetHack is Hard to Hack
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
24
7
0
30 May 2023
GAN-MPC: Training Model Predictive Controllers with Parameterized Cost Functions using Demonstrations from Non-identical Experts
Returaj Burnwal
Anirban Santara
Nirav P. Bhatt
Balaraman Ravindran
Gaurav Aggarwal
19
0
0
30 May 2023
RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban Environments
Daniel Coelho
Miguel Oliveira
Vítor M. F. Santos
25
3
0
29 May 2023
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
36
25
0
29 May 2023
Query-Policy Misalignment in Preference-Based Reinforcement Learning
Xiao Hu
Jianxiong Li
Xianyuan Zhan
Qing-Shan Jia
Ya Zhang
30
8
0
27 May 2023
Reinforcement Learning with Simple Sequence Priors
Tankred Saanum
N. Éltető
Peter Dayan
Marcel Binz
Eric Schulz
OffRL
31
7
0
26 May 2023
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning
Guozheng Ma
Linrui Zhang
Haoyu Wang
Lu Li
Zilin Wang
Zhen Wang
Li Shen
Xueqian Wang
Dacheng Tao
47
10
0
25 May 2023
Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning
Q. Wang
Jun Yang
Yunbo Wang
Xin Jin
Wenjun Zeng
Xiaokang Yang
OffRL
OnRL
39
3
0
24 May 2023
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Yann Dubois
Xuechen Li
Rohan Taori
Tianyi Zhang
Ishaan Gulrajani
Jimmy Ba
Carlos Guestrin
Percy Liang
Tatsunori B. Hashimoto
ALM
45
549
0
22 May 2023
FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation
Minho Heo
Youngwoon Lee
Doohyun Lee
Joseph J. Lim
30
86
0
22 May 2023
Introspective Tips: Large Language Model for In-Context Decision Making
Liting Chen
Lu Wang
Hang Dong
Yali Du
Jie Yan
...
Pu Zhao
Si Qin
Saravan Rajmohan
Qingwei Lin
Dongmei Zhang
LLMAG
LRM
42
24
0
19 May 2023
A Generalist Dynamics Model for Control
Ingmar Schubert
Jingwei Zhang
Jake Bruce
Sarah Bechtle
Emilio Parisotto
Martin Riedmiller
Jost Tobias Springenberg
Arunkumar Byravan
Leonard Hasenclever
N. Heess
AI4CE
41
30
0
18 May 2023
MIMEx: Intrinsic Rewards from Masked Input Modeling
Toru Lin
Allan Jabri
OffRL
31
6
0
15 May 2023
Behavior Contrastive Learning for Unsupervised Skill Discovery
Rushuai Yang
Chenjia Bai
Hongyi Guo
Siyuan Li
Bin Zhao
Zhen Wang
Peng Liu
Xuelong Li
SSL
29
16
0
08 May 2023