1801.00690
Cited By
DeepMind Control Suite
2 January 2018
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
ArXiv (abs)
PDF
HTML
Github (4082★)
Papers citing "DeepMind Control Suite"
50 / 821 papers shown
Title
Generalization in Visual Reinforcement Learning with the Reward Sequence Distribution
Jie Wang
Rui Yang
Zijie Geng
Zhihao Shi
Mingxuan Ye
Qi Zhou
Shuiwang Ji
Bin Li
Yongdong Zhang
Feng Wu
92
6
0
19 Feb 2023
Reinforcement Learning in the Wild with Maximum Likelihood-based Model Transfer
Hannes Eriksson
D. Basu
Tommy Tram
Mina Alibeigi
Christos Dimitrakakis
57
1
0
18 Feb 2023
Imitation from Observation With Bootstrapped Contrastive Learning
M. Sonwa
Johanna Hansen
Eugene Belilovsky
OffRL
62
0
0
13 Feb 2023
Policy-Induced Self-Supervision Improves Representation Finetuning in Visual RL
Sébastien M. R. Arnold
Fei Sha
SSL
57
0
0
12 Feb 2023
Robust Representation Learning by Clustering with Bisimulation Metrics for Visual Reinforcement Learning with Distractions
Qiyuan Liu
Qi Zhou
Rui Yang
Jie Wang
OffRL
OOD
525
15
0
12 Feb 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Xin Liu
Yaran Chen
Haoran Li
Boyu Li
Dong Zhao
SSL
136
10
0
11 Feb 2023
Combining Reconstruction and Contrastive Methods for Multimodal Representations in RL
P. Becker
Sebastian Mossburger
Fabian Otto
Gerhard Neumann
SSL
94
2
0
10 Feb 2023
Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Zichuan Lin
Xiapeng Wu
Mingfei Sun
Deheng Ye
Qiang Fu
Wei Yang
Wei Liu
113
3
0
05 Feb 2023
Mind the Gap: Offline Policy Optimization for Imperfect Rewards
Jianxiong Li
Xiao Hu
Haoran Xu
Jingjing Liu
Xianyuan Zhan
Qing-Shan Jia
Ya Zhang
OffRL
83
20
0
03 Feb 2023
Visual Imitation Learning with Patch Rewards
Minghuan Liu
Tairan He
Weinan Zhang
Shuicheng Yan
Zhongwen Xu
SSL
106
14
0
02 Feb 2023
Learning PDE Solution Operator for Continuous Modeling of Time-Series
Yesom Park
Jaemoo Choi
Changyeon Yoon
Changhoon Song
Myung-joo Kang
AI4TS
AI4CE
50
3
0
02 Feb 2023
CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning
Darshita Jain
A. Majumder
S. Dutta
Swagat Kumar
SSL
69
1
0
31 Jan 2023
Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning
Reimi Sato
Kazuto Fukuchi
Jun Sakuma
Youhei Akimoto
OffRL
55
0
0
31 Jan 2023
Sample Efficient Deep Reinforcement Learning via Local Planning
Dong Yin
S. Thiagarajan
N. Lazić
Nived Rajaraman
Botao Hao
Csaba Szepesvári
71
5
0
29 Jan 2023
Reinforcement Learning from Diverse Human Preferences
Wanqi Xue
Bo An
Shuicheng Yan
Zhongwen Xu
81
26
0
27 Jan 2023
Improving Behavioural Cloning with Positive Unlabeled Learning
Qiang-qiang Wang
Robert McCarthy
David Córdova Bulens
Kevin McGuinness
Noel E. O'Connor
Nico Gürtler
Felix Widmaier
Francisco Roldan Sanchez
S. Redmond
OffRL
OnRL
94
8
0
27 Jan 2023
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
95
10
0
26 Jan 2023
SMART: Self-supervised Multi-task pretrAining with contRol Transformers
Yanchao Sun
Shuang Ma
Ratnesh Madaan
Rogerio Bonatti
Furong Huang
Ashish Kapoor
102
42
0
24 Jan 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Shaofei Cai
Zihao Wang
Xiaojian Ma
Hoang Trung-Dung
Yitao Liang
130
42
0
21 Jan 2023
Learnable Path in Neural Controlled Differential Equations
Sheo Yon Jhin
Minju Jo
Seung-Uk Kook
Noseong Park
Sungpil Woo
Sunhwan Lim
67
6
0
11 Jan 2023
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
121
617
0
10 Jan 2023
Extreme Q-Learning: MaxEnt RL without Entropy
Divyansh Garg
Joey Hejna
Matthieu Geist
Stefano Ermon
OffRL
96
80
0
05 Jan 2023
Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping
Lina Mezghani
Sainbayar Sukhbaatar
Piotr Bojanowski
A. Lazaric
Alahari Karteek
OffRL
139
19
0
05 Jan 2023
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
148
30
0
29 Dec 2022
Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities
Jianda Chen
Sinno Jialin Pan
SSL
61
6
0
26 Dec 2022
Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
Zhecheng Yuan
Zhengrong Xue
Bo Yuan
Xueqian Wang
Yi Wu
Yang Gao
Huazhe Xu
SSL
OffRL
120
74
0
17 Dec 2022
Latent Variable Representation for Reinforcement Learning
Zhaolin Ren
Chenjun Xiao
Tianjun Zhang
Na Li
Zhaoran Wang
Sujay Sanghavi
Dale Schuurmans
Bo Dai
OffRL
106
10
0
17 Dec 2022
Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies
Shivakanth Sujit
Pedro H. M. Braga
J. Bornschein
Samira Ebrahimi Kahou
OffRL
76
1
0
15 Dec 2022
On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline
Nicklas Hansen
Zhecheng Yuan
Yanjie Ze
Tongzhou Mu
Aravind Rajeswaran
H. Su
Huazhe Xu
Xiaolong Wang
102
66
0
12 Dec 2022
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning
Zhao Mandi
Homanga Bharadhwaj
Vincent Moens
Shuran Song
Aravind Rajeswaran
Vikash Kumar
LM&Ro
126
77
0
12 Dec 2022
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations
Nicklas Hansen
Yixin Lin
H. Su
Xiaolong Wang
Vikash Kumar
Aravind Rajeswaran
OffRL
81
51
0
12 Dec 2022
Curiosity creates Diversity in Policy Search
Paul-Antoine Le Tolguenec
Emmanuel Rachelson
Yann Besse
Dennis G. Wilson
80
2
0
07 Dec 2022
Dynamic Decision Frequency with Continuous Options
Amir-Hossein Karimi
Jun Jin
Jun Luo
A. R. Mahmood
Martin Jägersand
Samuele Tosatto
104
10
0
06 Dec 2022
Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning
Naman Saxena
Sandeep Gorantla
Pushpak Jagtap
102
4
0
30 Nov 2022
Tackling Visual Control via Multi-View Exploration Maximization
Mingqi Yuan
Xin Jin
Bo Li
Wenjun Zeng
67
1
0
28 Nov 2022
Choreographer: Learning and Adapting Skills in Imagination
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
Alexandre Lacoste
Sai Rajeswar
125
25
0
23 Nov 2022
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu
Hao Liu
Aditya Grover
Pieter Abbeel
OffRL
89
49
0
23 Nov 2022
Model Based Residual Policy Learning with Applications to Antenna Control
Viktor Eriksson Mollerstedt
Alessio Russo
Maxime Bouton
OffRL
77
3
0
16 Nov 2022
Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning
Katherine Metcalf
Miguel Sarabia
B. Theobald
OffRL
91
5
0
12 Nov 2022
The Expertise Problem: Learning from Specialized Feedback
Oliver Daniels-Koch
Rachel Freedman
OffRL
70
18
0
12 Nov 2022
Scalable Modular Synthetic Data Generation for Advancing Aerial Autonomy
Mehrnaz Sabet
Praveen Palanisamy
Sakshi Mishra
88
4
0
10 Nov 2022
Pretraining in Deep Reinforcement Learning: A Survey
Zhihui Xie
Zichuan Lin
Junyou Li
Shuai Li
Deheng Ye
OffRL
OnRL
AI4CE
87
23
0
08 Nov 2022
Model-based Reinforcement Learning with a Hamiltonian Canonical ODE Network
Yao Feng
Yuhong Jiang
Hang Su
Dong Yan
Jun Zhu
103
1
0
02 Nov 2022
Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information
Riashat Islam
Manan Tomar
Alex Lamb
Yonathan Efroni
Hongyu Zang
...
Dipendra Kumar Misra
Xin-hui Li
H. V. Seijen
Rémi Tachet des Combes
John Langford
OffRL
67
7
0
31 Oct 2022
Disentangled (Un)Controllable Features
Jacob E. Kooi
Mark Hoogendoorn
Vincent François-Lavet
DRL
60
0
0
31 Oct 2022
Learning Deep Sensorimotor Policies for Vision-based Autonomous Drone Racing
Jiawei Fu
Yunlong Song
Yongpeng Wu
Feng Yu
Davide Scaramuzza
122
21
0
26 Oct 2022
ERL-Re²: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
Jianye Hao
Pengyi Li
Hongyao Tang
Yan Zheng
Xian Fu
Zhaopeng Meng
94
26
0
26 Oct 2022
Evaluating Long-Term Memory in 3D Mazes
J. Pašukonis
Timothy Lillicrap
Danijar Hafner
3DV
88
23
0
24 Oct 2022
On Many-Actions Policy Gradient
Michal Nauman
Marek Cygan
81
0
0
24 Oct 2022
Learning Robust Dynamics through Variational Sparse Gating
A. Jain
Shivakanth Sujit
S. Joshi
Vincent Michalski
Danijar Hafner
Samira Ebrahimi Kahou
73
9
0
21 Oct 2022