1801.00690
Cited By
DeepMind Control Suite
2 January 2018
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
ArXiv (abs)
PDF
HTML
Github (4082★)
Papers citing "DeepMind Control Suite"
50 / 821 papers shown
Title
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
Guozheng Ma
Lu Li
Zilin Wang
Li Shen
Pierre-Luc Bacon
Dacheng Tao
OffRL
34
0
0
20 Jun 2025
Zero-Shot Reinforcement Learning Under Partial Observability
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
32
0
0
18 Jun 2025
Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning
Roger Creus Castanyer
J. Obando-Ceron
Lu Li
Pierre-Luc Bacon
Glen Berseth
Aaron Courville
Pablo Samuel Castro
36
0
0
18 Jun 2025
PB²: Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning
Brahim Driss
Alex Davey
Riad Akrour
28
0
0
16 Jun 2025
Flow-Based Policy for Online Reinforcement Learning
Lei Lv
Y. Li
Yu-Juan Luo
F. Sun
Tao Kong
Jiafeng Xu
Xiao Ma
28
0
0
15 Jun 2025
MOORL: A Framework for Integrating Offline-Online Reinforcement Learning
Gaurav Chaudhary
Wassim Uddin Mondal
Laxmidhar Behera
OffRL
110
0
0
11 Jun 2025
SkillBlender: Towards Versatile Humanoid Whole-Body Loco-Manipulation via Skill Blending
Yuxuan Kuang
Haoran Geng
Amine Elhafsi
Tan-Dzung Do
Pieter Abbeel
Jitendra Malik
Marco Pavone
Yue Wang
87
1
0
11 Jun 2025
An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models
Pranav Guruprasad
Yangyue Wang
Sudipta Chowdhury
Jaewoo Song
Harshvardhan Sikka
50
0
0
10 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TS
OffRL
AI4CE
48
0
0
10 Jun 2025
Multi-Task Reward Learning from Human Ratings
Mingkang Wu
Devin White
Evelyn Rose
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
35
0
0
10 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
40
0
0
06 Jun 2025
Dream to Generalize: Zero-Shot Model-Based Reinforcement Learning for Unseen Visual Distractions
Jeongsoo Ha
Kyungsoo Kim
Yusung Kim
OffRL
VLM
68
6
0
05 Jun 2025
Self-Predictive Dynamics for Generalization of Vision-based Reinforcement Learning
Kyungsoo Kim
Jeongsoo Ha
Yusung Kim
BDL
47
7
0
05 Jun 2025
Optimistic critics can empower small actors
Olya Mastikhina
Dhruv Sreenivas
Pablo Samuel Castro
72
0
0
01 Jun 2025
CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries
Ni Mu
Hao Hu
Xiao Hu
Yiqin Yang
Bo Xu
Qing-Shan Jia
62
0
0
31 May 2025
Autonomous Behavior and Whole-Brain Dynamics Emerge in Embodied Zebrafish Agents with Model-based Intrinsic Motivation
Reece Keller
Alyn Tornell
Felix Pei
Xaq Pitkow
Leo Kozachkov
Aran Nayebi
26
0
0
30 May 2025
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Yilun Kong
Guozheng Ma
Qi Zhao
Haoyu Wang
Li Shen
Xueqian Wang
Dacheng Tao
MoE
OffRL
38
1
0
30 May 2025
Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning
Jiashun Liu
Zihao Wu
J. Obando-Ceron
Pablo Samuel Castro
Aaron Courville
L. Pan
31
0
0
29 May 2025
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners
Michal Nauman
Marek Cygan
Carmelo Sferrazza
Aviral Kumar
Pieter Abbeel
OffRL
100
0
0
29 May 2025
MRSD: Multi-Resolution Skill Discovery for HRL Agents
Shashank Sharma
Janina Hoffmann
Vinay P. Namboodiri
28
0
0
27 May 2025
Beyond Domain Randomization: Event-Inspired Perception for Visually Robust Adversarial Imitation from Videos
Andrea Ramazzina
Vittorio Giammarino
Matteo El-Hariry
Mario Bijelic
VGen
AAML
16
0
0
24 May 2025
ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos
Xiaodong Wang
Peixi Peng
VGen
1.1K
1
0
24 May 2025
Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning
Nicolas Castanet
Olivier Sigaud
Sylvain Lamprier
OffRL
116
0
0
23 May 2025
Maximum Total Correlation Reinforcement Learning
Bang You
Puze Liu
Huaping Liu
Jan Peters
Oleg Arenz
55
0
0
22 May 2025
World Models as Reference Trajectories for Rapid Motor Adaptation
Carlos Stein Brito
Daniel McNamee
50
0
0
21 May 2025
Saliency-Aware Quantized Imitation Learning for Efficient Robotic Control
Seongmin Park
Hyungmin Kim
Sangwoo Kim
Wonseok Jeon
Juyoung Yang
Byeongwook Jeon
Yoonseon Oh
Jungwook Choi
197
0
0
21 May 2025
TD-GRPC: Temporal Difference Learning with Group Relative Policy Constraint for Humanoid Locomotion
Khang Nguyen
Khai Nguyen
An T. Le
Jan Peters
Manfred Huber
Ngo Anh Vien
Minh Nhat Vu
68
0
0
19 May 2025
TeleOpBench: A Simulator-Centric Benchmark for Dual-Arm Dexterous Teleoperation
Hangyu Li
Qin Zhao
Haoran Xu
Xinyu Jiang
Qingwei Ben
...
Jia Zeng
Hanqing Wang
Bo Dai
Junting Dong
Jiangmiao Pang
109
1
0
19 May 2025
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
66
0
0
17 May 2025
Zero-Shot Visual Generalization in Robot Manipulation
Sumeet Batra
Gaurav Sukhatme
79
0
0
16 May 2025
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
Kehan Long
Jorge Cortés
Nikolay Atanasov
115
1
0
16 May 2025
Learning Diverse Natural Behaviors for Enhancing the Agility of Quadrupedal Robots
Huiqiao Fu
Haoyu Dong
Wentao Xu
Zhehao Zhou
Guizhou Deng
Kaiqiang Tang
D. Dong
Chunlin Chen
69
0
0
15 May 2025
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Yansen Wang
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRL
LLMAG
LM&Ro
LRM
67
0
0
15 May 2025
Approximated Behavioral Metric-based State Projection for Federated Reinforcement Learning
Zengxia Guo
Bohui An
Zhongqi Lu
FedML
75
0
0
15 May 2025
ADD: Physics-Based Motion Imitation with Adversarial Differential Discriminators
Ziyu Zhang
S. Bashkirov
Dun Yang
Michael Taylor
Xue Bin Peng
88
0
0
08 May 2025
Trajectory Entropy Reinforcement Learning for Predictable and Robust Control
Bang You
Chenxu Wang
Huaping Liu
65
0
0
07 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
133
0
0
04 May 2025
Wasserstein Policy Optimization
David Pfau
Ian Davies
Diana Borsa
Joao G. M. Araujo
Brendan D. Tracey
H. V. Hasselt
91
1
0
01 May 2025
Q-function Decomposition with Intervention Semantics with Factored Action Spaces
Junkyu Lee
Tian Gao
Elliot Nelson
Miao Liu
D. Bhattacharjya
Songtao Lu
OffRL
93
0
0
30 Apr 2025
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
Mingqi Yuan
Qi Wang
Guozheng Ma
Yue Liu
Xin Jin
Yunbo Wang
Xiaokang Yang
Wenjun Zeng
D. Tao
OffRL
AI4CE
109
0
0
24 Apr 2025
Solving New Tasks by Adapting Internet Video Knowledge
Calvin Luo
Zilai Zeng
Yilun Du
Chen Sun
113
6
0
21 Apr 2025
MAPLE: Encoding Dexterous Robotic Manipulation Priors Learned From Egocentric Videos
Alexey Gavryushin
Xi Wang
Robert J. S. Malate
Chenyu Yang
Xiaojun Jia
Shubh Goel
Davide Liconti
René Zurbrugg
Robert K. Katzschmann
Marc Pollefeys
94
2
0
08 Apr 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
232
0
0
05 Apr 2025
Adapting World Models with Latent-State Dynamics Residuals
JB Lanier
Kyungmin Kim
Armin Karamzade
Yifei Liu
Ankita Sinha
Kat He
Davide Corsi
Roy Fox
83
0
0
03 Apr 2025
Evolutionary Policy Optimization
Jianren Wang
Yifan Su
Abhinav Gupta
Deepak Pathak
86
0
0
24 Mar 2025
Bootstrapped Model Predictive Control
Yuhang Wang
Hanwei Guo
Sizhe Wang
Long Qian
Xuguang Lan
122
1
0
24 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
82
0
0
23 Mar 2025
LaMOuR: Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning
Chan Kim
Seung-Woo Seo
Seong-Woo Kim
OODD
457
0
0
21 Mar 2025
VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences
Anukriti Singh
Amisha Bhaskar
Peihong Yu
Souradip Chakraborty
Ruthwik Dasyam
Amrit Singh Bedi
Pratap Tokekar
118
0
0
18 Mar 2025
Low-cost Real-world Implementation of the Swing-up Pendulum for Deep Reinforcement Learning Experiments
Peter Böhm
Pauline Pounds
Archie C. Chapman
70
0
0
14 Mar 2025