Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1801.00690
Cited By
DeepMind Control Suite
2 January 2018
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
Diego de Las Casas
David Budden
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM
LM&Ro
BDL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"DeepMind Control Suite"
50 / 312 papers shown
Title
SAINT: Attention-Based Modeling of Sub-Action Dependencies in Multi-Action Policies
Matthew Landers
Taylor W. Killian
Thomas Hartvigsen
Afsaneh Doryab
7
0
0
17 May 2025
Approximated Behavioral Metric-based State Projection for Federated Reinforcement Learning
Zengxia Guo
Bohui An
Zhongqi Lu
FedML
22
0
0
15 May 2025
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Yansen Wang
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRL
LLMAG
LM&Ro
LRM
19
0
0
15 May 2025
Learning Diverse Natural Behaviors for Enhancing the Agility of Quadrupedal Robots
Huiqiao Fu
Haoyu Dong
Wentao Xu
Zhehao Zhou
Guizhou Deng
Kaiqiang Tang
D. Dong
Chunlin Chen
24
0
0
15 May 2025
ADD: Physics-Based Motion Imitation with Adversarial Differential Discriminators
Ziyu Zhang
S. Bashkirov
Dun Yang
Michael Taylor
Xue Bin Peng
38
0
0
08 May 2025
Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning
Shangzhe Li
Zhiao Huang
Hao Su
71
0
0
04 May 2025
Q-function Decomposition with Intervention Semantics with Factored Action Spaces
Junkyu Lee
Tian Gao
Elliot Nelson
Miao Liu
D. Bhattacharjya
Songtao Lu
OffRL
45
0
0
30 Apr 2025
Solving New Tasks by Adapting Internet Video Knowledge
Calvin Luo
Zilai Zeng
Yilun Du
Chen Sun
33
1
0
21 Apr 2025
MInCo: Mitigating Information Conflicts in Distracted Visual Model-based Reinforcement Learning
Shiguang Sun
Hanbo Zhang
Zeyang Liu
Xinrui Yang
Lipeng Wan
Bing Yan
Xingyu Chen
Xuguang Lan
40
0
0
05 Apr 2025
LaMOuR: Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning
Chan Kim
Seung-Woo Seo
Seong-Woo Kim
OODD
211
0
0
21 Mar 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
Günter Klambauer
Razvan Pascanu
Sepp Hochreiter
75
5
0
21 Feb 2025
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning
Bryan L. M. de Oliveira
Murilo L. da Luz
Bruno Brandão
Luana G. B. Martins
Telma W. de L. Soares
Luckeciano C. Melo
OffRL
70
1
0
17 Feb 2025
Skill Expansion and Composition in Parameter Space
Tenglong Liu
Junjie Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
58
4
0
09 Feb 2025
Learning Fused State Representations for Control from Multi-View Observations
Zeyu Wang
Yao Li
Xin Li
Hongyu Zang
Romain Laroche
Riashat Islam
OffRL
54
0
0
03 Feb 2025
Dream to Fly: Model-Based Reinforcement Learning for Vision-Based Drone Flight
Angel Romero
Ashwin Shenai
Ismail Geles
Elie Aljalbout
Davide Scaramuzza
79
1
0
24 Jan 2025
RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning
Mingkang Wu
Devin White
Vernon J. Lawhern
Nicholas R. Waytowich
Yongcan Cao
OffRL
39
0
0
13 Jan 2025
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
57
1
0
11 Nov 2024
Approximate Equivariance in Reinforcement Learning
Jung Yeon Park
Sujay Bhatt
Sihan Zeng
Lawson L. S. Wong
Alec Koppel
Sumitra Ganesh
Robin Walters
37
1
0
06 Nov 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
62
9
0
26 Oct 2024
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Michael Noukhovitch
Shengyi Huang
Sophie Xhonneux
Arian Hosseini
Rishabh Agarwal
Rameswar Panda
OffRL
85
5
0
23 Oct 2024
Uncovering RL Integration in SSL Loss: Objective-Specific Implications for Data-Efficient RL
Ömer Veysel Çağatan
Barış Akgün
OffRL
34
0
0
22 Oct 2024
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Runzhe Wu
Yiding Chen
Gokul Swamy
Kianté Brantley
Wen Sun
DiffM
46
3
0
17 Oct 2024
Reward-free World Models for Online Imitation Learning
Shangzhe Li
Zhiao Huang
H. Su
OffRL
67
1
0
17 Oct 2024
Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control
Jinzhu Luo
Dingyang Chen
Qi Zhang
OffRL
26
0
0
16 Oct 2024
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning
Yarden As
Bhavya Sukhija
Lenart Treven
Carmelo Sferrazza
Stelian Coros
Andreas Krause
33
1
0
12 Oct 2024
Masked Generative Priors Improve World Models Sequence Modelling Capabilities
Cristian Meo
Mircea Lica
Zarif Ikram
Akihiro Nakano
Vedant Shah
Aniket Didolkar
Dianbo Liu
Anirudh Goyal
Justin Dauwels
OffRL
90
0
0
10 Oct 2024
Neuroplastic Expansion in Deep Reinforcement Learning
Jiashun Liu
J. Obando-Ceron
Rameswar Panda
L. Pan
42
3
0
10 Oct 2024
Zero-Shot Generalization of Vision-Based RL Without Data Augmentation
Sumeet Batra
Gaurav Sukhatme
OffRL
DRL
31
2
0
09 Oct 2024
Synthesizing Interpretable Control Policies through Large Language Model Guided Search
Carlo Bosio
Mark W. Mueller
29
0
0
07 Oct 2024
Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences
Z. Liu
Junjie Xu
Xingjiao Wu
J. Yang
Liang He
26
0
0
11 Sep 2024
Advances in Preference-based Reinforcement Learning: A Review
Youssef Abdelkareem
Shady Shehata
Fakhri Karray
OffRL
51
9
0
21 Aug 2024
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark
Nikita Chernyadev
Nicholas Backshall
Xiao Ma
Yunfan Lu
Younggyo Seo
Stephen James
22
11
0
10 Jul 2024
Frequency and Generalisation of Periodic Activation Functions in Reinforcement Learning
Augustine N. Mavor-Parker
Matthew J. Sargent
Caswell Barry
Lewis D. Griffin
Clare Lyle
47
2
0
09 Jul 2024
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers
Harald Semmelrock
Tony Ross-Hellauer
Simone Kopeinik
Dieter Theiler
Armin Haberl
Stefan Thalmann
Dominik Kowald
65
6
0
20 Jun 2024
BAKU: An Efficient Transformer for Multi-Task Policy Learning
Siddhant Haldar
Zhuoran Peng
Lerrel Pinto
OffRL
46
28
0
11 Jun 2024
Decoupling regularization from the action space
Sobhan Mohammadpour
Emma Frejinger
Pierre-Luc Bacon
37
0
0
10 Jun 2024
Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity
Calarina Muslimani
Bram Grooten
Deepak Ranganatha Sastry Mamillapalli
Mykola Pechenizkiy
Decebal Constantin Mocanu
Matthew E. Taylor
54
0
0
10 Jun 2024
NeoRL: Efficient Exploration for Nonepisodic RL
Bhavya Sukhija
Lenart Treven
Florian Dorfler
Stelian Coros
Andreas Krause
OffRL
38
0
0
03 Jun 2024
Value Improved Actor Critic Algorithms
Yaniv Oren
Moritz A. Zanger
Pascal R. van der Vaart
M. Spaan
Wendelin Bohmer
Wendelin Bohmer
OffRL
33
0
0
03 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
39
1
0
01 Jun 2024
Intrinsic Dynamics-Driven Generalizable Scene Representations for Vision-Oriented Decision-Making Applications
Dayang Liang
Jinyang Lai
Yunlong Liu
33
0
0
30 May 2024
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Alois Knoll
Ming Jin
42
1
0
26 May 2024
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Michal Nauman
M. Ostaszewski
Krzysztof Jankowski
Piotr Milo's
Marek Cygan
OffRL
47
16
0
25 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
82
43
0
23 May 2024
AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
Christopher Rawles
Sarah Clinckemaillie
Yifan Chang
Jonathan Waltz
Gabrielle Lau
...
Daniel Toyama
Robert Berry
Divya Tyamagundlu
Timothy Lillicrap
Oriana Riva
LLMAG
69
44
0
23 May 2024
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu
Yaran Chen
Dong Zhao
43
1
0
20 May 2024
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
David Valencia
Henry Williams
Trevor Gee
Bruce A MacDonaland
Minas V. Liarokapis
Minas Liarokapis
OffRL
34
2
0
04 May 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
Matthew E. Taylor
OffRL
46
2
0
30 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
27
0
0
31 Mar 2024
1
2
3
4
5
6
7
Next