arXiv:2006.12983
dm_control: Software and Tasks for Continuous Control
22 June 2020
Yuval Tassa
S. Tunyasuvunakool
Alistair Muldal
Yotam Doron
Piotr Trochim
Siqi Liu
Steven Bohez
J. Merel
Tom Erez
Timothy Lillicrap
N. Heess
LM&Ro
Papers citing "dm_control: Software and Tasks for Continuous Control"
50 / 96 papers shown
Title
Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings
Andreas Sochopoulos
Nikolay Malkin
Nikolaos Tsagkas
João Moura
Michael Gienger
S. Vijayakumar
50
1
0
02 May 2025
Quattro: Transformer-Accelerated Iterative Linear Quadratic Regulator Framework for Fast Trajectory Optimization
Yue Wang
Hoayu Wang
Zhaoxing Li
49
0
0
02 Apr 2025
Uncertainty Representations in State-Space Layers for Deep Reinforcement Learning under Partial Observability
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
78
1
0
20 Feb 2025
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi Ma
DiffM
93
24
0
17 Feb 2025
Mirror Descent Actor Critic via Bounded Advantage Learning
Ryo Iwaki
93
0
0
06 Feb 2025
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
57
1
0
11 Nov 2024
State Chrono Representation for Enhancing Generalization in Reinforcement Learning
Jianda Chen
Wen Zheng Terence Ng
Zichen Chen
Sinno Jialin Pan
Tianwei Zhang
OffRL
37
0
0
09 Nov 2024
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRL
DiffM
114
2
0
23 Oct 2024
Reward-free World Models for Online Imitation Learning
Shangzhe Li
Zhiao Huang
H. Su
OffRL
67
1
0
17 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
41
1
0
11 Oct 2024
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
Haoyi Zhu
Honghui Yang
Yating Wang
Jiange Yang
Limin Wang
Tong He
3DH
51
6
0
10 Oct 2024
Latent Space Energy-based Neural ODEs
Sheng Cheng
Deqian Kong
Jianwen Xie
Kookjin Lee
Ying Nian Wu
Yezhou Yang
DiffM
155
1
0
05 Sep 2024
An Examination of Offline-Trained Encoders in Vision-Based Deep Reinforcement Learning for Autonomous Driving
S. Mohammed
Alp Argun
Nicolas Bonnotte
Gerd Ascheid
OffRL
28
0
0
02 Sep 2024
TimeLDM: Latent Diffusion Model for Unconditional Time Series Generation
Jian Qian
Miao Sun
Sifan Zhou
Biao Wan
Minhao Li
Patrick Chiang
41
7
0
05 Jul 2024
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
40
0
0
14 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
42
2
0
11 Jun 2024
Phase-Amplitude Reduction-Based Imitation Learning
Satoshi Yamamori
Jun Morimoto
26
0
0
06 Jun 2024
Effective Reinforcement Learning Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
40
0
0
15 Apr 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
36
3
0
09 Mar 2024
Improving a Proportional Integral Controller with Reinforcement Learning on a Throttle Valve Benchmark
Paul Daoudi
B. Mavkov
Bogdan Robu
Christophe Prieur
Emmanuel Witrant
M. Barlier
Ludovic Dos Santos
28
2
0
21 Feb 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg
A. Abdolmaleki
Jingwei Zhang
Oliver Groth
Michael Bloesch
...
Sarah Bechtle
Steven Kapturowski
Roland Hafner
N. Heess
Martin Riedmiller
OffRL
LRM
27
12
0
08 Feb 2024
Transductive Reward Inference on Graph
B. Qu
Xiaofeng Cao
Qing Guo
Yi Chang
Ivor W. Tsang
Chengqi Zhang
OffRL
38
0
0
06 Feb 2024
ALMANACS: A Simulatability Benchmark for Language Model Explainability
Edmund Mills
Shiye Su
Stuart J. Russell
Scott Emmons
54
7
0
20 Dec 2023
Vision-Language Models as a Source of Rewards
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
Harris Chan
Gheorghe Comanici
...
Yannick Schroecker
Stephen Spencer
Richie Steigerwald
Luyu Wang
Lei Zhang
VLM
LRM
45
26
0
14 Dec 2023
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Jaeuk Shin
Giho Kim
Howon Lee
Joonho Han
Insoon Yang
OffRL
33
1
0
09 Dec 2023
EduGym: An Environment and Notebook Suite for Reinforcement Learning Education
Thomas M. Moerland
Matthias Muller-Brockhausen
Zhao Yang
Andrius Bernatavicius
Koen Ponse
Tom Kouwenhoven
Andreas Sauter
Michiel van der Meer
Bram M. Renting
Aske Plaat
OffRL
31
0
0
17 Nov 2023
Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula
Aryaman Reddi
Maximilian Tölle
Jan Peters
Georgia Chalvatzaki
Carlo D'Eramo
42
4
0
03 Nov 2023
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
41
20
0
10 Oct 2023
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning
Trevor A. McInroe
Adam Jelley
Stefano V. Albrecht
Amos Storkey
OffRL
OnRL
28
6
0
09 Oct 2023
Foundational Policy Acquisition via Multitask Learning for Motor Skill Generation
Satoshi Yamamori
Jun Morimoto
26
0
0
31 Aug 2023
Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning
Md Masudur Rahman
Yexiang Xue
29
4
0
29 Aug 2023
Simplified Temporal Consistency Reinforcement Learning
Yi Zhao
Wenshuai Zhao
Rinu Boney
Arno Solin
Joni Pajarinen
OffRL
30
13
0
15 Jun 2023
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation
David Brandfonbrener
Ofir Nachum
Joan Bruna
AI4CE
26
21
0
26 May 2023
Conditional Mutual Information for Disentangled Representations in Reinforcement Learning
Mhairi Dunion
Trevor A. McInroe
K. Luck
Josiah P. Hanna
Stefano V. Albrecht
OOD
DRL
22
17
0
23 May 2023
Reducing the Cost of Cycle-Time Tuning for Real-World Policy Optimization
Homayoon Farrahi
Rupam Mahmood
26
5
0
09 May 2023
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Tuomas Haarnoja
Ben Moran
Guy Lever
Sandy H. Huang
Dhruva Tirumala
...
Andrea Huber
N. Hurley
F. Nori
R. Hadsell
N. Heess
50
143
0
26 Apr 2023
Hierarchical State Abstraction Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Angsheng Li
Chunyang Liu
Lifang He
Philip S. Yu
31
18
0
24 Apr 2023
A State Augmentation based approach to Reinforcement Learning from Human Preferences
Mudit Verma
Subbarao Kambhampati
33
2
0
17 Feb 2023
NeuronsGym: A Hybrid Framework and Benchmark for Robot Tasks with Sim2Real Policy Learning
Haoran Li
Shasha Liu
Mingjun Ma
Guangzheng Hu
Yaran Chen
Dong Zhao
27
3
0
07 Feb 2023
Neural Episodic Control with State Abstraction
Zhuo Li
Derui Zhu
Yujing Hu
Xiaofei Xie
L. Ma
Yan Zheng
Yan Song
Yingfeng Chen
Jianjun Zhao
OffRL
20
14
0
27 Jan 2023
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen
Bo Liu
M. Zhou
Shufang Hou
Zhe Cao
Chenyang Le
Jingxiao Chen
Zheng Tian
Weinan Zhang
Jun Wang
AI4CE
26
10
0
24 Dec 2022
Few-Shot Preference Learning for Human-in-the-Loop RL
Joey Hejna
Dorsa Sadigh
OffRL
32
92
0
06 Dec 2022
Learning General World Models in a Handful of Reward-Free Deployments
Yingchen Xu
Jack Parker-Holder
Aldo Pacchiano
Philip J. Ball
Oleh Rybkin
Stephen J. Roberts
Tim Rocktaschel
Edward Grefenstette
OffRL
57
9
0
23 Oct 2022
Solving Continuous Control via Q-learning
Tim Seyde
Peter Werner
Wilko Schwarting
Igor Gilitschenski
Martin Riedmiller
Daniela Rus
Markus Wulfmeier
OffRL
LRM
35
22
0
22 Oct 2022
RMBench: Benchmarking Deep Reinforcement Learning for Robotic Manipulator Control
Yanfei Xiang
Xin Wang
Shu Hu
Bin Zhu
Xiaomeng Huang
Xi Wu
Siwei Lyu
SSL
29
5
0
20 Oct 2022
On Uncertainty in Deep State Space Models for Model-Based Reinforcement Learning
P. Becker
Gerhard Neumann
30
9
0
17 Oct 2022
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning
Daesol Cho
D. Shim
H. J. Kim
OffRL
42
11
0
30 Sep 2022
Learning Dexterous Manipulation from Exemplar Object Trajectories and Pre-Grasps
Sudeep Dasari
Abhi Gupta
Vikash Kumar
52
42
0
22 Sep 2022
Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Zijian Gao
Yiying Li
Kele Xu
Yuanzhao Zhai
Dawei Feng
Bo Ding
Xinjun Mao
Huaimin Wang
38
0
0
24 Aug 2022
MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control
Nolan Wagener
Andrey Kolobov
Felipe Vieira Frujeri
Ricky Loynd
Ching-An Cheng
Matthew J. Hausknecht
27
21
0
15 Aug 2022