ResearchTrend.AI
TD-MPC2: Scalable, Robust World Models for Continuous Control


25 October 2023
Nicklas Hansen
Hao Su
Xiaolong Wang
    MU

Papers citing "TD-MPC2: Scalable, Robust World Models for Continuous Control"

50 / 106 papers shown
Reward-free World Models for Online Imitation Learning
Shangzhe Li
Zhiao Huang
H. Su
OffRL
67
1
0
17 Oct 2024
Make the Pertinent Salient: Task-Relevant Reconstruction for Visual Control with Distractions
Kyungmin Kim
JB Lanier
Pierre Baldi
Charless C. Fowlkes
Roy Fox
35
1
0
13 Oct 2024
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Hyunseung Kim
Jun Jet Tai
K. Subramanian
Peter R. Wurman
Jaegul Choo
Peter Stone
Takuma Seno
OffRL
78
8
0
13 Oct 2024
Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models
Jacob Levy
T. Westenbroek
David Fridovich-Keil
47
7
0
11 Oct 2024
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
C. Voelcker
Marcel Hussing
Eric Eaton
OffRL
31
3
0
11 Oct 2024
Zero-Shot Offline Imitation Learning via Optimal Transport
Thomas Rupf
Marco Bagatella
Nico Gürtler
Jonas Frey
Georg Martius
OffRL
243
0
0
11 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
46
2
0
11 Oct 2024
Zero-Shot Generalization of Vision-Based RL Without Data Augmentation
Sumeet Batra
Gaurav Sukhatme
OffRL
DRL
41
2
0
09 Oct 2024
Diffusion Model Predictive Control
Guangyao Zhou
Sivaramakrishnan Swaminathan
Rajkumar Vasudeva Raju
J. S. Guntupalli
Wolfgang Lehrach
Joseph Ortiz
Antoine Dedieu
Miguel Lázaro-Gredilla
Kevin P. Murphy
39
8
0
07 Oct 2024
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti
Carl Henrik Ek
Amanda Prorok
50
0
0
07 Oct 2024
Open-World Reinforcement Learning over Long Short-Term Imagination
Jiajian Li
Q. Wang
Yunbo Wang
Xin Jin
Yang Li
Wenjun Zeng
Xiaokang Yang
OCL
VLM
69
1
0
04 Oct 2024
Grounded Answers for Multi-agent Decision-making Problem through Generative World Model
Zeyang Liu
Xinrui Yang
Shiguang Sun
Long Qian
Lipeng Wan
Xingyu Chen
Xuguang Lan
44
3
0
03 Oct 2024
ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI
Stone Tao
Fanbo Xiang
Arth Shukla
Yuzhe Qin
Xander Hinrichsen
...
Zhiao Huang
Roberto Calandra
Rui Chen
Shan Luo
Hao Su
25
28
0
01 Oct 2024
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Jie Cheng
Ruixi Qiao
Gang Xiong
Binhua Li
Yingwei Ma
Binhua Li
Yongbin Li
Yisheng Lv
OffRL
OnRL
LM&Ro
50
3
0
01 Oct 2024
Model-Free versus Model-Based Reinforcement Learning for Fixed-Wing UAV Attitude Control Under Varying Wind Conditions
David Olivares
Pierre Fournier
Pavan Vasishta
Julien Marzat
27
0
0
26 Sep 2024
One-shot World Models Using a Transformer Trained on a Synthetic Prior
Fabio Ferreira
Moreno Schlageter
Raghu Rajan
André Biedenkapp
Frank Hutter
41
0
0
21 Sep 2024
PIP-Loco: A Proprioceptive Infinite Horizon Planning Framework for Quadrupedal Robot Locomotion
Aditya Shirwatkar
Naman Saxena
Kishore Chandra
Shishir Kolathaya
52
3
0
14 Sep 2024
MPPI-Generic: A CUDA Library for Stochastic Trajectory Optimization
Bogdan I. Vlahov
Jason Gibson
Manan S. Gandhi
Evangelos A. Theodorou
39
0
0
11 Sep 2024
MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBench
Moritz Meser
Aditya Bhatt
Boris Belousov
Jan Peters
29
2
0
01 Aug 2024
QT-TDM: Planning with Transformer Dynamics Model and Autoregressive Q-Learning
Mostafa Kotb
C. Weber
Muhammad Burhan Hafez
Stefan Wermter
41
1
0
26 Jul 2024
Residual-MPPI: Online Policy Customization for Continuous Control
Pengcheng Wang
Chenran Li
Catherine Weaver
Kenta Kawamoto
Masayoshi Tomizuka
Chen Tang
Wei Zhan
OffRL
37
3
0
01 Jul 2024
Learning Abstract World Model for Value-preserving Planning with Options
Rafael Rodríguez-Sánchez
George Konidaris
46
1
0
22 Jun 2024
Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models
Yang Zhang
Chenjia Bai
Bin Zhao
Junchi Yan
Xiu Li
Xuelong Li
OffRL
27
0
0
22 Jun 2024
UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Yuan Pu
Yazhe Niu
Jiyuan Ren
Zhenjie Yang
Hongsheng Li
Yu Liu
OffRL
54
1
0
15 Jun 2024
Leveraging Locality to Boost Sample Efficiency in Robotic Manipulation
Tong Zhang
Yingdong Hu
Jiacheng You
Yang Gao
33
7
0
15 Jun 2024
RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model
Hantao Zhou
Tianying Ji
Lukas Sommerhalder
Michael Goerner
Norman Hendrich
Jianwei Zhang
Fuchun Sun
Huazhe Xu
50
0
0
14 Jun 2024
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yi Ma
Pengyi Li
Yan Zheng
DiffM
58
9
0
13 Jun 2024
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?
Denis Tarasov
Kirill Brilliantov
Dmitrii Kharlapenko
OffRL
41
2
0
10 Jun 2024
iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning
Aidan Scannell
Kalle Kujanpää
Yi Zhao
Mohammadreza Nakhaei
Dieter Büchler
Joni Pajarinen
SSL
60
5
0
04 Jun 2024
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
Yu-Juan Luo
Tianying Ji
Gang Hua
Jianwei Zhang
Huazhe Xu
Xianyuan Zhan
OffRL
OnRL
50
2
0
28 May 2024
Adaptive Horizon Actor-Critic for Policy Learning in Contact-Rich Differentiable Simulation
Ignat Georgiev
K. Srinivasan
Jie Xu
Eric Heiden
Animesh Garg
43
8
0
28 May 2024
A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning
Abdulaziz Almuzairee
Nicklas Hansen
Henrik I. Christensen
50
7
0
27 May 2024
Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Michal Nauman
M. Ostaszewski
Krzysztof Jankowski
Piotr Miłoś
Marek Cygan
OffRL
52
17
0
25 May 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
Jialong Wu
Shaofeng Yin
Ningya Feng
Xu He
Dong Li
Haifeng Zhang
Mingsheng Long
VGen
54
26
0
24 May 2024
Learning Latent Dynamic Robust Representations for World Models
Ruixiang Sun
Hongyu Zang
Xin-hui Li
Riashat Islam
41
5
0
10 May 2024
Overcoming Knowledge Barriers: Online Imitation Learning from Visual Observation with Pretrained World Models
Xingyuan Zhang
Philip Becker-Ehmck
Patrick van der Smagt
Maximilian Karl
OffRL
52
0
0
29 Apr 2024
Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization
Sai Prasanna
Karim Farid
Raghu Rajan
André Biedenkapp
65
2
0
16 Mar 2024
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation
Carmelo Sferrazza
Dun-Ming Huang
Xingyu Lin
Youngwoon Lee
Pieter Abbeel
60
37
0
15 Mar 2024
ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
Guanxing Lu
Shiyi Zhang
Ziwei Wang
Changliu Liu
Jiwen Lu
Yansong Tang
59
51
0
13 Mar 2024
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Marcel Hussing
C. Voelcker
Igor Gilitschenski
Amir-massoud Farahmand
Eric Eaton
47
3
0
09 Mar 2024
3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
Yanjie Ze
Gu Zhang
Kangning Zhang
Chenyuan Hu
Muhan Wang
Huazhe Xu
VGen
45
80
0
06 Mar 2024
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL
Jesse Farebrother
Jordi Orbay
Q. Vuong
Adrien Ali Taïga
Yevgen Chebotar
...
Sergey Levine
Pablo Samuel Castro
Aleksandra Faust
Aviral Kumar
Rishabh Agarwal
OffRL
61
57
0
06 Mar 2024
Language-guided Skill Learning with Temporal Variational Inference
Haotian Fu
Pratyusha Sharma
Elias Stengel-Eskin
George Konidaris
Nicolas Le Roux
Marc-Alexandre Côté
Xingdi Yuan
41
7
0
26 Feb 2024
Task-conditioned adaptation of visual features in multi-task policy learning
Pierre Marza
L. Matignon
Olivier Simonin
Christian Wolf
53
2
0
12 Feb 2024
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg
A. Abdolmaleki
Jingwei Zhang
Oliver Groth
Michael Bloesch
...
Sarah Bechtle
Steven Kapturowski
Roland Hafner
N. Heess
Martin Riedmiller
OffRL
LRM
35
12
0
08 Feb 2024
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Zihan Ding
Amy Zhang
Yuandong Tian
Qinqing Zheng
OffRL
55
17
0
05 Feb 2024
MPC-Inspired Reinforcement Learning for Verifiable Model-Free Control
Yiwen Lu
Zishuo Li
Yihan Zhou
Na Li
Yilin Mo
27
2
0
08 Dec 2023
H-GAP: Humanoid Control with a Generalist Planner
Zhengyao Jiang
Yingchen Xu
Nolan Wagener
Yicheng Luo
Michael Janner
Edward Grefenstette
Tim Rocktäschel
Yuandong Tian
AI4CE
40
5
0
05 Dec 2023
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning
Ran Wei
Nathan Lambert
Anthony D. McDonald
Alfredo Garcia
Roberto Calandra
38
7
0
10 Oct 2023
Actor-Critic Model Predictive Control
Angel Romero
Yunlong Song
Davide Scaramuzza
52
36
0
16 Jun 2023