ResearchTrend.AI
MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning

6 January 2024
Rafael Rafailov
Kyle Hatch
Victor Kolev
John D. Martin
Mariano Phielipp
Chelsea Finn
    OffRL
    OnRL
Papers citing "MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning"

30 / 30 papers shown
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
Yuhui Chen
Shuai Tian
Shugao Liu
Yingting Zhou
Haoran Li
Dongbin Zhao
OffRL
152
7
0
08 Feb 2025
The Evolution of Reinforcement Learning in Quantitative Finance: A Survey
Nikolaos Pippas
Cagatay Turkay
Elliot A. Ludvig
AIFin
147
3
0
20 Aug 2024
Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Eli Bronstein
Mark Palatucci
Dominik Notz
Brandyn White
Alex Kuefler
...
Punit Shah
Evan Racah
Benjamin Frenkel
Shimon Whiteson
Drago Anguelov
65
58
0
18 Oct 2022
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
186
810
0
12 May 2022
StretchBEV: Stretching Future Instance Prediction Spatially and Temporally
Adil Kaan Akan
Fatma Guney
43
47
0
25 Mar 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
276
899
0
12 Oct 2021
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
Catherine Cang
Aravind Rajeswaran
Pieter Abbeel
Michael Laskin
OffRL
63
29
0
16 Jun 2021
Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage
Jonathan D. Chang
Masatoshi Uehara
Dhruv Sreenivas
Rahul Kidambi
Wen Sun
OffRL
70
32
0
06 Jun 2021
FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras
Anthony Hu
Zak Murez
Nikhil C. Mohan
Sofía Dudas
Jeffrey Hawke
Vijay Badrinarayanan
R. Cipolla
Alex Kendall
173
258
0
21 Apr 2021
Offline Reinforcement Learning from Images with Latent Space Models
Rafael Rafailov
Tianhe Yu
Aravind Rajeswaran
Chelsea Finn
OffRL
61
127
0
21 Dec 2020
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
93
849
0
05 Oct 2020
On the model-based stochastic value gradient for continuous reinforcement learning
Brandon Amos
Samuel Stanton
Denis Yarats
A. Wilson
56
71
0
28 Aug 2020
Model-Based Offline Planning
Arthur Argenson
Gabriel Dulac-Arnold
OffRL
52
154
0
12 Aug 2020
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
50
83
0
12 Aug 2020
AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
Ashvin Nair
Abhishek Gupta
Murtaza Dalal
Sergey Levine
OffRL
OnRL
88
607
0
16 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
131
1,806
0
08 Jun 2020
Model-Augmented Actor-Critic: Backpropagating through Paths
I. Clavera
Yao Fu
Pieter Abbeel
61
88
0
16 May 2020
MOReL : Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
85
668
0
12 May 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
210
1,359
0
15 Apr 2020
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
108
1,349
0
03 Dec 2019
When to Trust Your Model: Model-Based Policy Optimization
Michael Janner
Justin Fu
Marvin Zhang
Sergey Levine
OffRL
83
948
0
19 Jun 2019
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds
Andrea Zanette
Emma Brunskill
OffRL
95
275
0
01 Jan 2019
Learning Latent Dynamics for Planning from Pixels
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
BDL
84
1,430
0
12 Nov 2018
Model-Based Reinforcement Learning via Meta-Policy Optimization
I. Clavera
Jonas Rothfuss
John Schulman
Yasuhiro Fujita
Tamim Asfour
Pieter Abbeel
66
226
0
14 Sep 2018
World Models
David R Ha
Jürgen Schmidhuber
SyDa
113
1,075
0
27 Mar 2018
Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning
Vladimir Feinberg
Alvin Wan
Ion Stoica
Michael I. Jordan
Joseph E. Gonzalez
Sergey Levine
OffRL
56
317
0
28 Feb 2018
Model-Ensemble Trust-Region Policy Optimization
Thanard Kurutach
I. Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
65
451
0
28 Feb 2018
CARLA: An Open Urban Driving Simulator
Alexey Dosovitskiy
G. Ros
Felipe Codevilla
Antonio M. López
V. Koltun
VLM
133
5,146
0
10 Nov 2017
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
131
3,098
0
10 Jun 2016
Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
Manuel Watter
Jost Tobias Springenberg
Joschka Boedecker
Martin Riedmiller
BDL
60
844
0
24 Jun 2015