2207.12141
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
25 July 2022
Xiyao Wang
Wichayaporn Wongkamjan
Furong Huang
Papers citing "Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy"
9 / 9 papers shown
Title
Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning
Xiyao Wang
Linfeng Song
Ye Tian
Dian Yu
Baolin Peng
Haitao Mi
Furong Huang
Dong Yu
LRM
55
11
0
09 Oct 2024
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption
Bernd Frauenknecht
Artur Eisele
Devdutt Subhasish
Friedrich Solowjow
Sebastian Trimpe
49
5
0
29 May 2024
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching
Yecheng Jason Ma
K. Sivakumar
Jason Yan
Osbert Bastani
Dinesh Jayaraman
OffRL
MU
32
6
0
22 May 2023
Relative Policy-Transition Optimization for Fast Policy Transfer
Jiawei Xu
Cheng Zhou
Yizheng Zhang
Zhengyou Zhang
Lei Han
21
0
0
13 Jun 2022
ED2: Environment Dynamics Decomposition World Models for Continuous Control
Jianye Hao
Yifu Yuan
Cong Wang
Zhen Wang
OffRL
16
1
0
06 Dec 2021
On-Policy Model Errors in Reinforcement Learning
Lukas P. Frohlich
Maksym Lefarov
Melanie Zeilinger
Felix Berkenkamp
OnRL
57
6
0
15 Oct 2021
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach
Alexander Khazatsky
Sergey Levine
Ruslan Salakhutdinov
OffRL
206
44
0
06 Oct 2021
Large Batch Experience Replay
Thibault Lahire
M. Geist
Emmanuel Rachelson
OffRL
56
13
0
04 Oct 2021
Model-based Policy Optimization with Unsupervised Model Adaptation
Jian Shen
Han Zhao
Weinan Zhang
Yong Yu
30
27
0
19 Oct 2020