Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.00564
Cited By
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
1 October 2024
Jie Cheng
Ruixi Qiao
Gang Xiong
Binhua Li
Yingwei Ma
Binhua Li
Yongbin Li
Yisheng Lv
OffRL
OnRL
LM&Ro
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"
3 / 3 papers shown
Title
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Jie Cheng
Ruixi Qiao
Lijun Li
Chao Guo
J. Z. Wang
Gang Xiong
Yisheng Lv
Fei-Yue Wang
LRM
139
0
0
21 Apr 2025
DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation
Jiangran Lyu
Ziming Li
Xuesong Shi
Chaoyi Xu
Yizhou Wang
He Wang
47
0
0
21 Mar 2025
Skill Expansion and Composition in Parameter Space
Tenglong Liu
J. Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
56
4
0
09 Feb 2025
1