1910.10897
Cited By
v1
v2 (latest)
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
24 October 2019
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
Papers citing
"Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning"
50 / 381 papers shown
Title
Diffusion Models for Robotic Manipulation: A Survey
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
125
2
0
01 Jul 2025
GoalLadder: Incremental Goal Discovery with Vision-Language Models
Alexey Zakharov
Shimon Whiteson
10
0
0
19 Jun 2025
CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion
Jiahua Ma
Yiran Qin
Yixiong Li
Xuanqi Liao
Yulan Guo
Ruimao Zhang
17
0
0
17 Jun 2025
An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models
Pranav Guruprasad
Yangyue Wang
Sudipta Chowdhury
Jaewoo Song
Harshvardhan Sikka
27
0
0
10 Jun 2025
Time-Aware World Model for Adaptive Prediction and Control
Anh N. Nhu
Sanghyun Son
Ming-Chyuan Lin
AI4TS
TTA
34
0
0
10 Jun 2025
Self-Adapting Improvement Loops for Robotic Learning
Calvin Luo
Zilai Zeng
Mingxi Jia
Yilun Du
Chen Sun
17
0
0
07 Jun 2025
Is Optimal Transport Necessary for Inverse Reinforcement Learning?
Zixuan Dong
Yumi Omori
Keith Ross
10
0
0
07 Jun 2025
Gradient Similarity Surgery in Multi-Task Deep Learning
Thomas Borsani
Andrea Rosani
Giuseppe Nicosia
Giuseppe Di Fatta
MedIm
45
0
0
06 Jun 2025
Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning
Motoki Omura
Kazuki Ota
Takayuki Osa
Yusuke Mukuta
Tatsuya Harada
OffRL
39
0
0
06 Jun 2025
Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving
Li Zeqiao
Wang Yijing
Wang Haoyu
Li Zheng
Li Peng
Zuo Zhiqiang
Hu Chuan
112
0
0
04 Jun 2025
Self-Composing Policies for Scalable Continual Reinforcement Learning
Mikel Malagón
Josu Ceberio
Jose A. Lozano
CLL
35
5
0
04 Jun 2025
FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens
Yiming Zhong
Yumeng Liu
Chuyang Xiao
Zemin Yang
Youzhuo Wang
Yufei Zhu
Ye-ling Shi
Yujing Sun
X. Zhu
Yuexin Ma
52
0
0
02 Jun 2025
CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries
Ni Mu
Hao Hu
Xiao Hu
Yiqin Yang
Bo Xu
Qing-Shan Jia
29
0
0
31 May 2025
LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks
Yi Yang
Jiaxuan Sun
Siqi Kou
Yihan Wang
Zhijie Deng
LM&Ro
25
0
0
31 May 2025
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
Yilun Kong
Guozheng Ma
Qi Zhao
Haoyu Wang
Li Shen
Xueqian Wang
Dacheng Tao
MoE
OffRL
31
1
0
30 May 2025
Bigger, Regularized, Categorical: High-Capacity Value Functions are Efficient Multi-Task Learners
Michal Nauman
Marek Cygan
Carmelo Sferrazza
Aviral Kumar
Pieter Abbeel
OffRL
96
0
0
29 May 2025
TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning
Yuhui Chen
Haoran Li
Zhennan Jiang
Haowei Wen
Dongbin Zhao
51
0
0
26 May 2025
WorldEval: World Model as Real-World Robot Policies Evaluator
Yaxuan Li
Yichen Zhu
Junjie Wen
Chaomin Shen
Yi Xu
OffRL
VGen
12
0
0
25 May 2025
Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning
Nicolas Castanet
Olivier Sigaud
Sylvain Lamprier
OffRL
108
0
0
23 May 2025
AnyBody: A Benchmark Suite for Cross-Embodiment Manipulation
Meenal Parakh
Alexandre Kirchmeyer
Beining Han
Jia Deng
LM&Ro
144
0
0
21 May 2025
Object-Centric Representations Improve Policy Generalization in Robot Manipulation
Alexandre Chapin
Bruno Machado
Emmanuel Dellandrea
Liming Chen
OCL
134
0
0
16 May 2025
ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations
Jiahui Zhang
Yusen Luo
Abrar Anwar
Sumedh Anand Sontakke
Joseph J Lim
Jesse Thomason
Erdem Biyik
Jesse Zhang
OffRL
LM&Ro
126
0
0
16 May 2025
H³DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning
Yiyang Lu
Yufeng Tian
Zhecheng Yuan
Xinyu Wang
Pu Hua
Zhengrong Xue
Huazhe Xu
98
1
0
12 May 2025
Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria
Fengming Zhu
Fangzhen Lin
60
0
0
11 May 2025
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach
Minting Pan
Yitao Zheng
Jiajian Li
Yunbo Wang
Xiaokang Yang
OffRL
122
0
0
10 May 2025
CLAM: Continuous Latent Action Models for Robot Learning from Unlabeled Demonstrations
Anthony Liang
Pavel Czempin
Matthew Hong
Yutai Zhou
Erdem Biyik
Stephen Tu
148
1
0
08 May 2025
Policy-labeled Preference Learning: Is Preference Enough for RLHF?
Taehyun Cho
Seokhun Ju
Seungyub Han
Dohyeong Kim
Kyungjae Lee
Jungwoo Lee
OffRL
112
0
0
06 May 2025
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Haoran Geng
Feishi Wang
Songlin Wei
Yuchen Li
Bangjun Wang
...
Hao Dong
Siyuan Huang
Yue Wang
Jitendra Malik
Pieter Abbeel
182
8
0
26 Apr 2025
Solving New Tasks by Adapting Internet Video Knowledge
Calvin Luo
Zilai Zeng
Yilun Du
Chen Sun
111
6
0
21 Apr 2025
DiffOG: Differentiable Policy Trajectory Optimization with Generalizability
Zhengtong Xu
Zichen Miao
Qiang Qiu
Zhe Zhang
Yu She
170
0
0
18 Apr 2025
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
Yuxuan Li
Yicheng Gao
Ning Yang
Stephen Xia
OffRL
131
0
0
08 Apr 2025
Harmonia: A Multi-Agent Reinforcement Learning Approach to Data Placement and Migration in Hybrid Storage Systems
Rakesh Nadig
Vamanan Arulchelvan
Rahul Bera
Taha Shahroodi
Gagandeep Singh
Mohammad Sadrosadati
Jisung Park
Onur Mutlu
133
0
0
26 Mar 2025
VARP: Reinforcement Learning from Vision-Language Model Feedback with Agent Regularized Preferences
Anukriti Singh
Amisha Bhaskar
Peihong Yu
Souradip Chakraborty
Ruthwik Dasyam
Amrit Singh Bedi
Pratap Tokekar
106
0
0
18 Mar 2025
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
Xin Wen
Bingchen Zhao
Yilun Chen
Jiangmiao Pang
Xiaojuan Qi
LM&Ro
222
0
0
10 Mar 2025
Mastering Continual Reinforcement Learning through Fine-Grained Sparse Network Allocation and Dormant Neuron Exploration
Chengqi Zheng
Haiyan Yin
Jianda Chen
Terence Ng
Yew-Soon Ong
Ivor Tsang
CLL
438
0
0
07 Mar 2025
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
Hyungkyu Kang
Min-hwan Oh
OffRL
119
0
0
07 Mar 2025
Multi-Task Reinforcement Learning Enables Parameter Scaling
Reginald McLean
Evangelos Chataroulas
Jordan Terry
Isaac Woungang
Nariman Farsad
Pablo Samuel Castro
LRM
152
1
0
07 Mar 2025
RA-DP: Rapid Adaptive Diffusion Policy for Training-Free High-frequency Robotics Replanning
Xi Ye
Rui Heng Yang
Jun Jin
Yiming Li
Amir Rasouli
73
0
0
06 Mar 2025
Teaching Metric Distance to Autoregressive Multimodal Foundational Models
Jiwan Chung
Saejin Kim
Yongrae Jo
Jinho Park
Dongjun Min
Youngjae Yu
250
0
0
04 Mar 2025
Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning
Adrià López Escoriza
Nicklas Hansen
Stone Tao
Tongzhou Mu
H. Su
OffRL
90
1
0
03 Mar 2025
Falcon: Fast Visuomotor Policies via Partial Denoising
Haojun Chen
Minghao Liu
Xiaojian Ma
Zailin Ma
Huimin Wu
...
Yuanpei Chen
Yifan Zhong
Mingzhi Wang
Qing Li
Yaodong Yang
VGen
145
1
0
01 Mar 2025
Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks
Luise Ge
Michael Lanier
Anindya Sarkar
Bengisu Guresti
Yevgeniy Vorobeychik
Chongjie Zhang
192
0
0
26 Feb 2025
Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning
Jaehyeon Son
Soochan Lee
Gunhee Kim
OffRL
129
4
0
26 Feb 2025
Yes, Q-learning Helps Offline In-Context RL
Denis Tarasov
Alexander Nikulin
Ilya Zisman
Albina Klepach
Andrei Polubarov
Nikita Lyubaykin
Alexander Derevyagin
Igor Kiselev
Vladislav Kurenkov
OffRL
OnRL
490
3
0
24 Feb 2025
DemoGen: Synthetic Demonstration Generation for Data-Efficient Visuomotor Policy Learning
Zhengrong Xue
Shuying Deng
Zhenyang Chen
Yixuan Wang
Zhecheng Yuan
Huazhe Xu
114
9
0
24 Feb 2025
MILE: Model-based Intervention Learning
Yigit Korkmaz
Erdem Bıyık
146
2
0
21 Feb 2025
Efficient Evaluation of Multi-Task Robot Policies With Active Experiment Selection
Abrar Anwar
Rohan Gupta
Zain Merchant
Sayan Ghosh
Willie Neiswanger
Jesse Thomason
OffRL
194
1
0
14 Feb 2025
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Egor Cherepanov
Nikita Kachaev
A. Kovalev
Aleksandr I. Panov
OffRL
164
0
0
14 Feb 2025
PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning
Angel Villar-Corrales
Sven Behnke
231
4
0
11 Feb 2025
Skill Expansion and Composition in Parameter Space
Tenglong Liu
Junjie Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
127
4
0
09 Feb 2025