1910.10897
Cited By
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
24 October 2019
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
Papers citing
"Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning"
50 / 381 papers shown
Title
Learning Fused State Representations for Control from Multi-View Observations
Zeyu Wang
Yao Li
Xin Li
Hongyu Zang
Romain Laroche
Riashat Islam
OffRL
171
1
0
03 Feb 2025
Reinforcement Learning of Flexible Policies for Symbolic Instructions with Adjustable Mapping Specifications
Wataru Hatanaka
R. Yamashina
Takamitsu Matsubara
237
0
0
31 Jan 2025
Episodic Novelty Through Temporal Distance
Y. Jiang
Qihan Liu
Yiqin Yang
Xiaoteng Ma
Dianyu Zhong
...
Jun Yang
Bin Liang
Bo Xu
Chongjie Zhang
Qianchuan Zhao
OffRL
103
1
0
28 Jan 2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
Xintong Duan
Yutong He
Fahim Tajwar
Wen-Tse Chen
Ruslan Salakhutdinov
Jeff Schneider
OffRL
AI4CE
153
1
0
22 Jan 2025
Gradient-Based Multi-Objective Deep Learning: Algorithms, Theories, Applications, and Beyond
Weiyu Chen
Xiaoyuan Zhang
Baijiong Lin
Xi Lin
Han Zhao
Qingfu Zhang
James T. Kwok
172
5
0
19 Jan 2025
Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning
Anthony Kobanda
Rémy Portelas
Odalric-Ambrym Maillard
Ludovic Denoyer
OffRL
CLL
176
1
0
19 Dec 2024
Grounding Video Models to Actions through Goal Conditioned Exploration
Yunhao Luo
Yilun Du
LM&Ro
VGen
141
5
0
11 Nov 2024
Problem Space Transformations for Out-of-Distribution Generalisation in Behavioural Cloning
Kiran Doshi
Marco Bagatella
Stelian Coros
70
1
0
06 Nov 2024
N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs
Ilya Zisman
Alexander Nikulin
Andrei Polubarov
Nikita Lyubaykin
Vladislav Kurenkov
Igor Kiselev
OffRL
134
2
0
04 Nov 2024
Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation
Daehee Lee
Minjong Yoo
Woo Kyung Kim
Wonje Choi
Honguk Woo
CLL
184
8
0
30 Oct 2024
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
Thomas Schmied
Thomas Adler
Vihang Patil
M. Beck
Korbinian Poppel
Johannes Brandstetter
Günter Klambauer
Razvan Pascanu
Sepp Hochreiter
201
7
0
29 Oct 2024
OGBench: Benchmarking Offline Goal-Conditioned RL
Seohong Park
Kevin Frans
Benjamin Eysenbach
Sergey Levine
OffRL
146
29
0
26 Oct 2024
On-Robot Reinforcement Learning with Goal-Contrastive Rewards
Ondrej Biza
Thomas Weng
Lingfeng Sun
Karl Schmeckpeper
Tarik Kelestemur
Yecheng Jason Ma
Robert Platt
Jan-Willem van de Meent
Lawson L. S. Wong
OffRL
147
0
0
25 Oct 2024
MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning
Suning Huang
Zheyu Zhang
Tianhai Liang
Yihan Xu
Zhehao Kou
Chenhao Lu
Guowei Xu
Zhengrong Xue
Huazhe Xu
MoE
125
7
0
19 Oct 2024
VideoAgent: Self-Improving Video Generation
Achint Soni
Sreyas Venkataraman
Abhranil Chandra
Sebastian Fischmeister
Percy Liang
Bo Dai
Sherry Yang
LM&Ro
VGen
151
11
0
14 Oct 2024
TOP-ERL: Transformer-based Off-Policy Episodic Reinforcement Learning
Ge Li
Dong Tian
Hongyi Zhou
Xinkai Jiang
Rudolf Lioutikov
Gerhard Neumann
OffRL
514
4
0
12 Oct 2024
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
C. Voelcker
Marcel Hussing
Eric Eaton
Amir-massoud Farahmand
Igor Gilitschenski
126
5
0
11 Oct 2024
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
Haoyi Zhu
Honghui Yang
Yating Wang
Jiange Yang
Limin Wang
Tong He
3DH
126
9
0
10 Oct 2024
ConML: A Universal Meta-Learning Framework with Task-Level Contrastive Learning
Shiguang Wu
Yaqing Wang
Yatao Bian
Quanming Yao
CLL
137
0
0
08 Oct 2024
Active Fine-Tuning of Multi-Task Policies
Marco Bagatella
Jonas Hübotter
Georg Martius
Andreas Krause
149
0
0
07 Oct 2024
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
Ehsan Futuhi
Shayan Karimi
Chao Gao
Martin Müller
99
1
0
07 Oct 2024
MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents
Junpeng Yue
Xinru Xu
Börje F. Karlsson
Zongqing Lu
116
1
0
04 Oct 2024
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Jie Cheng
Ruixi Qiao
Gang Xiong
Binhua Li
Yingwei Ma
Yongbin Li
Yisheng Lv
OffRL
OnRL
LM&Ro
133
4
0
01 Oct 2024
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Chenyou Fan
Chenjia Bai
Zhao Shan
Haoran He
Yang Zhang
Zhen Wang
100
3
0
30 Sep 2024
Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation
Kun Wu
Yichen Zhu
Jinming Li
Junjie Wen
Ning Liu
Zhiyuan Xu
Qinru Qiu
184
8
0
27 Sep 2024
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation
Junjie Wen
Yinlin Zhu
Jinming Li
Minjie Zhu
Kun Wu
...
Ran Cheng
Yaxin Peng
Chaomin Shen
Feifei Feng
Jian Tang
LM&Ro
176
70
0
19 Sep 2024
HiRT: Enhancing Robotic Control with Hierarchical Robot Transformers
Jianke Zhang
Yanjiang Guo
Xiaoyu Chen
Yen-Jen Wang
Yucheng Hu
Chengming Shi
Jianyu Chen
92
13
0
12 Sep 2024
One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment Locomotion
Nico Bohlinger
Grzegorz Czechmanowski
Maciej Krupka
Piotr Kicki
Krzysztof Walas
Jan Peters
Davide Tateo
109
21
0
10 Sep 2024
CHIRPs: Change-Induced Regret Proxy metrics for Lifelong Reinforcement Learning
John Birkbeck
Adam Sobey
Federico Cerutti
Katherine Heseltine Hurley Flynn
Timothy J. Norman
88
0
0
05 Sep 2024
Semantically Controllable Augmentations for Generalizable Robot Learning
Zoey Chen
Zhao Mandi
Homanga Bharadhwaj
Mohit Sharma
Shuran Song
Abhishek Gupta
Vikash Kumar
LM&Ro
105
7
0
02 Sep 2024
Advances in Preference-based Reinforcement Learning: A Review
Youssef Abdelkareem
Shady Shehata
Fakhri Karray
OffRL
96
10
0
21 Aug 2024
Model-Based Transfer Learning for Contextual Reinforcement Learning
Jung-Hoon Cho
Vindula Jayawardana
Sirui Li
Cathy Wu
OffRL
158
0
0
08 Aug 2024
Robust Fast Adaptation from Adversarially Explicit Task Distribution Generation
Cheems Wang
Yiqin Lv
Yixiu Mao
Yun Qu
Yi Tian Xu
Xiangyang Ji
OOD
TTA
155
7
0
28 Jul 2024
Graceful task adaptation with a bi-hemispheric RL agent
Grant Nicholas
L. Kuhlmann
Gideon Kowadlo
72
0
0
16 Jul 2024
Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation
Jiaming Zhou
Teli Ma
Kun-Yu Lin
Ronghe Qiu
Zifan Wang
Junwei Liang
149
7
0
20 Jun 2024
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
Alexander Nikulin
Ilya Zisman
Alexey Zemtsov
Viacheslav Sinii
199
7
0
13 Jun 2024
Grounding Multimodal Large Language Models in Actions
Andrew Szot
Bogdan Mazoure
Harsh Agrawal
Devon Hjelm
Z. Kira
Alexander Toshev
LM&Ro
88
14
0
12 Jun 2024
ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation
Guanxing Lu
Zifeng Gao
Tianxing Chen
Wen-Dao Dai
Ziwei Wang
Yansong Tang
DiffM
174
20
0
03 Jun 2024
Learning Manipulation by Predicting Interaction
Jia Zeng
Qingwen Bu
Bangjun Wang
Wenke Xia
Li Chen
...
Heming Cui
Bin Zhao
Xuelong Li
Yu Qiao
Hongyang Li
128
26
0
01 Jun 2024
Ego-Foresight: Self-supervised Learning of Agent-Aware Representations for Improved RL
Manuel S. Nunes
Atabak Dehban
Y. Demiris
J. Santos-Victor
97
0
0
27 May 2024
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation
Kuo-Han Hung
Pang-Chi Lo
Jia-Fong Yeh
Han-Yuan Hsu
Yi-Ting Chen
Winston H. Hsu
157
0
0
26 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
330
54
0
23 May 2024
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
Matthew E. Taylor
OffRL
126
2
0
30 Apr 2024
What Foundation Models can Bring for Robot Learning in Manipulation : A Survey
Dingzhe Li
Yixiang Jin
A. Yong
Yong A
Hongze Yu
...
Huaping Liu
Gang Hua
F. Sun
Jianwei Zhang
Bin Fang
AI4CE
LM&Ro
217
15
0
28 Apr 2024
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback
Qiwei Di
Jiafan He
Quanquan Gu
110
1
0
16 Apr 2024
Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
Yibo Wang
Jiang Zhao
OffRL
OnRL
90
0
0
31 Mar 2024
RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences
Jie Cheng
Gang Xiong
Xingyuan Dai
Qinghai Miao
Yisheng Lv
Fei-Yue Wang
114
19
0
27 Feb 2024
Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior
Kechun Xu
Zhongxiang Zhou
Jun Wu
Haojian Lu
Rong Xiong
Yue Wang
106
3
0
23 Feb 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Shuang Ma
Hal Daumé
Huazhe Xu
John Langford
Praveen Palanisamy
Kalyan Shankar Basu
Furong Huang
103
8
0
09 Feb 2024
DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning
Weikang Wan
Ziyu Wang
Yufei Wang
Zackory M. Erickson
David Held
123
4
0
08 Feb 2024