Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.07864
Cited By
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
10 October 2024
Songming Liu
Lingxuan Wu
Bangguo Li
Hengkai Tan
Huayu Chen
Zhengyi Wang
Ke Xu
Hang Su
Jun Zhu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation"
50 / 57 papers shown
Title
Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions
Wei Zhao
Gongsheng Li
Zhefei Gong
Pengxiang Ding
H. Zhao
Donglin Wang
LM&Ro
22
0
0
16 May 2025
VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation
Chaofan Zhang
Peng Hao
Xiaoge Cao
Xiaoshuai Hao
Shaowei Cui
Shuo Wang
32
0
0
14 May 2025
From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation
Yifu Yuan
Haiqin Cui
Yibin Chen
Zibin Dong
Fei Ni
Longxin Kou
Jinyi Liu
Pengyi Li
Yan Zheng
Jianye Hao
31
0
0
13 May 2025
Training Strategies for Efficient Embodied Reasoning
William Chen
Suneel Belkhale
Suvir Mirchandani
Oier Mees
Danny Driess
Karl Pertsch
Sergey Levine
OffRL
LRM
23
0
0
13 May 2025
GelFusion: Enhancing Robotic Manipulation under Visual Constraints via Visuotactile Fusion
Shulong Jiang
Shiqi Zhao
Yuxuan Fan
Peng Yin
34
0
0
12 May 2025
D-CODA: Diffusion for Coordinated Dual-Arm Data Augmentation
Isabella Liu
Jason Chen
Gaurav Sukhatme
Daniel Seita
52
0
0
08 May 2025
RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration
Huajie Tan
Xiaoshuai Hao
Minglan Lin
Pengwei Wang
Yaoxu Lyu
Mingyu Cao
Zhongyuan Wang
S. Zhang
LM&Ro
48
0
0
06 May 2025
DeCo: Task Decomposition and Skill Composition for Zero-Shot Generalization in Long-Horizon 3D Manipulation
Zixuan Chen
Junhui Yin
Yangtao Chen
Jing Huo
Pinzhuo Tian
Jieqi Shi
Yiwen Hou
Yongqian Li
Yang Gao
35
0
0
01 May 2025
A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI
Lik Hang Kenny Wong
Xueyang Kang
Kaixin Bai
Jianwei Zhang
56
0
0
01 May 2025
PRISM: Projection-based Reward Integration for Scene-Aware Real-to-Sim-to-Real Transfer with Few Demonstrations
Haowen Sun
Haoran Wang
Chengzhong Ma
Shaolong Zhang
Jiawei Ye
Xingyu Chen
Xuguang Lan
OffRL
53
1
0
29 Apr 2025
STDArm: Transferring Visuomotor Policies From Static Data Training to Dynamic Robot Manipulation
YiFan Duan
Heng Li
Yilong Wu
Wenhao Yu
Xinran Zhang
Yedong Shen
Jianmin Ji
Yuhang Zhang
45
0
0
26 Apr 2025
Demonstrating DVS: Dynamic Virtual-Real Simulation Platform for Mobile Robotic Tasks
Zijie Zheng
Zeshun Li
Yunpeng Wang
Qinghongbing Xie
Long Zeng
63
0
0
26 Apr 2025
π
0.5
π_{0.5}
π
0.5
: a Vision-Language-Action Model with Open-World Generalization
Physical Intelligence
Kevin Black
Noah Brown
James Darpinian
Karan Dhabalia
...
Homer Walke
Anna Walling
Haohuan Wang
Lili Yu
Ury Zhilinsky
LM&Ro
VLM
39
10
0
22 Apr 2025
Adversarial Locomotion and Motion Imitation for Humanoid Policy Learning
Jiyuan Shi
Xinzhe Liu
Dewei Wang
Ouyang Lu
Sören Schwertfeger
Fuchun Sun
Chenjia Bai
X. Li
47
0
0
19 Apr 2025
DiffOG: Differentiable Policy Trajectory Optimization with Generalizability
Zhengtong Xu
Zichen Miao
Qiang Qiu
Zhe Zhang
Yu She
60
0
0
18 Apr 2025
A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation
Rongtao Xu
J. Zhang
Minghao Guo
Youpeng Wen
H. Yang
...
Liqiong Wang
Yuxuan Kuang
Meng Cao
Feng Zheng
Xiaodan Liang
47
3
0
17 Apr 2025
Efficient Task-specific Conditional Diffusion Policies: Shortcut Model Acceleration and SO(3) Optimization
Haiyong Yu
Yanqiong Jin
Yonghao He
Wei Sui
32
0
0
14 Apr 2025
Diffusion Models for Robotic Manipulation: A Survey
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
51
1
0
11 Apr 2025
Disambiguate Gripper State in Grasp-Based Tasks: Pseudo-Tactile as Feedback Enables Pure Simulation Learning
Yifei Yang
Lu Chen
Zherui Song
Yenan Chen
Wentao Sun
Zhongxiang Zhou
R. Xiong
Yixuan Wang
38
0
0
31 Mar 2025
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
Zhi Hou
Tianyi Zhang
Yuwen Xiong
Haonan Duan
Hengjun Pu
...
Chengyang Zhao
X. Zhu
Yu Qiao
Jifeng Dai
Y. Chen
59
1
0
25 Mar 2025
DataPlatter: Boosting Robotic Manipulation Generalization with Minimal Costly Data
Liming Zheng
Feng Yan
Fanfan Liu
C. Feng
Yufeng Zhong
Yiyang Huang
Lin Ma
47
0
0
25 Mar 2025
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Zihao Liu
Xing Liu
Yizhai Zhang
Zhengxiong Liu
Panfeng Huang
66
0
0
19 Mar 2025
Learning Bimanual Manipulation via Action Chunking and Inter-Arm Coordination with Transformers
Tomohiro Motoda
Ryo Hanai
Ryoichi Nakajo
Masaki Murooka
Floris Erich
Y. Domae
52
1
0
18 Mar 2025
MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation
Zhenyu Wu
Yuheng Zhou
Xiuwei Xu
Zehua Wang
Haibin Yan
49
2
0
17 Mar 2025
Modality-Composable Diffusion Policy via Inference-Time Distribution-level Composition
Jiahang Cao
Qiang Zhang
Hanzhong Guo
Jiaxu Wang
Hao-Ran Cheng
Renjing Xu
DiffM
60
0
0
16 Mar 2025
Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills
Haoqi Yuan
Yu Bai
Yuhui Fu
Bohan Zhou
Yicheng Feng
Xinrun Xu
Yi Zhan
Börje F. Karlsson
Zongqing Lu
LM&Ro
88
0
0
16 Mar 2025
Adversarial Data Collection: Human-Collaborative Perturbations for Efficient and Robust Robotic Imitation Learning
Siyuan Huang
Yue Liao
Siyuan Feng
Shu Jiang
Si Liu
Hongsheng Li
Maoqing Yao
Guanghui Ren
AAML
60
1
0
14 Mar 2025
Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework
Jian-Jian Jiang
Xiao-Ming Wu
Yi-Xiang He
Ling-an Zeng
Yi-Lin Wei
Dandan Zhang
Wei-Shi Zheng
48
2
0
13 Mar 2025
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
Jiaming Liu
Hao Chen
Pengju An
Zhuoyang Liu
Renrui Zhang
...
Chengkai Hou
Mengdi Zhao
KC alex Zhou
Pheng-Ann Heng
S. Zhang
72
8
0
13 Mar 2025
Towards Fast, Memory-based and Data-Efficient Vision-Language Policy
Haoxuan Li
Sixu Yan
Yongqian Li
Xinggang Wang
LM&Ro
64
0
0
13 Mar 2025
TPDiff: Temporal Pyramid Video Diffusion Model
L. Ran
Mike Zheng Shou
80
0
0
12 Mar 2025
Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter
Kechun Xu
Xunlong Xia
Kaixuan Wang
Yifei Yang
Yunxuan Mao
Bing Deng
R. Xiong
Yixuan Wang
OffRL
72
0
0
12 Mar 2025
TLA: Tactile-Language-Action Model for Contact-Rich Manipulation
Peng Hao
Chaofan Zhang
Dingzhe Li
Xiaoge Cao
Xiaoshuai Hao
Shaowei Cui
Shuo Wang
LM&Ro
52
7
0
11 Mar 2025
FP3: A 3D Foundation Policy for Robotic Manipulation
Rujia Yang
Geng Chen
Chuan Wen
Yang Gao
LM&Ro
78
1
0
11 Mar 2025
SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Safe Reinforcement Learning
Borong Zhang
Yuhao Zhang
Yalan Qin
Yingshan Lei
Josef Dai
Yuanpei Chen
Yaodong Yang
66
4
0
05 Mar 2025
AirExo-2: Scaling up Generalizable Robotic Imitation Learning with Low-Cost Exoskeletons
Hongjie Fang
Chenxi Wang
Yiming Wang
J. Chen
Shangning Xia
...
Xinyu Zhan
Lixin Yang
Weiming Wang
Cewu Lu
Hao-Shu Fang
84
1
0
05 Mar 2025
Generative Artificial Intelligence in Robotic Manipulation: A Survey
Anton van den Hengel
Peng Yun
Jun Cen
Junhao Cai
DiDi Zhu
...
Qifeng Chen
Jia Pan
Wei Zhang
Bo Yang
Hua Chen
59
1
0
05 Mar 2025
A Taxonomy for Evaluating Generalist Robot Policies
Jensen Gao
Suneel Belkhale
Sudeep Dasari
Ashwin Balakrishna
Dhruv Shah
Dorsa Sadigh
LM&Ro
50
4
0
03 Mar 2025
Diffusion Stabilizer Policy for Automated Surgical Robot Manipulations
Chonlam Ho
Jianshu Hu
Haoran Wang
Qi Dou
Yutong Ban
MedIm
73
1
0
03 Mar 2025
Fast Visuomotor Policies via Partial Denoising
Haojun Chen
Minghao Liu
Xiaojian Ma
Zailin Ma
Huimin Wu
...
Yuanpei Chen
Yifan Zhong
Mingzhi Wang
Qing Li
Yaodong Yang
VGen
31
0
0
01 Mar 2025
VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers
Ziang Guo
Konstantin Gubernatorov
Selamawit Asfaw
Zakhar Yagudin
Dzmitry Tsetserukou
47
1
0
27 Feb 2025
Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models
Lucy Xiaoyang Shi
Brian Ichter
Michael Equi
Liyiming Ke
Karl Pertsch
...
Adrian Li-Bell
Danny Driess
Lachy Groom
Sergey Levine
Chelsea Finn
LM&Ro
LRM
95
7
0
26 Feb 2025
BFA: Best-Feature-Aware Fusion for Multi-View Fine-grained Manipulation
Zihan Lan
Weixin Mao
Hao Li
Le Wang
Tiancai Wang
Haoqiang Fan
Osamu Yoshie
EgoV
67
2
0
20 Feb 2025
Towards Fusing Point Cloud and Visual Representations for Imitation Learning
Atalay Donat
Xiaogang Jia
Xi Huang
Aleksandar Taranovic
Denis Blessing
Ge Li
Hongyi Zhou
Hanyi Zhang
Rudolf Lioutikov
Gerhard Neumann
3DPC
SSL
73
1
0
20 Feb 2025
IMLE Policy: Fast and Sample Efficient Visuomotor Policy Learning via Implicit Maximum Likelihood Estimation
Krishan Rana
Robert Lee
David Pershouse
Niko Suenderhauf
VGen
56
0
0
17 Feb 2025
Generative Multi-Agent Collaboration in Embodied AI: A Systematic Review
Di Wu
Xian Wei
Guang Chen
Hao Shen
Xiangfeng Wang
Wenhao Li
Bo Jin
65
2
0
17 Feb 2025
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation
Kun Wu
Chengkai Hou
Jiaming Liu
Zhengping Che
Xiaozhu Ju
...
Zhenyu Wang
Pengju An
Siyuan Qian
S. Zhang
Jian Tang
LM&Ro
113
15
0
17 Feb 2025
3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning
Guoqin Tang
Qingxuan Jia
Zeyuan Huang
Gang Chen
Ning Ji
Zhipeng Yao
66
0
0
13 Feb 2025
Imit Diff: Semantics Guided Diffusion Transformer with Dual Resolution Fusion for Imitation Learning
Yuhang Dong
Haizhou Ge
Yupei Zeng
Jingyang Zhang
Beiwen Tian
...
Yufei Jia
Ruixiang Wang
Ran Yi
Guyue Zhou
Longhua Ma
56
0
0
11 Feb 2025
Embrace Collisions: Humanoid Shadowing for Deployable Contact-Agnostics Motions
Ziwen Zhuang
Hang Zhao
46
6
0
03 Feb 2025
1
2
Next