2505.20795
Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt
27 May 2025
Xiang Zhu
Yichen Liu
Hezhong Li
Jianyu Chen
Papers citing
"Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt"
38 / 38 papers shown
Latent Action Pretraining from Videos
Seonghyeon Ye
Joel Jang
Byeongguk Jeon
Sejune Joo
Jianwei Yang
...
Kimin Lee
J. Gao
Luke Zettlemoyer
Dieter Fox
Minjoon Seo
56
34
0
15 Oct 2024
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Songming Liu
Lingxuan Wu
Bangguo Li
Hengkai Tan
Huayu Chen
Zhengyi Wang
Ke Xu
Hang Su
Jun Zhu
72
102
0
10 Oct 2024
Sampling from Energy-based Policies using Diffusion
V. Jain
Tara Akhound-Sadegh
Siamak Ravanbakhsh
DiffM
109
2
0
02 Oct 2024
Gen2Act: Human Video Generation in Novel Scenarios enables Generalizable Robot Manipulation
Homanga Bharadhwaj
Debidatta Dwibedi
Abhinav Gupta
Shubham Tulsiani
Carl Doersch
Ted Xiao
Dhruv Shah
Fei Xia
Dorsa Sadigh
Sean Kirmani
VGen
LM&Ro
71
32
0
24 Sep 2024
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild
Rolandos Alexandros Potamias
Jinglei Zhang
Jiankang Deng
Stefanos Zafeiriou
3DH
64
12
0
18 Sep 2024
Contrast, Imitate, Adapt: Learning Robotic Skills From Raw Human Videos
Zhifeng Qian
Mingyu You
Hongjun Zhou
Xuanhui Xu
Hao Fu
Jinzhe Xue
Bin He
88
1
0
10 Aug 2024
Generative Image as Action Models
Mohit Shridhar
Yat Long Lo
Stephen James
64
10
0
10 Jul 2024
Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals
Moritz Reuss
Ömer Erdinç Yagmurlu
Fabian Wenzel
Rudolf Lioutikov
OffRL
61
47
0
08 Jul 2024
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
Junbang Liang
Ruoshi Liu
Ege Ozguroglu
Sruthi Sudhakar
Achal Dave
P. Tokmakov
Shuran Song
Carl Vondrick
VGen
61
26
0
24 Jun 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
83
15
0
02 Jun 2024
Diffusion Actor-Critic with Entropy Regulator
Yinuo Wang
Likun Wang
Yuxuan Jiang
Wenjun Zou
Tong Liu
...
Wenxuan Wang
Liming Xiao
Jiang Wu
Jingliang Duan
Shengbo Eben Li
DiffM
64
14
0
24 May 2024
Octo: An Open-Source Generalist Robot Policy
Octo Model Team
Dibya Ghosh
Homer Walke
Karl Pertsch
Kevin Black
...
Quan Vuong
Ted Xiao
Dorsa Sadigh
Chelsea Finn
Sergey Levine
126
397
0
20 May 2024
ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos
Zerui Chen
Shizhe Chen
Etienne Arlaud
Ivan Laptev
Cordelia Schmid
72
16
0
24 Apr 2024
Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Vidhi Jain
Maria Attarian
Nikhil J. Joshi
Ayzaan Wahid
Danny Driess
...
Stefan Welker
Christine Chan
Igor Gilitschenski
Yonatan Bisk
Debidatta Dwibedi
78
31
0
19 Mar 2024
DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation
Chen Wang
Haochen Shi
Weizhuo Wang
Ruohan Zhang
Li Fei-Fei
Karen Liu
82
114
0
12 Mar 2024
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
Ruoqing Zhang
Ziwei Luo
Jens Sjölund
Thomas B. Schön
Per Mattsson
61
12
0
06 Feb 2024
General Flow as Foundation Affordance for Scalable Robot Learning
Chengbo Yuan
Chuan Wen
Tong Zhang
Yang Gao
AI4CE
48
34
0
21 Jan 2024
Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation
Yuanchen Ju
Kaizhe Hu
Guowei Zhang
Gu Zhang
Mingrun Jiang
Huazhe Xu
60
43
0
15 Jan 2024
Any-point Trajectory Modeling for Policy Learning
Chuan Wen
Xingyu Lin
John So
Kai-xiang Chen
Qi Dou
Yang Gao
Pieter Abbeel
PINN
VGen
73
88
0
28 Dec 2023
RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches
Jiayuan Gu
Sean Kirmani
Paul Wohlhart
Yao Lu
Montse Gonzalez Arenas
...
Hao Su
Karol Hausman
Chelsea Finn
Q. Vuong
Ted Xiao
43
69
0
03 Nov 2023
DEFT: Dexterous Fine-Tuning for Real-World Hand Policies
Aditya Kannan
Kenneth Shaw
Shikhar Bahl
Pragna Mannam
Deepak Pathak
20
19
0
30 Oct 2023
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Xi Chen
...
Ted Xiao
Peng Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&Ro
LRM
86
1,172
0
28 Jul 2023
XSkill: Cross Embodiment Skill Discovery
Mengda Xu
Zhenjia Xu
Cheng Chi
Manuela Veloso
Shuran Song
49
71
0
19 Jul 2023
AnyTeleop: A General Vision-Based Dexterous Robot Arm-Hand Teleoperation System
Yuzhe Qin
Wei Yang
Binghao Huang
Karl Van Wyk
Hao Su
Xiaolong Wang
Yu-Wei Chao
Dieter Fox
69
98
0
10 Jul 2023
Learning Video-Conditioned Policies for Unseen Manipulation Tasks
Elliot Chane-Sane
Cordelia Schmid
Ivan Laptev
38
19
0
10 May 2023
Goal-Conditioned Imitation Learning using Score-based Diffusion Policies
Moritz Reuss
M. Li
Xiaogang Jia
Rudolf Lioutikov
DiffM
87
167
0
05 Apr 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Cheng Chi
Zhenjia Xu
S. Feng
Eric A. Cousineau
Yilun Du
Benjamin Burchfiel
Russ Tedrake
Shuran Song
224
1,138
0
07 Mar 2023
MimicPlay: Long-Horizon Imitation Learning by Watching Human Play
Chen Wang
Linxi Fan
Jiankai Sun
Ruohan Zhang
Li Fei-Fei
Danfei Xu
Yuke Zhu
Anima Anandkumar
84
188
0
24 Feb 2023
One-shot Visual Imitation via Attributed Waypoints and Demonstration Augmentation
Matthew Chang
Saurabh Gupta
47
6
0
09 Feb 2023
Learning Universal Policies via Text-Guided Video Generation
Yilun Du
Mengjiao Yang
Bo Dai
H. Dai
Ofir Nachum
J. Tenenbaum
Dale Schuurmans
Pieter Abbeel
PINN
LM&Ro
57
241
0
31 Jan 2023
Imitating Human Behaviour with Diffusion Models
Tim Pearce
Tabish Rashid
Anssi Kanervisto
David Bignell
Mingfei Sun
...
Sergio Valcarcel Macua
Shan Zheng Tan
Ida Momennejad
Katja Hofmann
Sam Devlin
DiffM
67
212
0
25 Jan 2023
Siamese Prototypical Contrastive Learning
Shentong Mo
Zhun Sun
Chao Li
SSL
34
13
0
18 Aug 2022
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
Yuzhe Qin
Yueh-hua Wu
Shaowei Liu
Hanwen Jiang
Ruihan Yang
Yang Fu
Xiaolong Wang
154
191
0
12 Aug 2021
Learning by Watching: Physical Imitation of Manipulation Skills from Human Videos
Haoyu Xiong
Quanzhou Li
Yun-Chun Chen
Homanga Bharadhwaj
Samarth Sinha
Animesh Garg
SSL
135
94
0
18 Jan 2021
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM
SyDa
268
6,293
0
26 Nov 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
299
17,550
0
19 Jun 2020
Prototypical Contrastive Learning of Unsupervised Representations
Junnan Li
Pan Zhou
Caiming Xiong
Steven C.H. Hoi
SSL
DRL
115
970
0
11 May 2020
AVID: Learning Multi-Stage Tasks via Pixel-Level Translation of Human Videos
Laura M. Smith
Nikita Dhawan
Marvin Zhang
Pieter Abbeel
Sergey Levine
111
158
0
10 Dec 2019