ResearchTrend.AI
Track2Act: Predicting Point Tracks from Internet Videos enables Diverse Zero-shot Robot Manipulation

2 May 2024
Homanga Bharadhwaj
Roozbeh Mottaghi
Abhinav Gupta
Shubham Tulsiani
    3DPC
arXiv: 2405.01527

Papers citing "Track2Act: Predicting Point Tracks from Internet Videos enables Diverse Zero-shot Robot Manipulation"

21 / 21 papers shown
X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real
Prithwish Dan
K. Kedia
Angela Chao
Edward Weiyi Duan
Maximus Adrian Pace
Wei-Chiu Ma
Sanjiban Choudhury
23
0
0
11 May 2025
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
Haifeng Huang
Xinyi Chen
Y. Chen
Yiming Li
Xiaoshen Han
Z. Wang
Tai Wang
Jiangmiao Pang
Zhou Zhao
LM&Ro
80
0
0
30 Apr 2025
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
Artem Zholus
Carl Doersch
Yi Yang
Skanda Koppula
Viorica Patraucean
Xu He
Ignacio Rocco
Mehdi S. M. Sajjadi
Sarath Chandar
Ross Goroshin
30
0
0
08 Apr 2025
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
NVIDIA
Johan Bjorck
Fernando Castañeda
Nikita Cherniadev
Xingye Da
...
Ao Zhang
Hao Zhang
Yizhou Zhao
Ruijie Zheng
Yuke Zhu
VLM
68
22
0
18 Mar 2025
VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation
Hanzhi Chen
Boyang Sun
Anran Zhang
Marc Pollefeys
Stefan Leutenegger
LM&Ro
72
0
0
10 Mar 2025
HAMSTER: Hierarchical Action Models For Open-World Robot Manipulation
Yi Li
Yuquan Deng
Jing Zhang
Joel Jang
Marius Memmel
...
Fabio Ramos
Dieter Fox
Anqi Li
Abhishek Gupta
Ankit Goyal
LM&Ro
99
9
0
08 Feb 2025
Motion Tracks: A Unified Representation for Human-Robot Transfer in Few-Shot Imitation Learning
Juntao Ren
Priya Sundaresan
Dorsa Sadigh
Sanjiban Choudhury
Jeannette Bohg
37
14
0
13 Jan 2025
Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
Jiange Yang
Haoyi Zhu
Yuanda Wang
Gangshan Wu
Tong He
Limin Wang
100
2
0
21 Nov 2024
Grounding Video Models to Actions through Goal Conditioned Exploration
Yunhao Luo
Yilun Du
LM&Ro
VGen
85
1
0
11 Nov 2024
SPOT: SE(3) Pose Trajectory Diffusion for Object-Centric Manipulation
Cheng-Chun Hsu
Bowen Wen
Jie Xu
Yashraj S. Narang
Xiaolong Wang
Yuke Zhu
Joydeep Biswas
Stan Birchfield
DiffM
41
8
0
01 Nov 2024
Latent Action Pretraining from Videos
Seonghyeon Ye
Joel Jang
Byeongguk Jeon
Sejune Joo
Jianwei Yang
...
Kimin Lee
J. Gao
Luke Zettlemoyer
Dieter Fox
Minjoon Seo
35
27
0
15 Oct 2024
MotIF: Motion Instruction Fine-tuning
Minyoung Hwang
Joey Hejna
Dorsa Sadigh
Yonatan Bisk
49
1
0
16 Sep 2024
Embodiment-Agnostic Action Planning via Object-Part Scene Flow
Weiliang Tang
Jia-Hui Pan
Wei Zhan
Jianshu Zhou
Huaxiu Yao
Yun-Hui Liu
M. Tomizuka
Mingyu Ding
Chi-Wing Fu
50
0
0
16 Sep 2024
Hand-Object Interaction Pretraining from Videos
Himanshu Gaurav Singh
Antonio Loquercio
Carmelo Sferrazza
Jane Wu
Haozhi Qi
Pieter Abbeel
Jitendra Malik
44
13
0
12 Sep 2024
One-Shot Imitation under Mismatched Execution
K. Kedia
Prithwish Dan
Angela Chao
Maximus Adrian Pace
Sanjiban Choudhury
40
2
0
10 Sep 2024
Leveraging Object Priors for Point Tracking
Bikram Boote
Anh Thai
Wenqi Jia
Ozgur Kara
Stefan Stojanov
James M. Rehg
Sangmin Lee
3DPC
31
0
0
09 Sep 2024
RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation
Mel Vecerík
Carl Doersch
Yi Yang
Todor Davchev
Y. Aytar
Guangyao Zhou
R. Hadsell
Lourdes Agapito
Jonathan Scholz
53
47
0
30 Aug 2023
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
229
1,019
0
13 Oct 2021
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
Yuzhe Qin
Yueh-hua Wu
Shaowei Liu
Hanwen Jiang
Ruihan Yang
Yang Fu
Xiaolong Wang
125
188
0
12 Aug 2021
Learning by Watching: Physical Imitation of Manipulation Skills from Human Videos
Haoyu Xiong
Quanzhou Li
Yun-Chun Chen
Homanga Bharadhwaj
Samarth Sinha
Animesh Garg
SSL
128
93
0
18 Jan 2021
Where2Act: From Pixels to Actions for Articulated 3D Objects
Kaichun Mo
Leonidas J. Guibas
Mustafa Mukadam
Abhinav Gupta
Shubham Tulsiani
162
176
0
07 Jan 2021