Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2410.11758
Cited By
v1
v2 (latest)
Latent Action Pretraining from Videos
15 October 2024
Seonghyeon Ye
Joel Jang
Byeongguk Jeon
Sejune Joo
Jianwei Yang
Baolin Peng
Ajay Mandlekar
Reuben Tan
Yu-Wei Chao
Bill Yuchen Lin
Lars Liden
Kimin Lee
J. Gao
Luke Zettlemoyer
Dieter Fox
Minjoon Seo
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Latent Action Pretraining from Videos"
30 / 80 papers shown
Title
DEFT: Dexterous Fine-Tuning for Real-World Hand Policies
Aditya Kannan
Kenneth Shaw
Shikhar Bahl
Pragna Mannam
Deepak Pathak
65
19
0
30 Oct 2023
Video Language Planning
Yilun Du
Mengjiao Yang
Peter R. Florence
Fei Xia
Ayzaan Wahid
...
Pieter Abbeel
Josh Tenenbaum
L. Kaelbling
Andy Zeng
Jonathan Tompson
PINN
LM&Ro
165
95
0
16 Oct 2023
An Unbiased Look at Datasets for Visuo-Motor Pre-Training
Sudeep Dasari
Mohan Kumar Srirama
Unnat Jain
Abhinav Gupta
SSL
71
38
0
13 Oct 2023
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Open X-Embodiment Collaboration
Abby OÑeill
Abdul Rehman
Abhinav Gupta
Abhiram Maddukuri
...
Zhuo Xu
Zichen Jeff Cui
Zichen Zhang
Zipeng Fu
Zipeng Lin
LM&Ro
166
521
0
13 Oct 2023
Learning to Act from Actionless Videos through Dense Correspondences
Po-Chen Ko
Jiayuan Mao
Yilun Du
Shao-Hua Sun
Josh Tenenbaum
89
89
0
12 Oct 2023
Learning Interactive Real-World Simulators
Mengjiao Yang
Yilun Du
Kamyar Ghasemipour
Jonathan Tompson
Leslie Kaelbling
Dale Schuurmans
Pieter Abbeel
LM&Ro
PINN
69
212
0
09 Oct 2023
BridgeData V2: A Dataset for Robot Learning at Scale
Homer Walke
Kevin Black
Abraham Lee
Moo Jin Kim
Maximilian Du
...
Andre Wang He
Vivek Myers
Kuan Fang
Chelsea Finn
Sergey Levine
91
231
0
24 Aug 2023
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Xi Chen
...
Ted Xiao
Peng Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&Ro
LRM
160
1,276
0
28 Jul 2023
Affordances from Human Videos as a Versatile Representation for Robotics
Shikhar Bahl
Russell Mendonca
Lili Chen
Unnat Jain
Deepak Pathak
100
177
0
17 Apr 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
560
4,861
0
17 Apr 2023
MimicPlay: Long-Horizon Imitation Learning by Watching Human Play
Chen Wang
Linxi Fan
Jiankai Sun
Ruohan Zhang
Li Fei-Fei
Danfei Xu
Yuke Zhu
Anima Anandkumar
122
192
0
24 Feb 2023
Zero-Shot Robot Manipulation from Passive Human Videos
Homanga Bharadhwaj
Abhi Gupta
Shubham Tulsiani
Vikash Kumar
86
41
0
03 Feb 2023
Learning Universal Policies via Text-Guided Video Generation
Yilun Du
Mengjiao Yang
Bo Dai
H. Dai
Ofir Nachum
J. Tenenbaum
Dale Schuurmans
Pieter Abbeel
PINN
LM&Ro
99
258
0
31 Jan 2023
VideoDex: Learning Dexterity from Internet Videos
Kenneth Shaw
Shikhar Bahl
Deepak Pathak
87
95
0
08 Dec 2022
Interactive Language: Talking to Robots in Real Time
Corey Lynch
Ayzaan Wahid
Jonathan Tompson
Tianli Ding
James Betker
Robert Baruch
Travis Armstrong
Peter R. Florence
LM&Ro
91
228
0
12 Oct 2022
Phenaki: Variable Length Video Generation From Open Domain Textual Description
Ruben Villegas
Mohammad Babaeizadeh
Pieter-Jan Kindermans
Hernan Moraldo
Han Zhang
M. Saffar
Santiago Castro
Julius Kunze
D. Erhan
DiffM
VGen
127
394
0
05 Oct 2022
Learning Continuous Grasping Function with a Dexterous Hand from Human Demonstrations
Jianglong Ye
Jiashun Wang
Binghao Huang
Yuzhe Qin
Xiaolong Wang
165
52
0
11 Jul 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
128
298
0
23 Jun 2022
R3M: A Universal Visual Representation for Robot Manipulation
Suraj Nair
Aravind Rajeswaran
Vikash Kumar
Chelsea Finn
Abhi Gupta
LM&Ro
98
582
0
23 Mar 2022
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
399
1,107
0
13 Oct 2021
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
Yuzhe Qin
Yueh-hua Wu
Shaowei Liu
Hanwen Jiang
Ruihan Yang
Yang Fu
Xiaolong Wang
177
200
0
12 Aug 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
477
10,367
0
17 Jun 2021
Rescaling Egocentric Vision
Dima Damen
Hazel Doughty
G. Farinella
Antonino Furnari
Evangelos Kazakos
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
78
460
0
23 Jun 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
608
4,893
0
23 Jan 2020
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
79
557
0
03 Dec 2019
Learning Latent Plans from Play
Corey Lynch
Mohi Khansari
Ted Xiao
Vikash Kumar
Jonathan Tompson
Sergey Levine
P. Sermanet
SSL
LM&Ro
93
406
0
05 Mar 2019
Imitating Latent Policies from Observation
Ashley D. Edwards
Himanshu Sahni
Yannick Schroecker
Charles Isbell
80
138
0
21 May 2018
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
228
5,019
0
02 Nov 2017
The "something something" video database for learning and evaluating visual common sense
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzynska
S. Westphal
...
Moritz Mueller-Freitag
F. Hoppe
Christian Thurau
Ingo Bax
Roland Memisevic
VLM
87
1,535
0
13 Jun 2017
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
224
7,755
0
31 Aug 2015
Previous
1
2