arXiv:2503.03734
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
5 March 2025
Huang Huang
Fangchen Liu
Letian Fu
Tingfan Wu
Mustafa Mukadam
Jitendra Malik
Ken Goldberg
Pieter Abbeel
LM&Ro
VLM
Papers citing "OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction"
5 / 5 papers shown
Training Strategies for Efficient Embodied Reasoning
William Chen
Suneel Belkhale
Suvir Mirchandani
Oier Mees
Danny Driess
Karl Pertsch
Sergey Levine
OffRL
LRM
23
0
0
13 May 2025
3D CAVLA: Leveraging Depth and 3D Context to Generalize Vision Language Action Models for Unseen Tasks
V. Bhat
Yu-Hsiang Lan
P. Krishnamurthy
Ramesh Karri
Farshad Khorrami
52
0
0
09 May 2025
Vision-Language-Action Models: Concepts, Progress, Applications and Challenges
Ranjan Sapkota
Yang Cao
Konstantinos I Roumeliotis
Manoj Karkee
LM&Ro
151
1
0
07 May 2025
Generalization Capability for Imitation Learning
Yixiao Wang
114
0
0
25 Apr 2025
π_{0.5}: a Vision-Language-Action Model with Open-World Generalization
Physical Intelligence
Kevin Black
Noah Brown
James Darpinian
Karan Dhabalia
...
Homer Walke
Anna Walling
Haohuan Wang
Lili Yu
Ury Zhilinsky
LM&Ro
VLM
39
10
0
22 Apr 2025