2401.06341
Cited By
AffordanceLLM: Grounding Affordance from Vision Language Models
12 January 2024
Shengyi Qian
Weifeng Chen
Min Bai
Xiong Zhou
Zhuowen Tu
Li Erran Li
Papers citing "AffordanceLLM: Grounding Affordance from Vision Language Models"
25 / 25 papers shown
Title
Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning
Lynn Cherif
Flemming Kondrup
David Venuto
Ankit Anand
Doina Precup
Khimya Khetarpal
LM&Ro
49
0
0
24 Apr 2025
AffordanceSAM: Segment Anything Once More in Affordance Grounding
D. Jiang
Mengmeng Wang
Teli Ma
Hao Li
Yong-Jin Liu
Guang Dai
L. Zhang
32
0
0
22 Apr 2025
A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation
Rongtao Xu
J. Zhang
Minghao Guo
Youpeng Wen
H. Yang
...
Liqiong Wang
Yuxuan Kuang
Meng Cao
Feng Zheng
Xiaodan Liang
47
3
0
17 Apr 2025
Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
He Zhu
Quyu Kong
Kechun Xu
Xunlong Xia
Bing Deng
Jieping Ye
R. Xiong
Yixuan Wang
32
0
0
07 Apr 2025
Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts
Jianhua Sun
Jiude Wei
Yongqian Li
Cewu Lu
LM&Ro
54
1
0
30 Mar 2025
2HandedAfforder: Learning Precise Actionable Bimanual Affordances from Human Videos
Marvin Heidinger
Snehal Jauhri
V. Prasad
Georgia Chalvatzaki
68
0
0
12 Mar 2025
SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model
Chunlin Yu
Hanqing Wang
Ye Shi
Haoyang Luo
Sibei Yang
Jingyi Yu
Jingya Wang
LRM
LM&Ro
92
1
0
02 Dec 2024
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
Yawen Shao
Wei-dong Zhai
Yuhang Yang
Hongchen Luo
Yang Cao
Zheng-jun Zha
98
1
0
29 Nov 2024
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Under Ambiguities
Zheyuan Zhang
Fengyuan Hu
Jayjun Lee
Freda Shi
Parisa Kordjamshidi
Joyce Chai
Ziqiao Ma
56
11
0
22 Oct 2024
PAVLM: Advancing Point Cloud based Affordance Understanding Via Vision-Language Model
Shang-Ching Liu
Van-Nhiem Tran
Wenkai Chen
Wei-Lun Cheng
Yen-Lin Huang
I-Bin Liao
Yung-Hui Li
Jianwei Zhang
18
0
0
15 Oct 2024
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation
Qingwen Bu
Hongyang Li
Li Chen
Jisong Cai
Jia Zeng
Heming Cui
Maoqing Yao
Yu Qiao
52
4
0
10 Oct 2024
VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation
Hanning Chen
Yang Ni
Wenjun Huang
Yezi Liu
SungHeon Jeong
Fei Wen
Nathaniel Bastian
Hugo Latapie
Mohsen Imani
VLM
32
4
0
13 Sep 2024
Learning Precise Affordances from Egocentric Videos for Robotic Manipulation
Gen Li
Nikolaos Tsagkas
Jifei Song
Ruaridh Mon-Williams
S. Vijayakumar
Kun Shao
Laura Sevilla-Lara
36
7
0
19 Aug 2024
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
Xiang Li
Cristina Mata
J. Park
Kumara Kahatapitiya
Yoo Sung Jang
...
Kanchana Ranasinghe
R. Burgert
Mu Cai
Yong Jae Lee
Michael S. Ryoo
LM&Ro
72
25
0
28 Jun 2024
SInViG: A Self-Evolving Interactive Visual Agent for Human-Robot Interaction
Jie Xu
Hanbo Zhang
Xinghang Li
Huaping Liu
Xuguang Lan
Tao Kong
LM&Ro
35
3
0
19 Feb 2024
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers
Haifeng Huang
Zehan Wang
Rongjie Huang
Luping Liu
Xize Cheng
Yang Zhao
Tao Jin
Zhou Zhao
59
43
0
13 Dec 2023
Multi-Object Graph Affordance Network: Goal-Oriented Planning through Learned Compound Object Affordances
Tuba Girgin
Emre Ugur
32
3
0
19 Sep 2023
Putting People in Their Place: Affordance-Aware Human Insertion into Scenes
Sumith Kulal
Tim Brooks
A. Aiken
Jiajun Wu
Jimei Yang
Jingwan Lu
Alexei A. Efros
Krishna Kumar Singh
DiffM
46
42
0
27 Apr 2023
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Sébastien Bubeck
Varun Chandrasekaran
Ronen Eldan
J. Gehrke
Eric Horvitz
...
Scott M. Lundberg
Harsha Nori
Hamid Palangi
Marco Tulio Ribeiro
Yi Zhang
ELM
AI4MH
AI4CE
ALM
298
3,007
0
22 Mar 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
270
4,244
0
30 Jan 2023
One-Shot Transfer of Affordance Regions? AffCorrs!
Denis Hadjivelichkov
Sicelukwanda Zwane
M. Deisenroth
Lourdes Agapito
Dimitrios Kanoulas
37
34
0
15 Sep 2022
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
D. Fox
LM&Ro
161
457
0
12 Sep 2022
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
317
5,785
0
29 Apr 2021
Associative3D: Volumetric Reconstruction from Sparse Views
Shengyi Qian
Linyi Jin
David Fouhey
42
20
0
27 Jul 2020
Designing Deep Networks for Surface Normal Estimation
Xinyu Wang
David Fouhey
Abhinav Gupta
3DV
SSL
167
353
0
18 Nov 2014