Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.17766
Cited By
v1
v2
v3 (latest)
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
27 February 2024
Zekun Qi
Runpei Dong
Shaochen Zhang
Haoran Geng
Chunrui Han
Zheng Ge
Li Yi
Kaisheng Ma
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"ShapeLLM: Universal 3D Object Understanding for Embodied Interaction"
50 / 102 papers shown
Title
Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
Wenliang Dai
Zihan Liu
Ziwei Ji
Dan Su
Pascale Fung
MLLM
VLM
84
67
0
14 Oct 2022
SQA3D: Situated Question Answering in 3D Scenes
Xiaojian Ma
Silong Yong
Zilong Zheng
Qing Li
Yitao Liang
Song-Chun Zhu
Siyuan Huang
LM&Ro
81
159
0
14 Oct 2022
VIMA: General Robot Manipulation with Multimodal Prompts
Yunfan Jiang
Agrim Gupta
Zichen Zhang
Guanzhi Wang
Yongqiang Dou
Yanjun Chen
Li Fei-Fei
Anima Anandkumar
Yuke Zhu
Linxi Fan
LM&Ro
113
355
0
06 Oct 2022
CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training
Tianyu Huang
Bowen Dong
Yunhan Yang
Xiaoshui Huang
Rynson W. H. Lau
Wanli Ouyang
W. Zuo
VLM
3DPC
CLIP
122
149
0
03 Oct 2022
Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding
Hao Wen
Yunze Liu
Jingwei Huang
Bokun Duan
Li Yi
ViT
3DPC
87
28
0
30 Jul 2022
Inner Monologue: Embodied Reasoning through Planning with Language Models
Wenlong Huang
F. Xia
Ted Xiao
Harris Chan
Jacky Liang
...
Tomas Jackson
Linda Luu
Sergey Levine
Karol Hausman
Brian Ichter
LLMAG
LM&Ro
LRM
137
922
0
12 Jul 2022
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training
Renrui Zhang
Ziyu Guo
Rongyao Fang
Bingyan Zhao
Dong Wang
Yu Qiao
Hongsheng Li
Peng Gao
3DPC
254
258
0
28 May 2022
Region-aware Knowledge Distillation for Efficient Image-to-Image Translation
Linfeng Zhang
Xin Chen
Runpei Dong
Kaisheng Ma
VLM
82
12
0
25 May 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLM
VLM
418
3,610
0
29 Apr 2022
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances
Michael Ahn
Anthony Brohan
Noah Brown
Yevgen Chebotar
Omar Cortes
...
Ted Xiao
Peng Xu
Sichun Xu
Mengyuan Yan
Andy Zeng
LM&Ro
195
1,988
0
04 Apr 2022
Visual Prompt Tuning
Menglin Jia
Luming Tang
Bor-Chun Chen
Claire Cardie
Serge Belongie
Bharath Hariharan
Ser-Nam Lim
VLM
VPVLM
173
1,647
0
23 Mar 2022
Masked Autoencoders for Point Cloud Self-supervised Learning
Yatian Pang
Wenxiao Wang
Francis E. H. Tay
Wen Liu
Yonghong Tian
Liuliang Yuan
3DPC
ViT
111
477
0
13 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
897
13,228
0
04 Mar 2022
Benchmarking and Analyzing Point Cloud Classification under Corruptions
Jiawei Ren
Liang Pan
Ziwei Liu
3DPC
91
84
0
07 Feb 2022
Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
Runpei Dong
Zhanhong Tan
Mengdi Wu
Linfeng Zhang
Kaisheng Ma
MQ
94
12
0
30 Dec 2021
3D Question Answering
Shuquan Ye
Dongdong Chen
Songfang Han
Jing Liao
ViT
90
49
0
15 Dec 2021
Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling
Xumin Yu
Lulu Tang
Yongming Rao
Tiejun Huang
Jie Zhou
Jiwen Lu
3DPC
144
685
0
29 Nov 2021
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
482
7,837
0
11 Nov 2021
ABO: Dataset and Benchmarks for Real-World 3D Object Understanding
Jasmine Collins
Shubham Goel
Kenan Deng
Achleshwar Luthra
Leon L. Xu
...
T. F. Y. Vicente
T. Dideriksen
H. Arora
M. Guillaumin
Jitendra Malik
216
231
0
12 Oct 2021
Towards a Unified View of Parameter-Efficient Transfer Learning
Junxian He
Chunting Zhou
Xuezhe Ma
Taylor Berg-Kirkpatrick
Graham Neubig
AAML
146
953
0
08 Oct 2021
Voxel Transformer for 3D Object Detection
Jiageng Mao
Yujing Xue
Minzhe Niu
Haoyue Bai
Jiashi Feng
Xiaodan Liang
Hang Xu
Chunjing Xu
3DPC
ViT
85
413
0
06 Sep 2021
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
504
10,526
0
17 Jun 2021
Pri3D: Can 3D Priors Help 2D Representation Learning?
Ji Hou
Saining Xie
Benjamin Graham
Angela Dai
Matthias Nießner
SSL
3DPC
MDE
150
81
0
22 Apr 2021
SimCSE: Simple Contrastive Learning of Sentence Embeddings
Tianyu Gao
Xingcheng Yao
Danqi Chen
AILaw
SSL
282
3,432
0
18 Apr 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Xiang Lisa Li
Percy Liang
252
4,305
0
01 Jan 2021
MVTN: Multi-View Transformation Network for 3D Shape Recognition
Abdullah Hamdi
Silvio Giancola
Guohao Li
3DV
3DPC
111
204
0
26 Nov 2020
Point Transformer
Nico Engel
Vasileios Belagiannis
Klaus C. J. Dietmayer
3DPC
186
2,008
0
02 Nov 2020
3D-FUTURE: 3D Furniture shape with TextURE
Huan Fu
Rongfei Jia
Lin Gao
Biwei Huang
Binqiang Zhao
Stephen J. Maybank
Dacheng Tao
3DV
89
262
0
21 Sep 2020
One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control
Wenlong Huang
Igor Mordatch
Deepak Pathak
127
178
0
09 Jul 2020
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
456
13,130
0
26 May 2020
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Dave Zhenyu Chen
Angel X. Chang
Matthias Nießner
3DPC
102
379
0
18 Dec 2019
How Can We Know What Language Models Know?
Zhengbao Jiang
Frank F. Xu
Jun Araki
Graham Neubig
KELM
149
1,412
0
28 Nov 2019
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
587
2,680
0
03 Sep 2019
Commonsense Knowledge Mining from Pretrained Models
Joshua Feldman
Joe Davison
Alexander M. Rush
SSL
98
331
0
02 Sep 2019
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers
Iryna Gurevych
1.3K
12,332
0
27 Aug 2019
Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data
Mikaela Angelina Uy
Quang Pham
Binh-Son Hua
D. Nguyen
Sai-Kit Yeung
3DV
3DPC
125
785
0
13 Aug 2019
LVIS: A Dataset for Large Vocabulary Instance Segmentation
Agrim Gupta
Piotr Dollár
Ross B. Girshick
ISeg
VLM
111
1,379
0
08 Aug 2019
Scaling and Benchmarking Self-Supervised Visual Representation Learning
Priya Goyal
D. Mahajan
Abhinav Gupta
Ishan Misra
SSL
90
397
0
03 May 2019
Relation-Shape Convolutional Neural Network for Point Cloud Analysis
Yongcheng Liu
Bin Fan
Shiming Xiang
Chunhong Pan
3DPC
115
896
0
16 Apr 2019
Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation
He Wang
Srinath Sridhar
Jingwei Huang
Julien P. C. Valentin
Shuran Song
Leonidas Guibas
120
696
0
09 Jan 2019
Object Hallucination in Image Captioning
Anna Rohrbach
Lisa Anne Hendricks
Kaylee Burns
Trevor Darrell
Kate Saenko
204
443
0
06 Sep 2018
Dynamic Graph CNN for Learning on Point Clouds
Yue Wang
Yongbin Sun
Ziwei Liu
Sanjay E. Sarma
M. Bronstein
Justin Solomon
GNN
3DPC
260
6,177
0
24 Jan 2018
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
3DPC
3DV
513
4,088
0
14 Feb 2017
A Point Set Generation Network for 3D Object Reconstruction from a Single Image
Haoqiang Fan
Hao Su
Leonidas Guibas
3DPC
3DV
176
2,242
0
02 Dec 2016
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas Guibas
3DH
3DPC
3DV
PINN
500
14,384
0
02 Dec 2016
Visual Dialog
Abhishek Das
Satwik Kottur
Khushi Gupta
Avi Singh
Deshraj Yadav
José M. F. Moura
Devi Parikh
Dhruv Batra
157
1,004
0
26 Nov 2016
Understanding and Exploiting Object Interaction Landscapes
Soren Pirk
Vojtech Krs
Kaimo Hu
S. D. Rajasekaran
Hao Kang
Bedrich Benes
Yusuke Yoshiyasu
Leonidas Guibas
46
35
0
27 Sep 2016
SGDR: Stochastic Gradient Descent with Warm Restarts
I. Loshchilov
Frank Hutter
ODL
352
8,190
0
13 Aug 2016
Gaussian Error Linear Units (GELUs)
Dan Hendrycks
Kevin Gimpel
176
5,049
0
27 Jun 2016
ShapeNet: An Information-Rich 3D Model Repository
Angel X. Chang
Thomas Funkhouser
Leonidas Guibas
Pat Hanrahan
Qi-Xing Huang
...
Shuran Song
Hao Su
Jianxiong Xiao
L. Yi
Feng Yu
3DV
176
5,538
0
09 Dec 2015
Previous
1
2
3
Next