Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2405.10255
Cited By
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models
16 May 2024
Xianzheng Ma
Yash Bhalgat
Brandon Smart
Shuai Chen
Xinghui Li
Jian Ding
Jindong Gu
Dave Zhenyu Chen
Songyou Peng
Jiawang Bian
Philip Torr
Marc Pollefeys
Matthias Nießner
Ian D Reid
Angel X. Chang
Iro Laina
V. Prisacariu
LRM
Re-assign community
ArXiv (abs)
PDF
HTML
Github (1689★)
Papers citing
"When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models"
50 / 155 papers shown
Title
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
579
4,077
0
18 Apr 2021
Holistic 3D Scene Understanding from a Single Image with Implicit Representation
Cheng Zhang
Zhaopeng Cui
Yinda Zhang
B. Zeng
Marc Pollefeys
Shuaicheng Liu
126
107
0
11 Mar 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
445
3,887
0
11 Feb 2021
ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
Wonjae Kim
Bokyung Son
Ildoo Kim
VLM
CLIP
128
1,749
0
05 Feb 2021
What Makes Good In-Context Examples for GPT-
3
3
3
?
Jiachang Liu
Dinghan Shen
Yizhe Zhang
Bill Dolan
Lawrence Carin
Weizhu Chen
AAML
RALM
385
1,387
0
17 Jan 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Xiang Lisa Li
Percy Liang
248
4,261
0
01 Jan 2021
D-NeRF: Neural Radiance Fields for Dynamic Scenes
Albert Pumarola
Enric Corona
Gerard Pons-Moll
Francesc Moreno-Noguer
119
1,446
0
27 Nov 2020
Score-Based Generative Modeling through Stochastic Differential Equations
Yang Song
Jascha Narain Sohl-Dickstein
Diederik P. Kingma
Abhishek Kumar
Stefano Ermon
Ben Poole
DiffM
SyDa
344
6,551
0
26 Nov 2020
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields
Michael Niemeyer
Andreas Geiger
OCL
154
963
0
24 Nov 2020
NeRF++: Analyzing and Improving Neural Radiance Fields
Kai Zhang
Gernot Riegler
Noah Snavely
V. Koltun
88
1,046
0
15 Oct 2020
Neural Sparse Voxel Fields
Lingjie Liu
Jiatao Gu
Kyaw Zaw Lin
Tat-Seng Chua
Christian Theobalt
266
1,269
0
22 Jul 2020
GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis
Katja Schwarz
Yiyi Liao
Michael Niemeyer
Andreas Geiger
VGen
140
873
0
05 Jul 2020
Evaluation of Text Generation: A Survey
Asli Celikyilmaz
Elizabeth Clark
Jianfeng Gao
ELM
LM&MA
112
387
0
26 Jun 2020
Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains
Matthew Tancik
Pratul P. Srinivasan
B. Mildenhall
Sara Fridovich-Keil
N. Raghavan
Utkarsh Singhal
R. Ramamoorthi
Jonathan T. Barron
Ren Ng
124
2,421
0
18 Jun 2020
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
Johanna Wald
Helisa Dhamo
Nassir Navab
Federico Tombari
3DV
3DPC
71
218
0
08 Apr 2020
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
Li Jiang
Hengshuang Zhao
Shaoshuai Shi
Shu Liu
Chi-Wing Fu
Jiaya Jia
3DPC
83
436
0
03 Apr 2020
OccuSeg: Occupancy-aware 3D Instance Segmentation
Lei Han
Tian Zheng
Lan Xu
Lu Fang
3DPC
242
260
0
14 Mar 2020
PolyGen: An Autoregressive Generative Model of 3D Meshes
C. Nash
Yaroslav Ganin
A. Eslami
Peter W. Battaglia
AI4CE
86
262
0
23 Feb 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
608
4,893
0
23 Jan 2020
LayoutLM: Pre-training of Text and Layout for Document Image Understanding
Yiheng Xu
Minghao Li
Lei Cui
Shaohan Huang
Furu Wei
Ming Zhou
135
707
0
31 Dec 2019
ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Dave Zhenyu Chen
Angel X. Chang
Matthias Nießner
3DPC
89
376
0
18 Dec 2019
SynSin: End-to-end View Synthesis from a Single Image
Olivia Wiles
Georgia Gkioxari
Richard Szeliski
Justin Johnson
3DV
87
472
0
18 Dec 2019
SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization
Yue Jiang
Dantong Ji
Zhizhong Han
Matthias Zwicker
63
238
0
15 Dec 2019
How Can We Know What Language Models Know?
Zhengbao Jiang
Frank F. Xu
Jun Araki
Graham Neubig
KELM
132
1,405
0
28 Nov 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
445
20,298
0
23 Oct 2019
RIO: 3D Object Instance Re-Localization in Changing Indoor Environments
Johanna Wald
A. Avetisyan
Nassir Navab
Federico Tombari
Matthias Nießner
59
158
0
16 Aug 2019
Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data
Mikaela Angelina Uy
Quang Pham
Binh-Son Hua
D. Nguyen
Sai-Kit Yeung
3DV
3DPC
96
780
0
13 Aug 2019
Learning to Predict 3D Objects with an Interpolation-based Differentiable Renderer
Wenzheng Chen
Jun Gao
Huan Ling
Edward James Smith
J. Lehtinen
Alec Jacobson
Sanja Fidler
3DH
3DV
96
362
0
03 Aug 2019
Differentiable Surface Splatting for Point-based Geometry Processing
Yifan Wang
Felice Serena
Shihao Wu
Cengiz Öztireli
O. Sorkine-Hornung
3DPC
82
299
0
10 Jun 2019
Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representations
Vincent Sitzmann
Michael Zollhoefer
Gordon Wetzstein
3DPC
3DV
163
1,282
0
04 Jun 2019
DISN: Deep Implicit Surface Network for High-quality Single-view 3D Reconstruction
Qiangeng Xu
Weiyue Wang
Duygu Ceylan
R. Měch
Ulrich Neumann
3DH
3DV
106
566
0
26 May 2019
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
329
5,845
0
21 Apr 2019
Analysing Mathematical Reasoning Abilities of Neural Models
D. Saxton
Edward Grefenstette
Felix Hill
Pushmeet Kohli
LRM
193
430
0
02 Apr 2019
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
298
5,770
0
26 Mar 2019
DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation
Jeong Joon Park
Peter R. Florence
Julian Straub
Richard Newcombe
S. Lovegrove
3DV
133
3,704
0
16 Jan 2019
3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans
Ji Hou
Angela Dai
Matthias Nießner
ISeg
82
468
0
17 Dec 2018
ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving
Xibin Song
Peng Wang
Dingfu Zhou
Rui Zhu
Chenye Guan
Yuchao Dai
Hao Su
Hongdong Li
Ruigang Yang
3DPC
78
158
0
29 Nov 2018
Learning View Priors for Single-view 3D Reconstruction
Hiroharu Kato
Tatsuya Harada
74
80
0
26 Nov 2018
Unsupervised Learning of Shape and Pose with Differentiable Point Clouds
Eldar Insafutdinov
Alexey Dosovitskiy
3DPC
71
244
0
22 Oct 2018
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
Taku Kudo
John Richardson
198
3,526
0
19 Aug 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
1.1K
7,182
0
20 Apr 2018
3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation
Angela Dai
Matthias Nießner
3DPC
3DV
46
324
0
28 Mar 2018
Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings
Kevin Chen
Chris Choy
Manolis Savva
Angel X. Chang
Thomas Funkhouser
Silvio Savarese
3DV
65
250
0
22 Mar 2018
SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation
Weiyue Wang
Ronald Yu
Qiangui Huang
Ulrich Neumann
3DPC
96
552
0
23 Nov 2017
VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
Yin Zhou
Oncel Tuzel
3DPC
112
3,723
0
17 Nov 2017
Matterport3D: Learning from RGB-D Data in Indoor Environments
Angel X. Chang
Angela Dai
Thomas Funkhouser
Maciej Halber
Matthias Nießner
Manolis Savva
Shuran Song
Andy Zeng
Yinda Zhang
3DV
3DPC
191
1,914
0
18 Sep 2017
Learning Efficient Point Cloud Generation for Dense 3D Object Reconstruction
Chen-Hsuan Lin
Chen Kong
Simon Lucey
3DV
72
426
0
21 Jun 2017
Multi-view Supervision for Single-view Reconstruction via Differentiable Ray Consistency
Shubham Tulsiani
Tinghui Zhou
Alyosha A. Efros
Jitendra Malik
3DV
89
560
0
20 Apr 2017
ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
Angela Dai
Angel X. Chang
Manolis Savva
Maciej Halber
Thomas Funkhouser
Matthias Nießner
3DPC
3DV
481
4,077
0
14 Feb 2017
Structured Attention Networks
Yoon Kim
Carl Denton
Luong Hoang
Alexander M. Rush
114
463
0
03 Feb 2017
Previous
1
2
3
4
Next