Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2503.18476
Cited By
v1
v2 (latest)
Global-Local Tree Search in VLMs for 3D Indoor Scene Generation
24 March 2025
Wei Deng
Mengshi Qi
Huadong Ma
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Global-Local Tree Search in VLMs for 3D Indoor Scene Generation"
21 / 21 papers shown
Title
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
Chenguo Lin
Yadong Mu
3DV
68
40
0
07 Feb 2024
Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning
Changsheng Lv
Shuai Zhang
Yapeng Tian
Mengshi Qi
Huadong Ma
CML
91
18
0
30 Oct 2023
RDFC-GAN: RGB-Depth Fusion CycleGAN for Indoor Depth Completion
Haowen Wang
Zhengping Che
Yufan Yang
Ming L. Wang
Zhiyuan Xu
Xiuquan Qiao
Mengshi Qi
Feifei Feng
Jian Tang
3DV
MDE
111
4
0
06 Jun 2023
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Xinze Wang
William Yang Wang
MLLM
92
179
0
24 May 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
575
4,925
0
17 Apr 2023
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
Rui Chen
Yuxiao Chen
Ningxin Jiao
Kui Jia
DiffM
111
592
0
24 Mar 2023
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
206
3,502
0
16 Oct 2022
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
177
2,439
0
29 Sep 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
537
6,301
0
05 Apr 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
859
9,714
0
28 Jan 2022
ATISS: Autoregressive Transformers for Indoor Scene Synthesis
Despoina Paschalidou
Amlan Kar
Maria Shugrina
Karsten Kreis
Andreas Geiger
Sanja Fidler
3DV
ViT
120
155
0
07 Oct 2021
Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs
Helisa Dhamo
Fabian Manhardt
Nassir Navab
F. Tombari
3DV
53
70
0
19 Aug 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
1.0K
29,926
0
26 Feb 2021
SceneFormer: Indoor Scene Generation with Transformers
Xinpeng Wang
Chandan Yeshwanth
Matthias Nießner
ViT
3DPC
57
154
0
17 Dec 2020
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields
Michael Niemeyer
Andreas Geiger
OCL
166
963
0
24 Nov 2020
3D-FRONT: 3D Furnished Rooms with layOuts and semaNTics
Huan Fu
Bowen Cai
Lin Gao
Ling-Xiao Zhang
Ying Li
Zengqi Xun
Chengyue Sun
Rongfei Jia
Binqiang Zhao
H. Zhang
3DV
83
275
0
18 Nov 2020
3D-FUTURE: 3D Furniture shape with TextURE
Huan Fu
Rongfei Jia
Lin Gao
Biwei Huang
Binqiang Zhao
Stephen J. Maybank
Dacheng Tao
3DV
89
262
0
21 Sep 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
911
42,520
0
28 May 2020
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
Johanna Wald
Helisa Dhamo
Nassir Navab
Federico Tombari
3DV
3DPC
73
219
0
08 Apr 2020
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers
Iryna Gurevych
1.3K
12,332
0
27 Aug 2019
Attentive Relational Networks for Mapping Images to Scene Graphs
Mengshi Qi
Weijian Li
Zhengyuan Yang
Yunhong Wang
Jiebo Luo
3DPC
3DH
GNN
76
169
0
26 Nov 2018
1