Global-Local Tree Search in VLMs for 3D Indoor Scene Generation

24 March 2025
Wei Deng
Mengshi Qi
Huadong Ma
    3DV
arXiv: 2503.18476

Papers citing "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation"

21 / 21 papers shown
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
Chenguo Lin
Yadong Mu
3DV
68
40
0
07 Feb 2024
Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning
Changsheng Lv
Shuai Zhang
Yapeng Tian
Mengshi Qi
Huadong Ma
CML
91
18
0
30 Oct 2023
RDFC-GAN: RGB-Depth Fusion CycleGAN for Indoor Depth Completion
Haowen Wang
Zhengping Che
Yufan Yang
Ming L. Wang
Zhiyuan Xu
Xiuquan Qiao
Mengshi Qi
Feifei Feng
Jian Tang
3DV, MDE
111
4
0
06 Jun 2023
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Xin Eric Wang
William Yang Wang
MLLM
92
179
0
24 May 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa, VLM, MLLM
575
4,925
0
17 Apr 2023
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
Rui Chen
Yuxiao Chen
Ningxin Jiao
Kui Jia
DiffM
111
592
0
24 Mar 2023
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM, MLLM, CLIP
206
3,502
0
16 Oct 2022
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
177
2,439
0
29 Sep 2022
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM, LRM
537
6,301
0
05 Apr 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro, LRM, AI4CE, ReLM
859
9,714
0
28 Jan 2022
ATISS: Autoregressive Transformers for Indoor Scene Synthesis
Despoina Paschalidou
Amlan Kar
Maria Shugrina
Karsten Kreis
Andreas Geiger
Sanja Fidler
3DV, ViT
120
155
0
07 Oct 2021
Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs
Helisa Dhamo
Fabian Manhardt
Nassir Navab
F. Tombari
3DV
53
70
0
19 Aug 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP, VLM
1.0K
29,926
0
26 Feb 2021
SceneFormer: Indoor Scene Generation with Transformers
Xinpeng Wang
Chandan Yeshwanth
Matthias Nießner
ViT, 3DPC
57
154
0
17 Dec 2020
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields
Michael Niemeyer
Andreas Geiger
OCL
166
963
0
24 Nov 2020
3D-FRONT: 3D Furnished Rooms with layOuts and semaNTics
Huan Fu
Bowen Cai
Lin Gao
Ling-Xiao Zhang
Ying Li
Zengqi Xun
Chengyue Sun
Rongfei Jia
Binqiang Zhao
H. Zhang
3DV
83
275
0
18 Nov 2020
3D-FUTURE: 3D Furniture shape with TextURE
Huan Fu
Rongfei Jia
Lin Gao
Biwei Huang
Binqiang Zhao
Stephen J. Maybank
Dacheng Tao
3DV
89
262
0
21 Sep 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
911
42,520
0
28 May 2020
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions
Johanna Wald
Helisa Dhamo
Nassir Navab
Federico Tombari
3DV, 3DPC
73
219
0
08 Apr 2020
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers
Iryna Gurevych
1.3K
12,332
0
27 Aug 2019
Attentive Relational Networks for Mapping Images to Scene Graphs
Mengshi Qi
Weijian Li
Zhengyuan Yang
Yunhong Wang
Jiebo Luo
3DPC, 3DH, GNN
76
169
0
26 Nov 2018