ShapeGlot: Learning Language for Shape Differentiation

IEEE International Conference on Computer Vision (ICCV), 2019
8 May 2019
Panos Achlioptas
Judy Fan
Robert D. Hawkins
Noah D. Goodman
Leonidas Guibas

Papers citing "ShapeGlot: Learning Language for Shape Differentiation"

50 / 61 papers shown
ChangingGrounding: 3D Visual Grounding in Changing Scenes
Miao Hu
Zhiwei Huang
Tai Wang
Jiangmiao Pang
Dahua Lin
Nanning Zheng
R. Xu
VGen
73
0
0
16 Oct 2025
Why Are You Wrong? Counterfactual Explanations for Language Grounding with 3D Objects
Tobias Preintner
Weixuan Yuan
Qi Huang
Adrian König
Thomas Bäck
Elena Raponi
Niki van Stein
174
0
0
09 May 2025
SceneFactor: Factored Latent 3D Diffusion for Controllable 3D Scene Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Alexey Bokhovkin
Quan Meng
Shubham Tulsiani
Angela Dai
DiffM
347
12
0
02 Dec 2024
OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing
European Conference on Computer Vision (ECCV), 2024
Pranav Gupta
Rishubh Singh
Pradeep Shenoy
Ravikiran Sarvadevabhatla
144
1
0
05 Nov 2024
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models
International Conference on Learning Representations (ICLR), 2024
Yue Zhang
Zhiyang Xu
Ying Shen
Parisa Kordjamshidi
Lifu Huang
255
17
0
04 Oct 2024
Multi-Task Domain Adaptation for Language Grounding with 3D Objects
Yixiang Chen
Yaoxian Song
Xinglin Pan
Peijie Dong
Xiaofei Yang
Qiang-qiang Wang
Zhixu Li
Tiefeng Li
Xiaowen Chu
200
2
0
03 Jul 2024
Situational Awareness Matters in 3D Vision Language Reasoning
Yunze Man
Liang-Yan Gui
Yu-Xiong Wang
212
34
0
11 Jun 2024
A Survey On Text-to-3D Contents Generation In The Wild
Chenhan Jiang
210
11
0
15 May 2024
PointCloud-Text Matching: Benchmark Datasets and a Baseline
Yanglin Feng
Yang Qin
Dezhong Peng
Erik Cambria
Xi Peng
Peng Hu
233
0
0
28 Mar 2024
NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation
Ruikai Cui
Weizhe Liu
Weixuan Sun
Senbo Wang
Taizhang Shang
...
Han Yan
Zhennan Wu
Shenzhou Chen
Hongdong Li
Pan Ji
175
11
0
27 Mar 2024
Text-to-3D Shape Generation
Han-Hung Lee
Manolis Savva
Angel X. Chang
191
17
0
20 Mar 2024
Advancements in Point Cloud-Based 3D Defect Detection and Classification for Industrial Systems: A Comprehensive Survey
Anju Rani
D. O. Arroyo
Petar Durdevic
3DPC
264
28
0
20 Feb 2024
Stellar: Systematic Evaluation of Human-Centric Personalized Text-to-Image Methods
Panos Achlioptas
Alexandros Benetatos
Iordanis Fostiropoulos
Dimitris Skourtis
212
10
0
11 Dec 2023
Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Chancharik Mitra
Abrar Anwar
Rodolfo Corona
Dan Klein
Trevor Darrell
Jesse Thomason
129
2
0
12 Nov 2023
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
213
148
0
11 Oct 2023
Looking at words and points with attention: a benchmark for text-to-shape coherence
Andrea Amaduzzi
Giuseppe Lisanti
Samuele Salti
Luigi Di Stefano
112
3
0
14 Sep 2023
Multi3DRefer: Grounding Text Description to Multiple 3D Objects
IEEE International Conference on Computer Vision (ICCV), 2023
Yiming Zhang
ZeMing Gong
Angel X. Chang
281
126
0
11 Sep 2023
FArMARe: a Furniture-Aware Multi-task methodology for Recommending Apartments based on the user interests
Ali Abdari
Alex Falcon
Giuseppe Serra
142
5
0
06 Sep 2023
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Neural Information Processing Systems (NeurIPS), 2023
Zibo Zhao
Wen Liu
Xin Chen
Xi Zeng
Rui Wang
Pei Cheng
Bin-Bin Fu
Tao Chen
Gang Yu
Shenghua Gao
DiffM
249
160
0
29 Jun 2023
Scalable 3D Captioning with Pretrained Models
Neural Information Processing Systems (NeurIPS), 2023
Tiange Luo
C. Rockwell
Honglak Lee
Justin Johnson
237
202
0
12 Jun 2023
Investigating Agency of LLMs in Human-AI Collaboration Tasks
Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2023
Ashish Sharma
Sudha Rao
Chris Brockett
Akanksha Malhotra
Nebojsa Jojic
W. Dolan
LLMAG
217
20
0
22 May 2023
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Chenghao Li
Chaoning Zhang
Atish Waghwase
Lik-Hang Lee
François Rameau
Yang Yang
Sung-Ho Bae
Choong Seon Hong
167
97
0
10 May 2023
DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion
IEEE International Conference on Computer Vision (ICCV), 2023
Kiyohiro Nakayama
Mikaela Angelina Uy
Jiahui Huang
Shihui Hu
Ke Li
Leonidas Guibas
DiffM
194
38
0
03 May 2023
SALAD: Part-Level Latent Diffusion for 3D Shape Generation and Manipulation
IEEE International Conference on Computer Vision (ICCV), 2023
Juil Koo
Seungwoo Yoo
Minh Hoai Nguyen
Minhyuk Sung
DiffM
150
65
0
21 Mar 2023
3DQD: Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process
Computer Vision and Pattern Recognition (CVPR), 2023
Yuhan Li
Yishun Dou
Xuanhong Chen
Bingbing Ni
Yilin Sun
Yutian Liu
Fuzhen Wang
DiffM
179
37
0
18 Mar 2023
Paparazzi: A Deep Dive into the Capabilities of Language and Vision Models for Grounding Viewpoint Descriptions
Findings (Findings), 2023
Henrik Voigt
J. Hombeck
M. Meuschke
K. Lawonn
Sina Zarrieß
VLM
161
1
0
13 Feb 2023
3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion Models
ACM Transactions on Graphics (TOG), 2023
Biao Zhang
Jiapeng Tang
Matthias Niessner
Peter Wonka
DiffM
336
319
0
26 Jan 2023
Neural Shape Compiler: A Unified Framework for Transforming between Text, Point Cloud, and Program
Tiange Luo
Honglak Lee
Justin Johnson
215
6
0
25 Dec 2022
Objaverse: A Universe of Annotated 3D Objects
Computer Vision and Pattern Recognition (CVPR), 2022
Matt Deitke
Dustin Schwenk
Jordi Salvador
Luca Weihs
Oscar Michel
Eli VanderBilt
Ludwig Schmidt
Kiana Ehsani
Aniruddha Kembhavi
Ali Farhadi
420
1,301
0
15 Dec 2022
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio-Linguistic Models in 3D Scenes
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Ahmed Abdelreheem
Kyle Olszewski
Hsin-Ying Lee
Peter Wonka
Panos Achlioptas
3DPC
219
32
0
12 Dec 2022
LADIS: Language Disentanglement for 3D Shape Editing
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Ian Huang
Panos Achlioptas
Tianyi Zhang
Sergey Tulyakov
Minhyuk Sung
Leonidas Guibas
147
10
0
09 Dec 2022
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Yen-Chi Cheng
Hsin-Ying Lee
Sergey Tulyakov
Alex Schwing
Liangyan Gui
DiffM
357
310
0
08 Dec 2022
Diffusion-SDF: Text-to-Shape via Voxelized Diffusion
Computer Vision and Pattern Recognition (CVPR), 2022
Muheng Li
Yueqi Duan
Jie Zhou
Jiwen Lu
DiffM
227
147
0
06 Dec 2022
CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes from Natural Language
Computer Vision and Pattern Recognition (CVPR), 2022
Aditya Sanghi
Rao Fu
Vivian Liu
Karl Willis
Hooman Shayani
Amir Hosein Khasahmadi
Srinath Sridhar
Daniel E. Ritchie
165
65
0
02 Nov 2022
PoseScript: Linking 3D Human Poses and Natural Language
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Ginger Delmas
Philippe Weinzaepfel
Thomas Lucas
Francesc Moreno-Noguer
Grégory Rogez
3DH
139
6
0
21 Oct 2022
Affection: Learning Affective Explanations for Real-World Visual Data
Computer Vision and Pattern Recognition (CVPR), 2022
Panos Achlioptas
M. Ovsjanikov
Leonidas Guibas
Sergey Tulyakov
141
23
0
04 Oct 2022
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
Neural Information Processing Systems (NeurIPS), 2022
Rao Fu
Xiaoyu Zhan
Yiwen Chen
Daniel E. Ritchie
Srinath Sridhar
253
89
0
19 Jul 2022
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
Zhihao Yuan
Xu Yan
Zhuo Li
Xuhao Li
Yao Guo
Shuguang Cui
Zhen Li
144
18
0
05 Jul 2022
Voxel-informed Language Grounding
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Rodolfo Corona
Shizhan Zhu
Dan Klein
Trevor Darrell
263
13
0
19 May 2022
FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing
Computer Vision and Pattern Recognition (CVPR), 2022
Rishu Singh
Pranav Gupta
Pradeep Shenoy
Ravi Kiran Sarvadevabhatla
215
14
0
30 Mar 2022
AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation
Computer Vision and Pattern Recognition (CVPR), 2022
Paritosh Mittal
Y. Cheng
Maneesh Singh
Shubham Tulsiani
268
265
0
17 Mar 2022
TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2022
Yue Ruan
Han-Hung Lee
Yiming Zhang
Ke Zhang
Angel X. Chang
190
26
0
19 Jan 2022
Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation
IEEE Transactions on Visualization and Computer Graphics (TVCG), 2021
Xu Yan
Zhihao Yuan
Yuhao Du
Yinghong Liao
Yao Guo
Zhen Li
Shuguang Cui
3DPC
CoGe
141
23
0
22 Dec 2021
PartGlot: Learning Shape Part Segmentation from Language Reference Games
Juil Koo
Ian Huang
Panos Achlioptas
Leonidas Guibas
Minhyuk Sung
3DPC
205
33
0
13 Dec 2021
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning
Yining Hong
Li Yi
J. Tenenbaum
Antonio Torralba
Chuang Gan
132
43
0
09 Dec 2021
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Qirui Wu
Matthias Nießner
Angel X. Chang
164
48
0
02 Dec 2021
Reinforced Natural Language Interfaces via Entropy Decomposition
Xiaoran Wu
Yipeng Kang
LLMAG
126
0
0
23 Sep 2021
Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision
IEEE International Conference on Computer Vision (ICCV), 2021
Xiaoshi Wu
Hadar Averbuch-Elor
J. Sun
Noah Snavely
133
24
0
12 Aug 2021
Language Grounding with 3D Objects
Conference on Robot Learning (CoRL), 2021
Jesse Thomason
Mohit Shridhar
Yonatan Bisk
Chris Paxton
Luke Zettlemoyer
LM&Ro
182
54
0
26 Jul 2021
Parts2Words: Learning Joint Embedding of Point Clouds and Texts by Bidirectional Matching between Parts and Words
Chuan Tang
Xi Yang
Bojian Wu
Zhizhong Han
Yi Chang
3DPC
157
16
0
05 Jul 2021