ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.09278
  4. Cited By
Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding
v1v2 (latest)

Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding

20 January 2025
Kohei Torimi
Ryosuke Yamada
Daichi Otsuka
Kensho Hara
Yuki M. Asano
Hirokatsu Kataoka
Y. Aoki
    3DV
ArXiv (abs)PDFHTML

Papers citing "Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding"

35 / 35 papers shown
Title
Learning Vision from Models Rivals Learning Vision from Data
Learning Vision from Models Rivals Learning Vision from Data
Yonglong Tian
Lijie Fan
Kaifeng Chen
Dina Katabi
Dilip Krishnan
Phillip Isola
108
51
0
28 Dec 2023
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D
  Pre-training
Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training
Yipeng Gao
Zeyu Wang
Wei-Shi Zheng
Cihang Xie
Yuyin Zhou
3DPC
150
10
0
03 Nov 2023
Uni3D: Exploring Unified 3D Representation at Scale
Uni3D: Exploring Unified 3D Representation at Scale
Junsheng Zhou
Jinsheng Wang
Baorui Ma
Yu-Shen Liu
Tiejun Huang
Xinlong Wang
113
98
0
10 Oct 2023
T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation
T3^33Bench: Benchmarking Current Progress in Text-to-3D Generation
Yuze He
Yushi Bai
Matthieu Lin
Wang Zhao
Yubin Hu
Jenny Sheng
Ran Yi
Juanzi Li
Yong Liu
130
33
0
04 Oct 2023
Objaverse-XL: A Universe of 10M+ 3D Objects
Objaverse-XL: A Universe of 10M+ 3D Objects
Matt Deitke
Ruoshi Liu
Matthew Wallingford
Huong Ngo
Oscar Michel
...
Carl Vondrick
Georgia Gkioxari
Kiana Ehsani
Ludwig Schmidt
Ali Farhadi
108
423
0
11 Jul 2023
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual
  Representation Learners
StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners
Yonglong Tian
Lijie Fan
Phillip Isola
Huiwen Chang
Dilip Krishnan
VLMDiffM
152
153
0
01 Jun 2023
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with
  Variational Score Distillation
ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation
Zhengyi Wang
Cheng Lu
Yikai Wang
Fan Bao
Chongxuan Li
Hang Su
Jun Zhu
DiffM
180
876
0
25 May 2023
OpenShape: Scaling Up 3D Shape Representation Towards Open-World
  Understanding
OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding
Minghua Liu
Ruoxi Shi
Kaiming Kuang
Yinhao Zhu
Xuanlin Li
Shizhong Han
H. Cai
Fatih Porikli
Hao Su
3DPC
126
124
0
18 May 2023
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
Le Xue
Ning Yu
Shu Zhen Zhang
Artemis Panagopoulou
Junnan Li
...
Jiajun Wu
Caiming Xiong
Ran Xu
Juan Carlos Niebles
Silvio Savarese
127
130
0
14 May 2023
TextMesh: Generation of Realistic 3D Meshes From Text Prompts
TextMesh: Generation of Realistic 3D Meshes From Text Prompts
Christina Tsalicoglou
Fabian Manhardt
A. Tonioni
Michael Niemeyer
F. Tombari
DiffM
76
135
0
24 Apr 2023
DINOv2: Learning Robust Visual Features without Supervision
DINOv2: Learning Robust Visual Features without Supervision
Maxime Oquab
Timothée Darcet
Théo Moutakanni
Huy Q. Vo
Marc Szafraniec
...
Hervé Jégou
Julien Mairal
Patrick Labatut
Armand Joulin
Piotr Bojanowski
VLMCLIPSSL
538
3,535
0
14 Apr 2023
Fantasia3D: Disentangling Geometry and Appearance for High-quality
  Text-to-3D Content Creation
Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation
Rui Chen
Yuxiao Chen
Ningxin Jiao
Kui Jia
DiffM
121
593
0
24 Mar 2023
Zero-1-to-3: Zero-shot One Image to 3D Object
Zero-1-to-3: Zero-shot One Image to 3D Object
Ruoshi Liu
Rundi Wu
Basile Van Hoorick
P. Tokmakov
Sergey Zakharov
Carl Vondrick
DiffM
153
1,113
0
20 Mar 2023
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D
  Recognition
CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition
Deepti Hegde
Jeya Maria Jose Valanarasu
Vishal M. Patel
CLIP
122
68
0
20 Mar 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided
  by Generative Pretraining
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
Zekun Qi
Runpei Dong
Guo Fan
Zheng Ge
Xiangyu Zhang
Kaisheng Ma
Li Yi
154
131
0
05 Feb 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLMMLLM
576
4,675
0
30 Jan 2023
Point-E: A System for Generating 3D Point Clouds from Complex Prompts
Point-E: A System for Generating 3D Point Clouds from Complex Prompts
Alex Nichol
Heewoo Jun
Prafulla Dhariwal
Pamela Mishkin
Mark Chen
DiffM
141
614
0
16 Dec 2022
Fake it till you make it: Learning transferable representations from
  synthetic ImageNet clones
Fake it till you make it: Learning transferable representations from synthetic ImageNet clones
Mert Bulent Sariyildiz
Alahari Karteek
Diane Larlus
Yannis Kalantidis
DiffMVLM
111
161
0
16 Dec 2022
Objaverse: A Universe of Annotated 3D Objects
Objaverse: A Universe of Annotated 3D Objects
Matt Deitke
Dustin Schwenk
Jordi Salvador
Luca Weihs
Oscar Michel
Eli VanderBilt
Ludwig Schmidt
Kiana Ehsani
Aniruddha Kembhavi
Ali Farhadi
116
975
0
15 Dec 2022
Reproducible scaling laws for contrastive language-image learning
Reproducible scaling laws for contrastive language-image learning
Mehdi Cherti
Romain Beaumont
Ross Wightman
Mitchell Wortsman
Gabriel Ilharco
Cade Gordon
Christoph Schuhmann
Ludwig Schmidt
J. Jitsev
VLMCLIP
141
824
0
14 Dec 2022
Magic3D: High-Resolution Text-to-3D Content Creation
Magic3D: High-Resolution Text-to-3D Content Creation
Chen-Hsuan Lin
Jun Gao
Luming Tang
Towaki Takikawa
Fangyin Wei
Xun Huang
Karsten Kreis
Sanja Fidler
Ming-Yuan Liu
Nayeon Lee
238
1,167
0
18 Nov 2022
LAION-5B: An open large-scale dataset for training next generation
  image-text models
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLMMLLMCLIP
237
3,521
0
16 Oct 2022
CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth
  Pre-training
CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training
Tianyu Huang
Bowen Dong
Yunhan Yang
Xiaoshui Huang
Rynson W. H. Lau
Wanli Ouyang
W. Zuo
VLM3DPCCLIP
138
150
0
03 Oct 2022
DreamFusion: Text-to-3D using 2D Diffusion
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
313
2,445
0
29 Sep 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
618
6,106
0
23 May 2022
Generative Adversarial Networks
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
331
30,162
0
01 Mar 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
615
15,855
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with
  Text-Guided Diffusion Models
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
471
3,641
0
20 Dec 2021
ABO: Dataset and Benchmarks for Real-World 3D Object Understanding
ABO: Dataset and Benchmarks for Real-World 3D Object Understanding
Jasmine Collins
Shubham Goel
Kenan Deng
Achleshwar Luthra
Leon L. Xu
...
T. F. Y. Vicente
T. Dideriksen
H. Arora
M. Guillaumin
Jitendra Malik
222
232
0
12 Oct 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
1.1K
30,096
0
26 Feb 2021
3D-FUTURE: 3D Furniture shape with TextURE
3D-FUTURE: 3D Furniture shape with TextURE
Huan Fu
Rongfei Jia
Lin Gao
Biwei Huang
Binqiang Zhao
Stephen J. Maybank
Dacheng Tao
3DV
102
262
0
21 Sep 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
792
10,591
0
17 Feb 2020
Revisiting Point Cloud Classification: A New Benchmark Dataset and
  Classification Model on Real-World Data
Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data
Mikaela Angelina Uy
Quang Pham
Binh-Son Hua
D. Nguyen
Sai-Kit Yeung
3DV3DPC
167
789
0
13 Aug 2019
SGDR: Stochastic Gradient Descent with Warm Restarts
SGDR: Stochastic Gradient Descent with Warm Restarts
I. Loshchilov
Frank Hutter
ODL
378
8,210
0
13 Aug 2016
ShapeNet: An Information-Rich 3D Model Repository
ShapeNet: An Information-Rich 3D Model Repository
Angel X. Chang
Thomas Funkhouser
Leonidas Guibas
Pat Hanrahan
Qi-Xing Huang
...
Shuran Song
Hao Su
Jianxiong Xiao
L. Yi
Feng Yu
3DV
178
5,545
0
09 Dec 2015
1