ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2110.02624
  4. Cited By
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation

CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation

6 October 2021
Aditya Sanghi
Hang Chu
Joseph G. Lambourne
Ye Wang
Chin-Yi Cheng
Marco Fumero
Kamal Rahimi Malekshan
    CLIP
ArXivPDFHTML

Papers citing "CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation"

36 / 236 papers shown
Title
Magic3D: High-Resolution Text-to-3D Content Creation
Magic3D: High-Resolution Text-to-3D Content Creation
Chen-Hsuan Lin
Jun Gao
Luming Tang
Towaki Takikawa
Fangyin Wei
Xun Huang
Karsten Kreis
Sanja Fidler
Ming Liu
Nayeon Lee
67
1,119
0
18 Nov 2022
Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures
Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures
G. Metzer
Elad Richardson
Or Patashnik
Raja Giryes
Daniel Cohen-Or
DiffM
71
453
0
14 Nov 2022
Zero-shot Video Moment Retrieval With Off-the-Shelf Models
Zero-shot Video Moment Retrieval With Off-the-Shelf Models
Anuj Diwan
Puyuan Peng
Raymond J. Mooney
VLM
28
3
0
03 Nov 2022
CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes
  from Natural Language
CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes from Natural Language
Aditya Sanghi
Rao Fu
Vivian Liu
Karl Willis
Hooman Shayani
Amir Hosein Khasahmadi
Srinath Sridhar
Daniel E. Ritchie
25
52
0
02 Nov 2022
Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation
  with Wordless Training
Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation with Wordless Training
Junfan Lin
Jianlong Chang
Lingbo Liu
Guanbin Li
Liang Lin
Qi Tian
Changan Chen
VGen
66
40
0
28 Oct 2022
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
Vivian Liu
Jo Vermeulen
G. Fitzmaurice
Justin Matejka
HAI
33
119
0
20 Oct 2022
TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting
  Decomposition
TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition
Yuxiao Chen
Rui Chen
Jiabao Lei
Yabin Zhang
Kui Jia
CLIP
27
81
0
20 Oct 2022
LION: Latent Point Diffusion Models for 3D Shape Generation
LION: Latent Point Diffusion Models for 3D Shape Generation
Fangyin Wei
Arash Vahdat
Francis Williams
Zan Gojcic
Or Litany
Sanja Fidler
Karsten Kreis
DiffM
73
489
0
12 Oct 2022
AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio
  Visual Event Localization
AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization
Tanvir Mahmud
Diana Marculescu
CLIP
19
31
0
11 Oct 2022
Understanding Pure CLIP Guidance for Voxel Grid NeRF Models
Understanding Pure CLIP Guidance for Voxel Grid NeRF Models
Han-Hung Lee
Angel X. Chang
24
63
0
30 Sep 2022
DreamFusion: Text-to-3D using 2D Diffusion
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
85
2,323
0
29 Sep 2022
GAMA: Generative Adversarial Multi-Object Scene Attacks
GAMA: Generative Adversarial Multi-Object Scene Attacks
Abhishek Aich
Calvin-Khang Ta
Akash Gupta
Chengyu Song
S. Krishnamurthy
Ulugbek S. Kamilov
Amit K. Roy-Chowdhury
AAML
53
17
0
20 Sep 2022
ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu
Peng Dai
Ruihui Li
Xiaojuan Qi
Chi-Wing Fu
DiffM
182
25
0
09 Sep 2022
Prompt Tuning with Soft Context Sharing for Vision-Language Models
Prompt Tuning with Soft Context Sharing for Vision-Language Models
Kun Ding
Ying Wang
Pengzhang Liu
Qiang Yu
Hao Zhang
Shiming Xiang
Chunhong Pan
VPVLM
VLM
29
14
0
29 Aug 2022
DALLE-URBAN: Capturing the urban design expertise of large text to image
  transformers
DALLE-URBAN: Capturing the urban design expertise of large text to image transformers
Sachith Seneviratne
Damith A. Senanayake
Sanka Rasnayaka
Rajith Vidanaarachchi
Jason Thompson
ViT
19
17
0
03 Aug 2022
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
Rao Fu
Xiaoyu Zhan
Yiwen Chen
Daniel E. Ritchie
Srinath Sridhar
39
79
0
19 Jul 2022
Text-Driven Stylization of Video Objects
Text-Driven Stylization of Video Objects
Sebastian Loeschcke
Serge Belongie
Sagie Benaim
VGen
DiffM
30
16
0
24 Jun 2022
Multimodal Learning with Transformers: A Survey
Multimodal Learning with Transformers: A Survey
Peng Xu
Xiatian Zhu
David Clifton
ViT
79
530
0
13 Jun 2022
Volumetric Disentanglement for 3D Scene Manipulation
Volumetric Disentanglement for 3D Scene Manipulation
Sagie Benaim
Frederik Warburg
Peter Ebert Christensen
Serge Belongie
42
15
0
06 Jun 2022
CyCLIP: Cyclic Contrastive Language-Image Pretraining
CyCLIP: Cyclic Contrastive Language-Image Pretraining
Shashank Goel
Hritik Bansal
S. Bhatia
Ryan A. Rossi
Vishwa Vinay
Aditya Grover
CLIP
VLM
184
134
0
28 May 2022
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
Fangzhou Hong
Mingyuan Zhang
Liang Pan
Zhongang Cai
Lei Yang
Ziwei Liu
CLIP
98
79
0
17 May 2022
Language-Grounded Indoor 3D Semantic Segmentation in the Wild
Language-Grounded Indoor 3D Semantic Segmentation in the Wild
Dávid Rozenberszki
Or Litany
Angela Dai
3DV
VLM
23
184
0
16 Apr 2022
CLIP-Mesh: Generating textured meshes from text using pretrained
  image-text models
CLIP-Mesh: Generating textured meshes from text using pretrained image-text models
N. Khalid
Tianhao Xie
Eugene Belilovsky
Tiberiu Popa
CLIP
30
291
0
24 Mar 2022
MotionCLIP: Exposing Human Motion Generation to CLIP Space
MotionCLIP: Exposing Human Motion Generation to CLIP Space
Guy Tevet
Brian Gordon
Amir Hertz
Amit H. Bermano
Daniel Cohen-Or
CLIP
39
325
0
15 Mar 2022
Text and Image Guided 3D Avatar Generation and Manipulation
Text and Image Guided 3D Avatar Generation and Manipulation
Zehranaz Canfes
M. Atasoy
Alara Dirik
Pinar Yanardag
3DH
36
42
0
12 Feb 2022
PartGlot: Learning Shape Part Segmentation from Language Reference Games
PartGlot: Learning Shape Part Segmentation from Language Reference Games
Juil Koo
Ian Huang
Panos Achlioptas
Leonidas J. Guibas
Minhyuk Sung
3DPC
40
28
0
13 Dec 2021
Text2Mesh: Text-Driven Neural Stylization for Meshes
Text2Mesh: Text-Driven Neural Stylization for Meshes
O. Michel
Roi Bar-On
Richard Liu
Sagie Benaim
Rana Hanocka
CLIP
AI4CE
226
353
0
06 Dec 2021
Zero-Shot Text-Guided Object Generation with Dream Fields
Zero-Shot Text-Guided Object Generation with Dream Fields
Ajay Jain
B. Mildenhall
Jonathan T. Barron
Pieter Abbeel
Ben Poole
49
561
0
02 Dec 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic
  Arithmetic
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
34
192
0
29 Nov 2021
Learning to Prompt for Vision-Language Models
Learning to Prompt for Vision-Language Models
Kaiyang Zhou
Jingkang Yang
Chen Change Loy
Ziwei Liu
VPVLM
CLIP
VLM
350
2,286
0
02 Sep 2021
How Much Can CLIP Benefit Vision-and-Language Tasks?
How Much Can CLIP Benefit Vision-and-Language Tasks?
Sheng Shen
Liunian Harold Li
Hao Tan
Joey Tianyi Zhou
Anna Rohrbach
Kai-Wei Chang
Z. Yao
Kurt Keutzer
CLIP
VLM
MLLM
202
406
0
13 Jul 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,805
0
24 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
340
3,726
0
11 Feb 2021
Convolutional Occupancy Networks
Convolutional Occupancy Networks
Songyou Peng
Michael Niemeyer
L. Mescheder
Marc Pollefeys
Andreas Geiger
3DV
AI4CE
232
974
0
10 Mar 2020
PointNet: Deep Learning on Point Sets for 3D Classification and
  Segmentation
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas J. Guibas
3DH
3DPC
3DV
PINN
222
14,131
0
02 Dec 2016
Learning a Probabilistic Latent Space of Object Shapes via 3D
  Generative-Adversarial Modeling
Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling
Jiajun Wu
Chengkai Zhang
Tianfan Xue
Bill Freeman
J. Tenenbaum
GAN
191
1,942
0
24 Oct 2016
Previous
12345