ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.06125
  4. Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents

Hierarchical Text-Conditional Image Generation with CLIP Latents

13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
    VLM
    DiffM
ArXivPDFHTML

Papers citing "Hierarchical Text-Conditional Image Generation with CLIP Latents"

50 / 4,757 papers shown
Title
DEPICT: Diffusion-Enabled Permutation Importance for Image
  Classification Tasks
DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks
Sarah Jabbour
Gregory Kondas
Ella Kazerooni
Michael Sjoding
David Fouhey
Jenna Wiens
FAtt
DiffM
49
1
0
19 Jul 2024
Visual Text Generation in the Wild
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
58
10
0
19 Jul 2024
Stable-Hair: Real-World Hair Transfer via Diffusion Model
Stable-Hair: Real-World Hair Transfer via Diffusion Model
Yuxuan Zhang
Qing Zhang
Yiren Song
Jiaming Liu
DiffM
48
6
0
19 Jul 2024
Are handcrafted filters helpful for attributing AI-generated images?
Are handcrafted filters helpful for attributing AI-generated images?
Jialiang Li
Haoyue Wang
Sheng Li
Zhenxing Qian
Xinpeng Zhang
Athanasios V. Vasilakos
40
0
0
19 Jul 2024
NeuroBind: Towards Unified Multimodal Representations for Neural Signals
NeuroBind: Towards Unified Multimodal Representations for Neural Signals
Fengyu Yang
Chao Feng
Daniel Wang
Tianye Wang
Ziyao Zeng
...
Hyoungseob Park
Pengliang Ji
Han Zhao
Yuanning Li
Alex Wong
53
9
0
19 Jul 2024
LogoSticker: Inserting Logos into Diffusion Models for Customized
  Generation
LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
Mingkang Zhu
Xi Chen
Zhongdao Wang
Hengshuang Zhao
Jiaya Jia
DiffM
47
2
0
18 Jul 2024
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion
  Models: A Tutorial and Review
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review
Masatoshi Uehara
Yulai Zhao
Tommaso Biancalani
Sergey Levine
71
22
0
18 Jul 2024
Training-free Composite Scene Generation for Layout-to-Image Synthesis
Training-free Composite Scene Generation for Layout-to-Image Synthesis
Jiaqi Liu
Tao Huang
Chang Xu
DiffM
54
5
0
18 Jul 2024
DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar
  X-Rays
DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays
Xuhui Liu
Zhi Qiao
Runkun Liu
Hong Li
Juan Zhang
Xiantong Zhen
Zhen Qian
Baochang Zhang
MedIm
50
2
0
18 Jul 2024
Unveiling Structural Memorization: Structural Membership Inference
  Attack for Text-to-Image Diffusion Models
Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models
Qiao Li
Xiaomeng Fu
Xi Wang
Jin Liu
Xingyu Gao
Jiao Dai
Jizhong Han
38
3
0
18 Jul 2024
Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger
  for Invisible Generative Watermarking
Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking
Zhiyuan Ma
Guoli Jia
Biqing Qi
Bowen Zhou
WIGM
81
10
0
18 Jul 2024
GenRC: Generative 3D Room Completion from Sparse Image Collections
GenRC: Generative 3D Room Completion from Sparse Image Collections
Ming-feng Li
Yueh-Feng Ku
Hong-Xuan Yen
Chi Liu
Yu-Lun Liu
Albert Y. C. Chen
Cheng-Hao Kuo
Min Sun
3DV
VGen
62
4
0
17 Jul 2024
SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow
SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow
Yuanzhi Zhu
Xingchao Liu
Qiang Liu
51
9
0
17 Jul 2024
IMAGDressing-v1: Customizable Virtual Dressing
IMAGDressing-v1: Customizable Virtual Dressing
Fei Shen
Xin Jiang
Xin He
Hu Ye
Cong Wang
Xiaoyu Du
Zechao Li
Jinghui Tang
DiffM
75
33
0
17 Jul 2024
Towards Understanding Unsafe Video Generation
Towards Understanding Unsafe Video Generation
Yan Pang
Aiping Xiong
Yang Zhang
Tianhao Wang
EGVM
39
2
0
17 Jul 2024
The Fabrication of Reality and Fantasy: Scene Generation with
  LLM-Assisted Prompt Interpretation
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Yi Yao
Chan-Feng Hsu
Jhe-Hao Lin
Hongxia Xie
Terence Lin
Yi-Ning Huang
Hong-Han Shuai
Wen-Huang Cheng
DiffM
44
4
0
17 Jul 2024
Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Chao Gong
Kai-xiang Chen
Zhipeng Wei
Jingjing Chen
Yulong Jiang
DiffM
65
30
0
17 Jul 2024
ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via
  Modal Fusion Map
ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion Map
Yilin Ye
Shishi Xiao
Xingchen Zeng
Wei Zeng
54
3
0
17 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
69
13
0
17 Jul 2024
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park
Hyeryung Jang
81
0
0
17 Jul 2024
Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation
Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation
Olga Zatsarynna
Emad Bahrami
Yazan Abu Farha
Gianpiero Francesca
Juergen Gall
48
1
0
16 Jul 2024
Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language
  Large Models
Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models
Chen Ju
Haicheng Wang
Haozhe Cheng
Xu Chen
Zhonghua Zhai
Weilin Huang
Jinsong Lan
Shuai Xiao
Bo Zheng
VLM
59
5
0
16 Jul 2024
Scaling Diffusion Transformers to 16 Billion Parameters
Scaling Diffusion Transformers to 16 Billion Parameters
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Junshi Huang
DiffM
MoE
70
17
0
16 Jul 2024
UP-Diff: Latent Diffusion Model for Remote Sensing Urban Prediction
UP-Diff: Latent Diffusion Model for Remote Sensing Urban Prediction
Zeyu Wang
Zecheng Hao
Jingyu Lin
Yuchao Feng
Yufei Guo
37
2
0
16 Jul 2024
Length-Aware Motion Synthesis via Latent Diffusion
Length-Aware Motion Synthesis via Latent Diffusion
Alessio Sampieri
Alessio Palma
Indro Spinelli
Fabio Galasso
VGen
DiffM
65
7
0
16 Jul 2024
ColorwAI: Generative Colorways of Textiles through GAN and Diffusion
  Disentanglement
ColorwAI: Generative Colorways of Textiles through GAN and Diffusion Disentanglement
Ludovica Schaerf
Andrea Alfarano
Eric Postma
DiffM
31
2
0
16 Jul 2024
Isometric Representation Learning for Disentangled Latent Space of
  Diffusion Models
Isometric Representation Learning for Disentangled Latent Space of Diffusion Models
Jaehoon Hahm
Junho Lee
Sunghyun Kim
Joonseok Lee
DiffM
35
7
0
16 Jul 2024
AIGC for Industrial Time Series: From Deep Generative Models to Large Generative Models
AIGC for Industrial Time Series: From Deep Generative Models to Large Generative Models
Lei Ren
Haiteng Wang
Yang Tang
Yang Tang
Chunhua Yang
AI4TS
AI4CE
54
5
0
16 Jul 2024
InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models
InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models
Nirat Saini
Navaneeth Bodla
Ashish Shrivastava
Avinash Ravichandran
Xiao Zhang
Abhinav Shrivastava
Bharat Singh
DiffM
29
2
0
15 Jul 2024
DataDream: Few-shot Guided Dataset Generation
DataDream: Few-shot Guided Dataset Generation
Jae Myung Kim
Jessica Bader
Stephan Alaniz
Cordelia Schmid
Zeynep Akata
44
6
0
15 Jul 2024
Optical Diffusion Models for Image Generation
Optical Diffusion Models for Image Generation
Ilker Oguz
Niyazi Ulaş Dinç
Mustafa Yildirim
Junjie Ke
Innfarn Yoo
Qifei Wang
Feng Yang
Christophe Moser
D. Psaltis
45
0
0
15 Jul 2024
ConTEXTure: Consistent Multiview Images to Texture
ConTEXTure: Consistent Multiview Images to Texture
Jaehoon Ahn
Sumin Cho
Harim Jung
Kibeom Hong
Seonghoon Ban
Moon-Ryul Jung
35
0
0
15 Jul 2024
How and where does CLIP process negation?
How and where does CLIP process negation?
Vincent Quantmeyer
Pablo Mosteiro
Albert Gatt
CoGe
34
6
0
15 Jul 2024
3DEgo: 3D Editing on the Go!
3DEgo: 3D Editing on the Go!
Umar Khalid
Hasan Iqbal
Azib Farooq
Michael J. Hua
Chong Chen
VGen
34
6
0
14 Jul 2024
Mixed-View Panorama Synthesis using Geospatially Guided Diffusion
Mixed-View Panorama Synthesis using Geospatially Guided Diffusion
Zhexiao Xiong
Xin Xing
Scott Workman
Subash Khanal
Nathan Jacobs
DiffM
MDE
69
1
0
12 Jul 2024
PersonificationNet: Making customized subject act like a person
PersonificationNet: Making customized subject act like a person
Tianchu Guo
Pengyu Li
Biao Wang
Xiansheng Hua
39
0
0
12 Jul 2024
Bora: Biomedical Generalist Video Generation Model
Bora: Biomedical Generalist Video Generation Model
Weixiang Sun
Xiaocao You
Ruizhe Zheng
Zhengqing Yuan
Xiang Li
Lifang He
Quanzheng Li
Lichao Sun
VGen
MedIm
35
8
0
12 Jul 2024
AirSketch: Generative Motion to Sketch
AirSketch: Generative Motion to Sketch
Hui Xian Grace Lim
Xuanming Cui
Yogesh S Rawat
Ser-Nam Lim
VGen
DiffM
36
0
0
12 Jul 2024
Surgical Text-to-Image Generation
Surgical Text-to-Image Generation
C. Nwoye
Rupak Bose
K. Elgohary
Lorenzo Arboit
Giorgio Carlino
Joël L. Lavanchy
Pietro Mascagni
N. Padoy
MedIm
55
3
0
12 Jul 2024
Controlling the Fidelity and Diversity of Deep Generative Models via
  Pseudo Density
Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density
Shuangqi Li
Chen Liu
Tong Zhang
Hieu Le
Sabine Süsstrunk
Mathieu Salzmann
DiffM
55
1
0
11 Jul 2024
E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion
  Priors
E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors
Jinxiu Liang
Bohan Yu
Yixin Yang
Yiming Han
Boxin Shi
VGen
DiffM
MDE
41
0
0
11 Jul 2024
Coherent and Multi-modality Image Inpainting via Latent Space
  Optimization
Coherent and Multi-modality Image Inpainting via Latent Space Optimization
Lingzhi Pan
Tong Zhang
Bingyuan Chen
Qi Zhou
Wei Ke
Sabine Süsstrunk
Mathieu Salzmann
DiffM
45
2
0
10 Jul 2024
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image
  Synthesis
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Wanggui He
Siming Fu
Mushui Liu
Xierui Wang
Wenyi Xiao
...
Zhelun Yu
Haoyuan Li
Ziwei Huang
Leilei Gan
Hao Jiang
DiffM
35
23
0
10 Jul 2024
A Survey of Attacks on Large Vision-Language Models: Resources,
  Advances, and Future Trends
A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
Daizong Liu
Mingyu Yang
Xiaoye Qu
Pan Zhou
Yu Cheng
Wei Hu
ELM
AAML
46
26
0
10 Jul 2024
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion
  Model
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
Xiaoding Yuan
Shitao Tang
Kejie Li
Alan Yuille
Peng Wang
DiffM
47
3
0
09 Jul 2024
ConceptExpress: Harnessing Diffusion Models for Single-image
  Unsupervised Concept Extraction
ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Shaozhe Hao
Kai Han
Zhengyao Lv
Shihao Zhao
Kwan-Yee K. Wong
DiffM
CoGe
41
6
0
09 Jul 2024
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with
  Coarse-to-fine Pose-Reversible Guidance
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
Guian Fang
Wenbiao Yan
Yuanfan Guo
J. N. Han
Zutao Jiang
Hang Xu
Shengcai Liao
Xiaodan Liang
38
5
0
09 Jul 2024
Powerful and Flexible: Personalized Text-to-Image Generation via
  Reinforcement Learning
Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
Fanyue Wei
Wei Zeng
Zhenyang Li
Dawei Yin
Lixin Duan
Wen Li
EGVM
39
2
0
09 Jul 2024
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized
  Text-to-Image Generation
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
Yu Zeng
Vishal M. Patel
Haochen Wang
Xun Huang
Ting-Chun Wang
Xuan Li
Yogesh Balaji
DiffM
32
18
0
08 Jul 2024
Layered Diffusion Model for One-Shot High Resolution Text-to-Image
  Synthesis
Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis
Emaad Khwaja
Abdullah Rashwan
Ting Chen
Oliver Wang
Suraj Kothawade
Yeqing Li
DiffM
48
0
0
08 Jul 2024
Previous
123...222324...949596
Next