Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.06125
Cited By
Hierarchical Text-Conditional Image Generation with CLIP Latents
13 April 2022
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hierarchical Text-Conditional Image Generation with CLIP Latents"
50 / 4,757 papers shown
Title
DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks
Sarah Jabbour
Gregory Kondas
Ella Kazerooni
Michael Sjoding
David Fouhey
Jenna Wiens
FAtt
DiffM
49
1
0
19 Jul 2024
Visual Text Generation in the Wild
Yuanzhi Zhu
Jiawei Liu
Feiyu Gao
Wenyu Liu
Xinggang Wang
Peng Wang
Fei Huang
Cong Yao
Zhibo Yang
DiffM
58
10
0
19 Jul 2024
Stable-Hair: Real-World Hair Transfer via Diffusion Model
Yuxuan Zhang
Qing Zhang
Yiren Song
Jiaming Liu
DiffM
48
6
0
19 Jul 2024
Are handcrafted filters helpful for attributing AI-generated images?
Jialiang Li
Haoyue Wang
Sheng Li
Zhenxing Qian
Xinpeng Zhang
Athanasios V. Vasilakos
40
0
0
19 Jul 2024
NeuroBind: Towards Unified Multimodal Representations for Neural Signals
Fengyu Yang
Chao Feng
Daniel Wang
Tianye Wang
Ziyao Zeng
...
Hyoungseob Park
Pengliang Ji
Han Zhao
Yuanning Li
Alex Wong
53
9
0
19 Jul 2024
LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
Mingkang Zhu
Xi Chen
Zhongdao Wang
Hengshuang Zhao
Jiaya Jia
DiffM
47
2
0
18 Jul 2024
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review
Masatoshi Uehara
Yulai Zhao
Tommaso Biancalani
Sergey Levine
71
22
0
18 Jul 2024
Training-free Composite Scene Generation for Layout-to-Image Synthesis
Jiaqi Liu
Tao Huang
Chang Xu
DiffM
54
5
0
18 Jul 2024
DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays
Xuhui Liu
Zhi Qiao
Runkun Liu
Hong Li
Juan Zhang
Xiantong Zhen
Zhen Qian
Baochang Zhang
MedIm
50
2
0
18 Jul 2024
Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models
Qiao Li
Xiaomeng Fu
Xi Wang
Jin Liu
Xingyu Gao
Jiao Dai
Jizhong Han
38
3
0
18 Jul 2024
Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking
Zhiyuan Ma
Guoli Jia
Biqing Qi
Bowen Zhou
WIGM
81
10
0
18 Jul 2024
GenRC: Generative 3D Room Completion from Sparse Image Collections
Ming-feng Li
Yueh-Feng Ku
Hong-Xuan Yen
Chi Liu
Yu-Lun Liu
Albert Y. C. Chen
Cheng-Hao Kuo
Min Sun
3DV
VGen
62
4
0
17 Jul 2024
SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow
Yuanzhi Zhu
Xingchao Liu
Qiang Liu
51
9
0
17 Jul 2024
IMAGDressing-v1: Customizable Virtual Dressing
Fei Shen
Xin Jiang
Xin He
Hu Ye
Cong Wang
Xiaoyu Du
Zechao Li
Jinghui Tang
DiffM
75
33
0
17 Jul 2024
Towards Understanding Unsafe Video Generation
Yan Pang
Aiping Xiong
Yang Zhang
Tianhao Wang
EGVM
39
2
0
17 Jul 2024
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation
Yi Yao
Chan-Feng Hsu
Jhe-Hao Lin
Hongxia Xie
Terence Lin
Yi-Ning Huang
Hong-Han Shuai
Wen-Huang Cheng
DiffM
44
4
0
17 Jul 2024
Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
Chao Gong
Kai-xiang Chen
Zhipeng Wei
Jingjing Chen
Yulong Jiang
DiffM
65
30
0
17 Jul 2024
ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion Map
Yilin Ye
Shishi Xiao
Xingchen Zeng
Wei Zeng
54
3
0
17 Jul 2024
DreamStory: Open-Domain Story Visualization by LLM-Guided Multi-Subject Consistent Diffusion
Huiguo He
Huan Yang
Zixi Tuo
Yuan Zhou
Qiuyue Wang
Yuhang Zhang
Zeyu Liu
Wenhao Huang
Hongyang Chao
Jian Yin
DiffM
VGen
69
13
0
17 Jul 2024
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Bi-Attribution Maps
Junseo Park
Hyeryung Jang
81
0
0
17 Jul 2024
Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation
Olga Zatsarynna
Emad Bahrami
Yazan Abu Farha
Gianpiero Francesca
Juergen Gall
48
1
0
16 Jul 2024
Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models
Chen Ju
Haicheng Wang
Haozhe Cheng
Xu Chen
Zhonghua Zhai
Weilin Huang
Jinsong Lan
Shuai Xiao
Bo Zheng
VLM
59
5
0
16 Jul 2024
Scaling Diffusion Transformers to 16 Billion Parameters
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Debang Li
Junshi Huang
DiffM
MoE
70
17
0
16 Jul 2024
UP-Diff: Latent Diffusion Model for Remote Sensing Urban Prediction
Zeyu Wang
Zecheng Hao
Jingyu Lin
Yuchao Feng
Yufei Guo
37
2
0
16 Jul 2024
Length-Aware Motion Synthesis via Latent Diffusion
Alessio Sampieri
Alessio Palma
Indro Spinelli
Fabio Galasso
VGen
DiffM
65
7
0
16 Jul 2024
ColorwAI: Generative Colorways of Textiles through GAN and Diffusion Disentanglement
Ludovica Schaerf
Andrea Alfarano
Eric Postma
DiffM
31
2
0
16 Jul 2024
Isometric Representation Learning for Disentangled Latent Space of Diffusion Models
Jaehoon Hahm
Junho Lee
Sunghyun Kim
Joonseok Lee
DiffM
35
7
0
16 Jul 2024
AIGC for Industrial Time Series: From Deep Generative Models to Large Generative Models
Lei Ren
Haiteng Wang
Yang Tang
Yang Tang
Chunhua Yang
AI4TS
AI4CE
54
5
0
16 Jul 2024
InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models
Nirat Saini
Navaneeth Bodla
Ashish Shrivastava
Avinash Ravichandran
Xiao Zhang
Abhinav Shrivastava
Bharat Singh
DiffM
29
2
0
15 Jul 2024
DataDream: Few-shot Guided Dataset Generation
Jae Myung Kim
Jessica Bader
Stephan Alaniz
Cordelia Schmid
Zeynep Akata
44
6
0
15 Jul 2024
Optical Diffusion Models for Image Generation
Ilker Oguz
Niyazi Ulaş Dinç
Mustafa Yildirim
Junjie Ke
Innfarn Yoo
Qifei Wang
Feng Yang
Christophe Moser
D. Psaltis
45
0
0
15 Jul 2024
ConTEXTure: Consistent Multiview Images to Texture
Jaehoon Ahn
Sumin Cho
Harim Jung
Kibeom Hong
Seonghoon Ban
Moon-Ryul Jung
35
0
0
15 Jul 2024
How and where does CLIP process negation?
Vincent Quantmeyer
Pablo Mosteiro
Albert Gatt
CoGe
34
6
0
15 Jul 2024
3DEgo: 3D Editing on the Go!
Umar Khalid
Hasan Iqbal
Azib Farooq
Michael J. Hua
Chong Chen
VGen
34
6
0
14 Jul 2024
Mixed-View Panorama Synthesis using Geospatially Guided Diffusion
Zhexiao Xiong
Xin Xing
Scott Workman
Subash Khanal
Nathan Jacobs
DiffM
MDE
69
1
0
12 Jul 2024
PersonificationNet: Making customized subject act like a person
Tianchu Guo
Pengyu Li
Biao Wang
Xiansheng Hua
39
0
0
12 Jul 2024
Bora: Biomedical Generalist Video Generation Model
Weixiang Sun
Xiaocao You
Ruizhe Zheng
Zhengqing Yuan
Xiang Li
Lifang He
Quanzheng Li
Lichao Sun
VGen
MedIm
35
8
0
12 Jul 2024
AirSketch: Generative Motion to Sketch
Hui Xian Grace Lim
Xuanming Cui
Yogesh S Rawat
Ser-Nam Lim
VGen
DiffM
36
0
0
12 Jul 2024
Surgical Text-to-Image Generation
C. Nwoye
Rupak Bose
K. Elgohary
Lorenzo Arboit
Giorgio Carlino
Joël L. Lavanchy
Pietro Mascagni
N. Padoy
MedIm
55
3
0
12 Jul 2024
Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo Density
Shuangqi Li
Chen Liu
Tong Zhang
Hieu Le
Sabine Süsstrunk
Mathieu Salzmann
DiffM
55
1
0
11 Jul 2024
E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors
Jinxiu Liang
Bohan Yu
Yixin Yang
Yiming Han
Boxin Shi
VGen
DiffM
MDE
41
0
0
11 Jul 2024
Coherent and Multi-modality Image Inpainting via Latent Space Optimization
Lingzhi Pan
Tong Zhang
Bingyuan Chen
Qi Zhou
Wei Ke
Sabine Süsstrunk
Mathieu Salzmann
DiffM
45
2
0
10 Jul 2024
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Wanggui He
Siming Fu
Mushui Liu
Xierui Wang
Wenyi Xiao
...
Zhelun Yu
Haoyuan Li
Ziwei Huang
Leilei Gan
Hao Jiang
DiffM
35
23
0
10 Jul 2024
A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
Daizong Liu
Mingyu Yang
Xiaoye Qu
Pan Zhou
Yu Cheng
Wei Hu
ELM
AAML
46
26
0
10 Jul 2024
CamFreeDiff: Camera-free Image to Panorama Generation with Diffusion Model
Xiaoding Yuan
Shitao Tang
Kejie Li
Alan Yuille
Peng Wang
DiffM
47
3
0
09 Jul 2024
ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction
Shaozhe Hao
Kai Han
Zhengyao Lv
Shihao Zhao
Kwan-Yee K. Wong
DiffM
CoGe
41
6
0
09 Jul 2024
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
Guian Fang
Wenbiao Yan
Yuanfan Guo
J. N. Han
Zutao Jiang
Hang Xu
Shengcai Liao
Xiaodan Liang
38
5
0
09 Jul 2024
Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning
Fanyue Wei
Wei Zeng
Zhenyang Li
Dawei Yin
Lixin Duan
Wen Li
EGVM
39
2
0
09 Jul 2024
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
Yu Zeng
Vishal M. Patel
Haochen Wang
Xun Huang
Ting-Chun Wang
Xuan Li
Yogesh Balaji
DiffM
32
18
0
08 Jul 2024
Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis
Emaad Khwaja
Abdullah Rashwan
Ting Chen
Oliver Wang
Suraj Kothawade
Yeqing Li
DiffM
48
0
0
08 Jul 2024
Previous
1
2
3
...
22
23
24
...
94
95
96
Next