2101.04702
Cited By
Cross-Modal Contrastive Learning for Text-to-Image Generation
12 January 2021
Han Zhang
Jing Yu Koh
Jason Baldridge
Honglak Lee
Yinfei Yang
GAN
Papers citing
"Cross-Modal Contrastive Learning for Text-to-Image Generation"
50 / 62 papers shown
Title
Learning Graph Representation of Agent Diffusers
Youcef Djenouri
Nassim Belmecheri
Tomasz Michalak
Jan Dubiński
Ahmed Nabil Belbachir
Anis Yazidi
AI4CE
200
0
0
10 May 2025
HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation
Hang Wang
Zhi-Qi Cheng
Chenhao Lin
Chao Shen
Lei Zhang
DiffM
99
0
0
10 May 2025
Continual Multimodal Contrastive Learning
Xiaohao Liu
Xiaobo Xia
See-Kiong Ng
Tat-Seng Chua
CLL
211
2
0
19 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
136
2
0
02 Mar 2025
SOEDiff: Efficient Distillation for Small Object Editing
Yiming Wu
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Ronghua Liang
DiffM
146
0
0
03 Jan 2025
Examining the Prevalence and Dynamics of AI-Generated Media in Art Subreddits
Hana Matatov
Marianne Aubin Le Quere
Ofra Amir
Mor Naaman
84
2
0
09 Oct 2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Sihyun Yu
Sangkyung Kwak
Huiwon Jang
Jongheon Jeong
Jonathan Huang
Jinwoo Shin
Saining Xie
OCL
153
98
0
09 Oct 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
165
54
0
26 Sep 2024
Ensembling Diffusion Models via Adaptive Feature Aggregation
Cong Wang
Kuan Tian
Yonghang Guan
Jun Zhang
Zhiwei Jiang
Fei Shen
Xiao Han
116
5
0
27 May 2024
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
184
31
0
27 Aug 2023
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
Zhixing Zhang
Ligong Han
Arna Ghosh
Dimitris N. Metaxas
Jian Ren
DiffM
125
160
0
08 Dec 2022
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
283
30,149
0
01 Mar 2022
Text-to-Image Generation Grounded by Fine-Grained User Attention
Jing Yu Koh
Jason Baldridge
Honglak Lee
Yinfei Yang
DiffM
113
59
0
07 Nov 2020
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
Ming Tao
Hao Tang
Leilei Gan
Xiaoyuan Jing
Bingkun Bao
Changsheng Xu
98
214
0
13 Aug 2020
Contrastive Learning for Unpaired Image-to-Image Translation
Taesung Park
Alexei A. Efros
Richard Y. Zhang
Jun-Yan Zhu
SSL
86
1,232
0
30 Jul 2020
InfoMax-GAN: Improved Adversarial Image Generation via Information Maximization and Contrastive Learning
Kwot Sin Lee
Ngoc-Trung Tran
Ngai-Man Cheung
GAN
63
68
0
09 Jul 2020
Image Augmentations for GAN Training
Zhengli Zhao
Zizhao Zhang
Ting-Li Chen
Sameer Singh
Han Zhang
61
137
0
04 Jun 2020
What Makes for Good Views for Contrastive Learning?
Yonglong Tian
Chen Sun
Ben Poole
Dilip Krishnan
Cordelia Schmid
Phillip Isola
SSL
114
1,335
0
20 May 2020
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
Zarana Parekh
Jason Baldridge
Daniel Cer
Austin Waters
Yinfei Yang
47
62
0
30 Apr 2020
Audio-Visual Instance Discrimination with Cross-Modal Agreement
Pedro Morgado
Nuno Vasconcelos
Ishan Misra
SSL
80
276
0
27 Apr 2020
Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning
Yu Deng
Jiaolong Yang
Dong Chen
Fang Wen
Xin Tong
CoGe
CVBM
59
347
0
24 Apr 2020
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
486
3,442
0
09 Mar 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting-Li Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
375
18,859
0
13 Feb 2020
CPGAN: Full-Spectrum Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis
Jiadong Liang
Wenjie Pei
Feng Lu
GAN
52
19
0
18 Dec 2019
Connecting Vision and Language with Localized Narratives
Jordi Pont-Tuset
J. Uijlings
Soravit Changpinyo
Radu Soricut
V. Ferrari
ObjD
91
251
0
06 Dec 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
210
12,121
0
13 Nov 2019
Semantic Object Accuracy for Generative Text-to-Image Synthesis
Tobias Hinz
Stefan Heinrich
S. Wermter
EGVM
80
159
0
29 Oct 2019
Consistency Regularization for Generative Adversarial Networks
Han Zhang
Zizhao Zhang
Augustus Odena
Honglak Lee
GAN
66
285
0
26 Oct 2019
Understanding the Limitations of Variational Mutual Information Estimators
Jiaming Song
Stefano Ermon
SSL
DRL
74
204
0
14 Oct 2019
Controllable Text-to-Image Generation
Bowen Li
Xiaojuan Qi
Thomas Lukasiewicz
Philip Torr
GAN
86
357
0
16 Sep 2019
Dual Adversarial Inference for Text-to-Image Synthesis
Qicheng Lao
Mohammad Havaei
Ahmad Pesaranghader
Francis Dutil
Lisa Di-Jorio
T. Fevens
GAN
50
39
0
14 Aug 2019
Semantics Disentangling for Text-to-Image Generation
Guojun Yin
Bin Liu
Lu Sheng
Nenghai Yu
Xiaogang Wang
Jing Shao
54
184
0
02 Apr 2019
DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis
Minfeng Zhu
Pingbo Pan
Wei Chen
Yi Yang
GAN
54
583
0
02 Apr 2019
MirrorGAN: Learning Text-to-image Generation by Redescription
Tingting Qiao
Jing Zhang
Duanqing Xu
Dacheng Tao
VLM
GAN
61
542
0
14 Mar 2019
Object-driven Text-to-Image Synthesis via Adversarial Training
Wenbo Li
Pengchuan Zhang
Lei Zhang
Qiuyuan Huang
Xiaodong He
Siwei Lyu
Jianfeng Gao
GAN
71
302
0
27 Feb 2019
Generating Multiple Objects at Spatially Distinct Locations
Tobias Hinz
Stefan Heinrich
S. Wermter
81
103
0
03 Jan 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,114
0
11 Oct 2018
On Self Modulation for Generative Adversarial Networks
Ting Chen
Mario Lucic
N. Houlsby
Sylvain Gelly
GAN
62
105
0
02 Oct 2018
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Andrew Brock
Jeff Donahue
Karen Simonyan
269
5,401
0
28 Sep 2018
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
Soo-Whan Chung
Joon Son Chung
Hong-Goo Kang
55
117
0
21 Sep 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
351
10,349
0
10 Jul 2018
Self-Attention Generative Adversarial Networks
Han Zhang
Ian Goodfellow
Dimitris N. Metaxas
Augustus Odena
GAN
148
3,729
0
21 May 2018
Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination
Zhirong Wu
Yuanjun Xiong
Stella X. Yu
Dahua Lin
SSL
179
3,465
0
05 May 2018
Improving GANs Using Optimal Transport
Tim Salimans
Han Zhang
Alec Radford
Dimitris N. Metaxas
OT
GAN
72
324
0
15 Mar 2018
Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network
Zizhao Zhang
Yuanpu Xie
Ling Yang
EGVM
93
305
0
26 Feb 2018
Spectral Normalization for Generative Adversarial Networks
Takeru Miyato
Toshiki Kataoka
Masanori Koyama
Yuichi Yoshida
ODL
159
4,442
0
16 Feb 2018
Geometry-Contrastive GAN for Facial Expression Transfer
Fengchun Qiao
Nai-Ming Yao
Zirui Jiao
Zhihao Li
Hui Chen
Hongan Wang
CVBM
GAN
104
51
0
06 Feb 2018
Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis
Seunghoon Hong
Dingdong Yang
Jongwook Choi
Honglak Lee
EGVM
110
337
0
16 Jan 2018
MINE: Mutual Information Neural Estimation
Mohamed Ishmael Belghazi
A. Baratin
Sai Rajeswar
Sherjil Ozair
Yoshua Bengio
Aaron Courville
R. Devon Hjelm
DRL
196
1,280
0
12 Jan 2018
A Note on the Inception Score
Shane T. Barratt
Rishi Sharma
EGVM
99
694
0
06 Jan 2018