ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.04702
  4. Cited By
Cross-Modal Contrastive Learning for Text-to-Image Generation
v1v2v3v4v5 (latest)

Cross-Modal Contrastive Learning for Text-to-Image Generation

12 January 2021
Han Zhang
Jing Yu Koh
Jason Baldridge
Honglak Lee
Yinfei Yang
    GAN
ArXiv (abs)PDFHTML

Papers citing "Cross-Modal Contrastive Learning for Text-to-Image Generation"

50 / 62 papers shown
Title
Learning Graph Representation of Agent Diffusers
Learning Graph Representation of Agent Diffusers
Youcef Djenouri
Nassim Belmecheri
Tomasz Michalak
Jan Dubiñski
Ahmed Nabil Belbachir
Anis Yazidi
AI4CE
200
0
0
10 May 2025
HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation
HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation
Hang Wang
Zhi-Qi Cheng
Chenhao Lin
Chao Shen
Lei Zhang
DiffM
99
0
0
10 May 2025
Continual Multimodal Contrastive Learning
Continual Multimodal Contrastive Learning
Xiaohao Liu
Xiaobo Xia
See-Kiong Ng
Tat-Seng Chua
CLL
211
2
0
19 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
136
2
0
02 Mar 2025
SOEDiff: Efficient Distillation for Small Object Editing
SOEDiff: Efficient Distillation for Small Object Editing
Yiming Wu
Qihe Pan
Zhen Zhao
Zicheng Wang
Sifan Long
Ronghua Liang
DiffM
146
0
0
03 Jan 2025
Examining the Prevalence and Dynamics of AI-Generated Media in Art Subreddits
Examining the Prevalence and Dynamics of AI-Generated Media in Art Subreddits
Hana Matatov
Marianne Aubin Le Quere
Ofra Amir
Mor Naaman
84
2
0
09 Oct 2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Sihyun Yu
Sangkyung Kwak
Huiwon Jang
Jongheon Jeong
Jonathan Huang
Jinwoo Shin
Saining Xie
OCL
153
98
0
09 Oct 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffMVLM
165
54
0
26 Sep 2024
Ensembling Diffusion Models via Adaptive Feature Aggregation
Ensembling Diffusion Models via Adaptive Feature Aggregation
Cong Wang
Kuan Tian
Yonghang Guan
Jun Zhang
Zhiwei Jiang
Fei Shen
Xiao Han
116
5
0
27 May 2024
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
184
31
0
27 Aug 2023
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
Zhixing Zhang
Ligong Han
Arna Ghosh
Dimitris N. Metaxas
Jian Ren
DiffM
125
160
0
08 Dec 2022
Generative Adversarial Networks
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
283
30,149
0
01 Mar 2022
Text-to-Image Generation Grounded by Fine-Grained User Attention
Text-to-Image Generation Grounded by Fine-Grained User Attention
Jing Yu Koh
Jason Baldridge
Honglak Lee
Yinfei Yang
DiffM
113
59
0
07 Nov 2020
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
Ming Tao
Hao Tang
Leilei Gan
Xiaoyuan Jing
Bingkun Bao
Changsheng Xu
98
214
0
13 Aug 2020
Contrastive Learning for Unpaired Image-to-Image Translation
Contrastive Learning for Unpaired Image-to-Image Translation
Taesung Park
Alexei A. Efros
Richard Y. Zhang
Jun-Yan Zhu
SSL
86
1,232
0
30 Jul 2020
InfoMax-GAN: Improved Adversarial Image Generation via Information
  Maximization and Contrastive Learning
InfoMax-GAN: Improved Adversarial Image Generation via Information Maximization and Contrastive Learning
Kwot Sin Lee
Ngoc-Trung Tran
Ngai-Man Cheung
GAN
63
68
0
09 Jul 2020
Image Augmentations for GAN Training
Image Augmentations for GAN Training
Zhengli Zhao
Zizhao Zhang
Ting-Li Chen
Sameer Singh
Han Zhang
61
137
0
04 Jun 2020
What Makes for Good Views for Contrastive Learning?
What Makes for Good Views for Contrastive Learning?
Yonglong Tian
Chen Sun
Ben Poole
Dilip Krishnan
Cordelia Schmid
Phillip Isola
SSL
114
1,335
0
20 May 2020
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic
  Similarity Judgments for MS-COCO
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
Zarana Parekh
Jason Baldridge
Daniel Cer
Austin Waters
Yinfei Yang
47
62
0
30 Apr 2020
Audio-Visual Instance Discrimination with Cross-Modal Agreement
Audio-Visual Instance Discrimination with Cross-Modal Agreement
Pedro Morgado
Nuno Vasconcelos
Ishan Misra
SSL
80
276
0
27 Apr 2020
Disentangled and Controllable Face Image Generation via 3D
  Imitative-Contrastive Learning
Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning
Yu Deng
Jiaolong Yang
Dong Chen
Fang Wen
Xin Tong
CoGeCVBM
59
347
0
24 Apr 2020
Improved Baselines with Momentum Contrastive Learning
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
486
3,442
0
09 Mar 2020
A Simple Framework for Contrastive Learning of Visual Representations
A Simple Framework for Contrastive Learning of Visual Representations
Ting-Li Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
375
18,859
0
13 Feb 2020
CPGAN: Full-Spectrum Content-Parsing Generative Adversarial Networks for
  Text-to-Image Synthesis
CPGAN: Full-Spectrum Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis
Jiadong Liang
Wenjie Pei
Feng Lu
GAN
52
19
0
18 Dec 2019
Connecting Vision and Language with Localized Narratives
Connecting Vision and Language with Localized Narratives
Jordi Pont-Tuset
J. Uijlings
Soravit Changpinyo
Radu Soricut
V. Ferrari
ObjD
91
251
0
06 Dec 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
210
12,121
0
13 Nov 2019
Semantic Object Accuracy for Generative Text-to-Image Synthesis
Semantic Object Accuracy for Generative Text-to-Image Synthesis
Tobias Hinz
Stefan Heinrich
S. Wermter
EGVM
80
159
0
29 Oct 2019
Consistency Regularization for Generative Adversarial Networks
Consistency Regularization for Generative Adversarial Networks
Han Zhang
Zizhao Zhang
Augustus Odena
Honglak Lee
GAN
66
285
0
26 Oct 2019
Understanding the Limitations of Variational Mutual Information
  Estimators
Understanding the Limitations of Variational Mutual Information Estimators
Jiaming Song
Stefano Ermon
SSLDRL
74
204
0
14 Oct 2019
Controllable Text-to-Image Generation
Controllable Text-to-Image Generation
Bowen Li
Xiaojuan Qi
Thomas Lukasiewicz
Philip Torr
GAN
86
357
0
16 Sep 2019
Dual Adversarial Inference for Text-to-Image Synthesis
Dual Adversarial Inference for Text-to-Image Synthesis
Qicheng Lao
Mohammad Havaei
Ahmad Pesaranghader
Francis Dutil
Lisa Di-Jorio
T. Fevens
GAN
50
39
0
14 Aug 2019
Semantics Disentangling for Text-to-Image Generation
Semantics Disentangling for Text-to-Image Generation
Guojun Yin
Bin Liu
Lu Sheng
Nenghai Yu
Xiaogang Wang
Jing Shao
54
184
0
02 Apr 2019
DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image
  Synthesis
DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis
Minfeng Zhu
Pingbo Pan
Wei Chen
Yi Yang
GAN
54
583
0
02 Apr 2019
MirrorGAN: Learning Text-to-image Generation by Redescription
MirrorGAN: Learning Text-to-image Generation by Redescription
Tingting Qiao
Jing Zhang
Duanqing Xu
Dacheng Tao
VLMGAN
61
542
0
14 Mar 2019
Object-driven Text-to-Image Synthesis via Adversarial Training
Object-driven Text-to-Image Synthesis via Adversarial Training
Wenbo Li
Pengchuan Zhang
Lei Zhang
Qiuyuan Huang
Xiaodong He
Siwei Lyu
Jianfeng Gao
GAN
71
302
0
27 Feb 2019
Generating Multiple Objects at Spatially Distinct Locations
Generating Multiple Objects at Spatially Distinct Locations
Tobias Hinz
Stefan Heinrich
S. Wermter
81
103
0
03 Jan 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,114
0
11 Oct 2018
On Self Modulation for Generative Adversarial Networks
On Self Modulation for Generative Adversarial Networks
Ting Chen
Mario Lucic
N. Houlsby
Sylvain Gelly
GAN
62
105
0
02 Oct 2018
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Andrew Brock
Jeff Donahue
Karen Simonyan
269
5,401
0
28 Sep 2018
Perfect match: Improved cross-modal embeddings for audio-visual
  synchronisation
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
Soo-Whan Chung
Joon Son Chung
Hong-Goo Kang
55
117
0
21 Sep 2018
Representation Learning with Contrastive Predictive Coding
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRLSSL
351
10,349
0
10 Jul 2018
Self-Attention Generative Adversarial Networks
Self-Attention Generative Adversarial Networks
Han Zhang
Ian Goodfellow
Dimitris N. Metaxas
Augustus Odena
GAN
148
3,729
0
21 May 2018
Unsupervised Feature Learning via Non-Parametric Instance-level
  Discrimination
Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination
Zhirong Wu
Yuanjun Xiong
Stella X. Yu
Dahua Lin
SSL
179
3,465
0
05 May 2018
Improving GANs Using Optimal Transport
Improving GANs Using Optimal Transport
Tim Salimans
Han Zhang
Alec Radford
Dimitris N. Metaxas
OTGAN
72
324
0
15 Mar 2018
Photographic Text-to-Image Synthesis with a Hierarchically-nested
  Adversarial Network
Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network
Zizhao Zhang
Yuanpu Xie
Ling Yang
EGVM
93
305
0
26 Feb 2018
Spectral Normalization for Generative Adversarial Networks
Spectral Normalization for Generative Adversarial Networks
Takeru Miyato
Toshiki Kataoka
Masanori Koyama
Yuichi Yoshida
ODL
159
4,442
0
16 Feb 2018
Geometry-Contrastive GAN for Facial Expression Transfer
Geometry-Contrastive GAN for Facial Expression Transfer
Fengchun Qiao
Nai-Ming Yao
Zirui Jiao
Zhihao Li
Hui Chen
Hongan Wang
CVBMGAN
104
51
0
06 Feb 2018
Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis
Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis
Seunghoon Hong
Dingdong Yang
Jongwook Choi
Honglak Lee
EGVM
110
337
0
16 Jan 2018
MINE: Mutual Information Neural Estimation
MINE: Mutual Information Neural Estimation
Mohamed Ishmael Belghazi
A. Baratin
Sai Rajeswar
Sherjil Ozair
Yoshua Bengio
Aaron Courville
R. Devon Hjelm
DRL
196
1,280
0
12 Jan 2018
A Note on the Inception Score
A Note on the Inception Score
Shane T. Barratt
Rishi Sharma
EGVM
99
694
0
06 Jan 2018
12
Next