ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.10485
  4. Cited By
AttnGAN: Fine-Grained Text to Image Generation with Attentional
  Generative Adversarial Networks

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

28 November 2017
Tao Xu
Pengchuan Zhang
Qiuyuan Huang
Han Zhang
Zhe Gan
Xiaolei Huang
Xiaodong He
    GANViT
ArXiv (abs)PDFHTML

Papers citing "AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks"

50 / 822 papers shown
Title
Vision+X: A Survey on Multimodal Learning in the Light of Data
Vision+X: A Survey on Multimodal Learning in the Light of Data
Ye Zhu
Yuehua Wu
N. Sebe
Yan Yan
107
19
0
05 Oct 2022
ManiCLIP: Multi-Attribute Face Manipulation from Text
ManiCLIP: Multi-Attribute Face Manipulation from Text
Hao Wang
Guosheng Lin
A. Molino
Anran Wang
Jiashi Feng
Zehuan Yuan
CVBM
95
9
0
02 Oct 2022
T2CI-GAN: Text to Compressed Image generation using Generative
  Adversarial Network
T2CI-GAN: Text to Compressed Image generation using Generative Adversarial Network
B. Rajesh
Nandakishore Dusa
M. Javed
S. Dubey
P. Nagabhushan
GAN
35
7
0
01 Oct 2022
Make-A-Video: Text-to-Video Generation without Text-Video Data
Make-A-Video: Text-to-Video Generation without Text-Video Data
Uriel Singer
Adam Polyak
Thomas Hayes
Xiaoyue Yin
Jie An
...
Oron Ashual
Oran Gafni
Devi Parikh
Sonal Gupta
Yaniv Taigman
DiffMVGen
97
1,440
0
29 Sep 2022
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image
  Generation
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation
Xintian Wu
Hanbin Zhao
Liangli Zheng
Shouhong Ding
Xi Li
65
15
0
28 Sep 2022
Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal
  Guided Diffusion
Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided Diffusion
Nisha Huang
Fan Tang
Weiming Dong
Changsheng Xu
DiffM
188
43
0
27 Sep 2022
All are Worth Words: A ViT Backbone for Diffusion Models
All are Worth Words: A ViT Backbone for Diffusion Models
Fan Bao
Shen Nie
Kaiwen Xue
Yue Cao
Chongxuan Li
Hang Su
Jun Zhu
VLM
187
365
0
25 Sep 2022
Implementing and Experimenting with Diffusion Models for Text-to-Image
  Generation
Implementing and Experimenting with Diffusion Models for Text-to-Image Generation
Robin Zbinden
42
3
0
22 Sep 2022
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story
  Continuation
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation
A. Maharana
Darryl Hannan
Joey Tianyi Zhou
DiffM
112
83
0
13 Sep 2022
ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation
Zhengzhe Liu
Peng Dai
Ruihui Li
Xiaojuan Qi
Chi-Wing Fu
DiffM
245
25
0
09 Sep 2022
Text-Free Learning of a Natural Language Interface for Pretrained Face
  Generators
Text-Free Learning of a Natural Language Interface for Pretrained Face Generators
Xiaodan Du
Raymond A. Yeh
Nicholas I. Kolkin
Eli Shechtman
Gregory Shakhnarovich
CLIP
64
1
0
08 Sep 2022
Lightweight Long-Range Generative Adversarial Networks
Lightweight Long-Range Generative Adversarial Networks
Bowen Li
Thomas Lukasiewicz
GAN
66
4
0
08 Sep 2022
Sporthesia: Augmenting Sports Videos Using Natural Language
Sporthesia: Augmenting Sports Videos Using Natural Language
Zhutian Chen
Qisen Yang
Xiao Xie
Johanna Beyer
Haijun Xia
Yingnian Wu
Hanspeter Pfister
DiffM
106
39
0
07 Sep 2022
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based
  Cross-Modal Generation
AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
Yi Ma
Huan Yang
Bei Liu
Jianlong Fu
Jiaying Liu
DiffMMLLM
52
10
0
07 Sep 2022
Cross Modal Compression: Towards Human-comprehensible Semantic
  Compression
Cross Modal Compression: Towards Human-comprehensible Semantic Compression
Jiguo Li
Chuanmin Jia
Xinfeng Zhang
Siwei Ma
Wen Gao
40
21
0
06 Sep 2022
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for
  Text-to-Image Generation
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
Mengqi Huang
Zhendong Mao
Penghui Wang
Quang Wang
Yongdong Zhang
68
21
0
03 Sep 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
92
97
0
29 Aug 2022
Unsupervised Structure-Consistent Image-to-Image Translation
Unsupervised Structure-Consistent Image-to-Image Translation
Shima Shahfar
Charalambos (Charis) Poullis
61
0
0
24 Aug 2022
Vision-Language Matching for Text-to-Image Synthesis via Generative
  Adversarial Networks
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks
Qingrong Cheng
Keyu Wen
X. Gu
VLMEGVM
64
17
0
20 Aug 2022
T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency
  and Manifold Mix-Up
T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-Up
Deyin Liu
Yang Wang
Q. Tian
Zongyuan Ge
DiffM
126
5
0
18 Aug 2022
Text-to-Image Generation via Implicit Visual Guidance and Hypernetwork
Text-to-Image Generation via Implicit Visual Guidance and Hypernetwork
Xin Yuan
Zhe Lin
Jason Kuen
Jianming Zhang
John Collomosse
96
5
0
17 Aug 2022
Understanding Attention for Vision-and-Language Tasks
Understanding Attention for Vision-and-Language Tasks
Feiqi Cao
S. Han
Siqu Long
Changwei Xu
Josiah Poon
84
5
0
17 Aug 2022
Memory-Driven Text-to-Image Generation
Memory-Driven Text-to-Image Generation
Bowen Li
Philip Torr
Thomas Lukasiewicz
DiffM
79
12
0
15 Aug 2022
Layout-Bridging Text-to-Image Synthesis
Layout-Bridging Text-to-Image Synthesis
Jiadong Liang
Wenjie Pei
Feng Lu
EGVM
98
15
0
12 Aug 2022
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal
  Fashion Design
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
Xujie Zhang
Yuyang Sha
Michael C. Kampffmeyer
Zhenyu Xie
Zequn Jie
Chengwen Huang
Jianqing Peng
Xiaodan Liang
97
20
0
11 Aug 2022
Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern
  Hopfield Networks
Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks
Yonghao Xu
Weikang Yu
Pedram Ghamisi
Michael K Kopp
Sepp Hochreiter
66
34
0
08 Aug 2022
Word-Level Fine-Grained Story Visualization
Word-Level Fine-Grained Story Visualization
Bowen Li
Thomas Lukasiewicz
DiffM3DH
129
27
0
03 Aug 2022
An Image is Worth One Word: Personalizing Text-to-Image Generation using
  Textual Inversion
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Rinon Gal
Yuval Alaluf
Yuval Atzmon
Or Patashnik
Amit H. Bermano
Gal Chechik
Daniel Cohen-Or
176
1,905
0
02 Aug 2022
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model
Rao Fu
Xiaoyu Zhan
Yiwen Chen
Daniel E. Ritchie
Srinath Sridhar
125
79
0
19 Jul 2022
Rethinking Super-Resolution as Text-Guided Details Generation
Rethinking Super-Resolution as Text-Guided Details Generation
Chenxi Ma
Bo Yan
Qing Lin
Weimin Tan
Siming Chen
91
6
0
14 Jul 2022
Towards Counterfactual Image Manipulation via CLIP
Towards Counterfactual Image Manipulation via CLIP
Yingchen Yu
Fangneng Zhan
Rongliang Wu
Jiahui Zhang
Shijian Lu
Miaomiao Cui
Xuansong Xie
Xiansheng Hua
Chunyan Miao
CLIP
133
33
0
06 Jul 2022
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of
  3D Human Motions and Texts
TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts
Chuan Guo
Xinxin Xuo
Sen Wang
Li Cheng
VGen
191
244
0
04 Jul 2022
Transforming Image Generation from Scene Graphs
Transforming Image Generation from Scene Graphs
Renato Sortino
S. Palazzo
C. Spampinato
ViT
64
2
0
01 Jul 2022
A Fast Text-Driven Approach for Generating Artistic Content
A Fast Text-Driven Approach for Generating Artistic Content
M. Lupascu
Ryan Murdock
Ionut Mironica
Yijun Li
34
1
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
280
1,134
0
22 Jun 2022
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis
Minguk Kang
Joonghyuk Shin
Jaesik Park
EGVM
66
70
0
19 Jun 2022
Discrete Contrastive Diffusion for Cross-Modal Music and Image
  Generation
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Ye Zhu
Yuehua Wu
Kyle Olszewski
Jian Ren
Sergey Tulyakov
Yan Yan
DiffM
107
49
0
15 Jun 2022
Write and Paint: Generative Vision-Language Models are Unified Modal
  Learners
Write and Paint: Generative Vision-Language Models are Unified Modal Learners
Shizhe Diao
Wangchunshu Zhou
Xinsong Zhang
Jiawei Wang
MLLMAI4CE
95
17
0
15 Jun 2022
Intra-agent speech permits zero-shot task acquisition
Intra-agent speech permits zero-shot task acquisition
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
97
9
0
07 Jun 2022
Blended Latent Diffusion
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
180
393
0
06 Jun 2022
ContraCLIP: Interpretable GAN generation driven by pairs of contrasting
  sentences
ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences
Christos Tzelepis
James Oldfield
Georgios Tzimiropoulos
Ioannis Patras
53
16
0
05 Jun 2022
Compositional Visual Generation with Composable Diffusion Models
Compositional Visual Generation with Composable Diffusion Models
Nan Liu
Shuang Li
Yilun Du
Antonio Torralba
J. Tenenbaum
DiffMCoGe
213
530
0
03 Jun 2022
Style-Content Disentanglement in Language-Image Pretraining
  Representations for Zero-Shot Sketch-to-Image Synthesis
Style-Content Disentanglement in Language-Image Pretraining Representations for Zero-Shot Sketch-to-Image Synthesis
Jan Zuiderveld
DRL
64
1
0
03 Jun 2022
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
Ming Tao
Bingkun Bao
Hao Tang
Leilei Gan
Longhui Wei
Qi Tian
DiffM
80
15
0
02 Jun 2022
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
DiVAE: Photorealistic Images Synthesis with Denoising Diffusion Decoder
Jie Shi
Chenfei Wu
Jian Liang
Xiang Liu
Nan Duan
DiffM
81
26
0
01 Jun 2022
VALHALLA: Visual Hallucination for Machine Translation
VALHALLA: Visual Hallucination for Machine Translation
Yi Li
Yikang Shen
Yoon Kim
Chun-Fu Chen
Rogerio Feris
David D. Cox
Nuno Vasconcelos
MLLM
141
40
0
31 May 2022
Text2Human: Text-Driven Controllable Human Image Generation
Text2Human: Text-Driven Controllable Human Image Generation
Yuming Jiang
Shuai Yang
Haonan Qiu
Wayne Wu
Chen Change Loy
Ziwei Liu
DiffM
176
48
0
31 May 2022
Looks Like Magic: Transfer Learning in GANs to Generate New Card
  Illustrations
Looks Like Magic: Transfer Learning in GANs to Generate New Card Illustrations
Matheus K. Venturelli
P. H. Gomes
Jonatas Wehrmann
GAN
49
1
0
28 May 2022
Mutual Information Divergence: A Unified Metric for Multimodal
  Generative Models
Mutual Information Divergence: A Unified Metric for Multimodal Generative Models
Jin-Hwa Kim
Yunji Kim
Jiyoung Lee
Kang Min Yoo
Sang-Woo Lee
EGVM
108
35
0
25 May 2022
Text-to-Face Generation with StyleGAN2
Text-to-Face Generation with StyleGAN2
D. M. A. Ayanthi
Sarasi Munasinghe
CVBM
48
5
0
25 May 2022
Previous
123...8910...151617
Next