ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.10485
  4. Cited By
AttnGAN: Fine-Grained Text to Image Generation with Attentional
  Generative Adversarial Networks

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

28 November 2017
Tao Xu
Pengchuan Zhang
Qiuyuan Huang
Han Zhang
Zhe Gan
Xiaolei Huang
Xiaodong He
    GANViT
ArXiv (abs)PDFHTML

Papers citing "AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks"

50 / 822 papers shown
Title
Glaze: Protecting Artists from Style Mimicry by Text-to-Image Models
Glaze: Protecting Artists from Style Mimicry by Text-to-Image Models
Shawn Shan
Jenna Cryan
Emily Wenger
Haitao Zheng
Rana Hanocka
Ben Y. Zhao
WIGM
80
189
0
08 Feb 2023
Zero-shot Generation of Coherent Storybook from Plain Text Story using
  Diffusion Models
Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models
Hyeonho Jeong
Gihyun Kwon
Jong Chul Ye
77
23
0
08 Feb 2023
ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion
  Trajectories
ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories
Zijian Zhang
Zhou Zhao
Jun Yu
Qi Tian
DiffM
45
14
0
05 Feb 2023
Multimodality Representation Learning: A Survey on Evolution,
  Pretraining and Its Applications
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications
Muhammad Arslan Manzoor
S. Albarri
Ziting Xian
Zaiqiao Meng
Preslav Nakov
Shangsong Liang
AI4TS
101
32
0
01 Feb 2023
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image
  Diffusion Models
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models
Hila Chefer
Yuval Alaluf
Yael Vinker
Lior Wolf
Daniel Cohen-Or
DiffM
193
520
0
31 Jan 2023
Shape-aware Text-driven Layered Video Editing
Shape-aware Text-driven Layered Video Editing
Yao-Chih Lee
Ji-Ze Jang
Yi-Ting Chen
Elizabeth Qiu
Jia-Bin Huang
VGenDiffM
87
54
0
30 Jan 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffMVLM
117
109
0
30 Jan 2023
Face Generation from Textual Features using Conditionally Trained Inputs
  to Generative Adversarial Networks
Face Generation from Textual Features using Conditionally Trained Inputs to Generative Adversarial Networks
Sandeep Shinde
Tejas Pradhan
Aniket Ghorpade
Mihir Tale
GANCVBM
49
3
0
22 Jan 2023
GLIGEN: Open-Set Grounded Text-to-Image Generation
GLIGEN: Open-Set Grounded Text-to-Image Generation
Yuheng Li
Haotian Liu
Qingyang Wu
Fangzhou Mu
Jianwei Yang
Jianfeng Gao
Chunyuan Li
Yong Jae Lee
VLM
148
603
1
17 Jan 2023
An Impartial Transformer for Story Visualization
An Impartial Transformer for Story Visualization
N. Tsakas
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ViT
92
3
0
09 Jan 2023
ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions
ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions
Aashish Anantha Ramakrishnan
Sharon X. Huang
Dongwon Lee
108
6
0
05 Jan 2023
FICE: Text-Conditioned Fashion Image Editing With Guided GAN Inversion
FICE: Text-Conditioned Fashion Image Editing With Guided GAN Inversion
Martin Pernuš
Clinton Fookes
Vitomir Štruc
Simon Dobrišek
DiffM
84
29
0
05 Jan 2023
Attribute-Centric Compositional Text-to-Image Generation
Attribute-Centric Compositional Text-to-Image Generation
Yuren Cong
Martin Renqiang Min
Erran L. Li
Bodo Rosenhahn
M. Yang
114
13
0
04 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
278
560
0
02 Jan 2023
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and
  Text-to-Image Diffusion Models
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
Jiale Xu
Xintao Wang
Weihao Cheng
Yan-Pei Cao
Ying Shan
Xiaohu Qie
Shenghua Gao
260
165
0
28 Dec 2022
Benchmarking Spatial Relationships in Text-to-Image Generation
Benchmarking Spatial Relationships in Text-to-Image Generation
Tejas Gokhale
Hamid Palangi
Besmira Nushi
Vibhav Vineet
Eric Horvitz
Ece Kamar
Chitta Baral
Yezhou Yang
EGVM
116
72
0
20 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
233
877
0
08 Dec 2022
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
Zhixing Zhang
Ligong Han
Arna Ghosh
Dimitris N. Metaxas
Jian Ren
DiffM
186
160
0
08 Dec 2022
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
Zutao Jiang
Guangsong Lu
Xiaodan Liang
Jihua Zhu
Wei Zhang
Xiaojun Chang
Hang Xu
DiffM
79
8
0
02 Dec 2022
CLIP2GAN: Towards Bridging Text with the Latent Space of GANs
CLIP2GAN: Towards Bridging Text with the Latent Space of GANs
Yixuan Wang
Wen-gang Zhou
Jianmin Bao
Weilun Wang
Li Li
Houqiang Li
GANCLIP
62
6
0
28 Nov 2022
Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Unified Discrete Diffusion for Simultaneous Vision-Language Generation
Minghui Hu
Chuanxia Zheng
Heliang Zheng
Tat-Jen Cham
Chaoyue Wang
Zuopeng Yang
Dacheng Tao
Ponnuthurai Nagaratnam Suganthan
DiffM
131
26
0
27 Nov 2022
SpaText: Spatio-Textual Representation for Controllable Image Generation
SpaText: Spatio-Textual Representation for Controllable Image Generation
Omri Avrahami
Thomas Hayes
Oran Gafni
Sonal Gupta
Yaniv Taigman
Devi Parikh
Dani Lischinski
Ohad Fried
Xiaoyue Yin
DiffM
133
210
0
25 Nov 2022
Interactive Image Manipulation with Complex Text Instructions
Interactive Image Manipulation with Complex Text Instructions
Ryugo Morita
Zhiqiang Zhang
Man M. Ho
Jinjia Zhou
DiffM
77
3
0
25 Nov 2022
TPA-Net: Generate A Dataset for Text to Physics-based Animation
TPA-Net: Generate A Dataset for Text to Physics-based Animation
Yuxing Qiu
Feng Gao
Minchen Li
Govind Thattai
Yin Yang
Chenfanfu Jiang
PINNDiffMVGen
58
0
0
25 Nov 2022
Shifted Diffusion for Text-to-image Generation
Shifted Diffusion for Text-to-image Generation
Yufan Zhou
Bingchen Liu
Yizhe Zhu
Xiao Yang
Changyou Chen
Jinhui Xu
DiffM
137
45
0
24 Nov 2022
ReCo: Region-Controlled Text-to-Image Generation
ReCo: Region-Controlled Text-to-Image Generation
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Linjie Li
Kevin Qinghong Lin
...
Nan Duan
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
DiffM
105
150
0
23 Nov 2022
Tell Me What Happened: Unifying Text-guided Video Completion via
  Multimodal Masked Video Generation
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
Tsu-Jui Fu
Licheng Yu
Ning Zhang
Cheng-Yang Fu
Jong-Chyi Su
William Yang Wang
Sean Bell
VGen
148
38
0
23 Nov 2022
Video Background Music Generation: Dataset, Method and Evaluation
Video Background Music Generation: Dataset, Method and Evaluation
Le Zhuo
Zhaokai Wang
Baisen Wang
Yue Liao
Chenxi Bao
Stanley Peng
Miao Lu
Xiaobo Li
Fei Fang
Si Liu
VGen
89
31
0
21 Nov 2022
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Xichen Pan
Pengda Qin
Yuhong Li
Hui Xue
Wenhu Chen
DiffM
97
65
0
20 Nov 2022
A survey on knowledge-enhanced multimodal learning
A survey on knowledge-enhanced multimodal learning
Maria Lymperaiou
Giorgos Stamou
159
15
0
19 Nov 2022
How to train your draGAN: A task oriented solution to imbalanced
  classification
How to train your draGAN: A task oriented solution to imbalanced classification
Leon O. Guertler
Andri Ashfahani
Anh Tuan Luu
DiffMSyDa
45
1
0
18 Nov 2022
Extreme Generative Image Compression by Learning Text Embedding from
  Diffusion Models
Extreme Generative Image Compression by Learning Text Embedding from Diffusion Models
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
75
23
0
14 Nov 2022
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image
  Generation
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
62
12
0
14 Nov 2022
Learning to Model Multimodal Semantic Alignment for Story Visualization
Learning to Model Multimodal Semantic Alignment for Story Visualization
Bowen Li
Thomas Lukasiewicz
DiffM
83
2
0
14 Nov 2022
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Taehoon Kim
Mark A Marsden
Pyunghwan Ahn
Sangyun Kim
Sihaeng Lee
Alessandra Sala
S. Kim
VLM
62
4
0
13 Nov 2022
SSGVS: Semantic Scene Graph-to-Video Synthesis
SSGVS: Semantic Scene Graph-to-Video Synthesis
Yuren Cong
Jinhui Yi
Bodo Rosenhahn
M. Yang
135
8
0
11 Nov 2022
Disentangling Content and Motion for Text-Based Neural Video
  Manipulation
Disentangling Content and Motion for Text-Based Neural Video Manipulation
Levent Karacan
Tolga Kerimouglu
.Ismail .Inan
Tolga Birdal
Erkut Erdem
Aykut Erdem
90
1
0
05 Nov 2022
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal
  Guidance
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance
Wei Li
Xue Xu
Xinyan Xiao
Jiacheng Liu
Hu Yang
...
Zhanpeng Wang
Zhifan Feng
Qiaoqiao She
Yajuan Lyu
Hua Wu
232
30
0
28 Oct 2022
SSD: Towards Better Text-Image Consistency Metric in Text-to-Image
  Generation
SSD: Towards Better Text-Image Consistency Metric in Text-to-Image Generation
Zhaorui Tan
Xi Yang
Zihan Ye
Qiufeng Wang
Yuyao Yan
Anh Nguyen
Kaizhu Huang
EGVM
80
3
0
27 Oct 2022
Cover Reproducible Steganography via Deep Generative Models
Cover Reproducible Steganography via Deep Generative Models
Kejiang Chen
Hang Zhou
Yaofei Wang
Meng Li
Weiming Zhang
Neng H. Yu
DiffM
63
13
0
26 Oct 2022
Guiding Users to Where to Give Color Hints for Efficient Interactive
  Sketch Colorization via Unsupervised Region Prioritization
Guiding Users to Where to Give Color Hints for Efficient Interactive Sketch Colorization via Unsupervised Region Prioritization
Youngin Cho
Junsoo Lee
Soyoung Yang
Juntae Kim
Yeojeong Park
Haneol Lee
Mohammad Azam Khan
Daesik Kim
Jaegul Choo
FAtt
79
2
0
25 Oct 2022
Lafite2: Few-shot Text-to-Image Generation
Lafite2: Few-shot Text-to-Image Generation
Yufan Zhou
Chunyuan Li
Changyou Chen
Jianfeng Gao
Jinhui Xu
DiffM
108
11
0
25 Oct 2022
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
3DALL-E: Integrating Text-to-Image AI in 3D Design Workflows
Vivian Liu
Jo Vermeulen
G. Fitzmaurice
Justin Matejka
HAI
88
126
0
20 Oct 2022
Commonsense Knowledge from Scene Graphs for Textual Environments
Commonsense Knowledge from Scene Graphs for Textual Environments
Tsunehiko Tanaka
Daiki Kimura
Michiaki Tatsubori
66
2
0
19 Oct 2022
Character-Centric Story Visualization via Visual Planning and Token
  Alignment
Character-Centric Story Visualization via Visual Planning and Token Alignment
Hong Chen
Rujun Han
Te-Lin Wu
Hideki Nakayama
Nanyun Peng
DiffMVGen
94
32
0
16 Oct 2022
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation
  with Semantic Modulations
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Yi-Chun Zhu
Hongyu Liu
Yibing Song
Ziyang Yuan
Xintong Han
Chun Yuan
Qifeng Chen
Jue Wang
VLMDiffM
113
32
0
14 Oct 2022
Style-Guided Inference of Transformer for High-resolution Image
  Synthesis
Style-Guided Inference of Transformer for High-resolution Image Synthesis
Jonghwa Yim
Minjae Kim
ViT
103
0
0
11 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling
Markup-to-Image Diffusion Models with Scheduled Sampling
Yuntian Deng
Noriyuki Kojima
Alexander M. Rush
DiffM
86
4
0
11 Oct 2022
HORIZON: High-Resolution Semantically Controlled Panorama Synthesis
HORIZON: High-Resolution Semantically Controlled Panorama Synthesis
Kun Yan
Lei Ji
Chenfei Wu
Jian Liang
Ming Zhou
Nan Duan
Shuai Ma
73
0
0
10 Oct 2022
Distill the Image to Nowhere: Inversion Knowledge Distillation for
  Multimodal Machine Translation
Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation
Ru Peng
Yawen Zeng
Jiaqi Zhao
75
18
0
10 Oct 2022
Previous
123...789...151617
Next