ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.10485
  4. Cited By
AttnGAN: Fine-Grained Text to Image Generation with Attentional
  Generative Adversarial Networks

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

28 November 2017
Tao Xu
Pengchuan Zhang
Qiuyuan Huang
Han Zhang
Zhe Gan
Xiaolei Huang
Xiaodong He
    GANViT
ArXiv (abs)PDFHTML

Papers citing "AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks"

50 / 822 papers shown
Title
A-STAR: Test-time Attention Segregation and Retention for Text-to-image
  Synthesis
A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis
Aishwarya Agarwal
Srikrishna Karanam
K. J. Joseph
Apoorv Saxena
Koustava Goswami
Balaji Vasan Srinivasan
VLMDiffM
39
51
0
26 Jun 2023
Towards Unseen Triples: Effective Text-Image-joint Learning for Scene
  Graph Generation
Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation
Qianji Di
Wenxing Ma
Zhongang Qi
Tianxiang Hou
Ying Shan
Hanzi Wang
52
0
0
23 Jun 2023
Ambigram Generation by A Diffusion Model
Ambigram Generation by A Diffusion Model
T. Shirakawa
Seiichi Uchida
DiffM
57
4
0
21 Jun 2023
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy
  Guided Reinforcement Learning
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
N. Yuan
Jian Yin
Hongyang Chao
Qi Zhang
EGVM
152
10
0
20 Jun 2023
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image
  Editing
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
Kai Zhang
Lingbo Mo
Wenhu Chen
Huan Sun
Yu-Chuan Su
EGVM
226
277
0
16 Jun 2023
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal
  Contrastive Training
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training
Chong Liu
Yuqi Zhang
Hongsong Wang
Weihua Chen
F. Wang
Yan Huang
Yixing Shen
Liang Wang
73
28
0
15 Jun 2023
Diffusion in Diffusion: Cyclic One-Way Diffusion for
  Text-Vision-Conditioned Generation
Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation
Ruoyu Wang
Yongqi Yang
Zhihao Qian
Ye Zhu
Yuehua Wu
DiffM
97
14
0
14 Jun 2023
AutoML in the Age of Large Language Models: Current Challenges, Future
  Opportunities and Risks
AutoML in the Age of Large Language Models: Current Challenges, Future Opportunities and Risks
Alexander Tornede
Difan Deng
Theresa Eimer
Joseph Giovanelli
Aditya Mohan
...
Sarah Segel
Daphne Theodorakopoulos
Tanja Tornede
Henning Wachsmuth
Marius Lindauer
119
24
0
13 Jun 2023
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Shuai Yang
Yifan Zhou
Ziwei Liu
Chen Change Loy
VGenDiffM
102
221
0
13 Jun 2023
Generative Watermarking Against Unauthorized Subject-Driven Image
  Synthesis
Generative Watermarking Against Unauthorized Subject-Driven Image Synthesis
Yi Ma
Zhengyu Zhao
Xinlei He
Zheng Li
Michael Backes
Yang Zhang
AAMLWIGM
90
21
0
13 Jun 2023
Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing
  with Pre-Trained Diffusion Model
Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model
Xinyu Zhang
Jiaxian Guo
Paul D. Yoo
Yutaka Matsuo
Yusuke Iwasawa
DiffM
108
22
0
13 Jun 2023
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Controlling Text-to-Image Diffusion by Orthogonal Finetuning
Zeju Qiu
Wei-yu Liu
Haiwen Feng
Yuxuan Xue
Yao Feng
Zhen Liu
Dan Zhang
Adrian Weller
Bernhard Schölkopf
DiffM
126
158
0
12 Jun 2023
Improving Tuning-Free Real Image Editing with Proximal Guidance
Improving Tuning-Free Real Image Editing with Proximal Guidance
Ligong Han
Song Wen
Qi Chen
Zhixing Zhang
Kunpeng Song
...
Qilong Zhangli
Jindong Jiang
Zhaoyang Xia
Akash Srivastava
Dimitris N. Metaxas
DiffM
115
63
0
08 Jun 2023
AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment
AGIQA-3K: An Open Database for AI-Generated Image Quality Assessment
Chunyi Li
Zicheng Zhang
Haoning Wu
Wei Sun
Xiongkuo Min
Xiaohong Liu
Guangtao Zhai
Weisi Lin
EGVM
82
124
0
07 Jun 2023
Object Detection with Transformers: A Review
Object Detection with Transformers: A Review
Tahira Shehzadi
K. Hashmi
D. Stricker
Muhammad Zeshan Afzal
ViTMU
102
29
0
07 Jun 2023
Efficient Text-Guided 3D-Aware Portrait Generation with Score
  Distillation Sampling on Distribution
Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution
Yiji Cheng
Fei Yin
Xiaoke Huang
Xintong Yu
Jiaxiang Liu
Shi Feng
Yujiu Yang
Yansong Tang
DiffM
76
5
0
03 Jun 2023
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion
  Models
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models
Chang-rui Liu
Haoning Wu
Yujie Zhong
Xiaoyu Zhang
Yanfeng Wang
Weidi Xie
DiffMVLM
154
44
0
01 Jun 2023
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image
  Generation
ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation
Shaozhe Hao
Kai Han
Shihao Zhao
Kwan-Yee K. Wong
88
10
0
01 Jun 2023
The Hidden Language of Diffusion Models
The Hidden Language of Diffusion Models
Hila Chefer
Oran Lang
Mor Geva
Volodymyr Polosukhin
Assaf Shocher
Michal Irani
Inbar Mosseri
Lior Wolf
DiffM
117
27
0
01 Jun 2023
Differential Diffusion: Giving Each Pixel Its Strength
Differential Diffusion: Giving Each Pixel Its Strength
E. Levin
Ohad Fried
DiffM
88
21
0
01 Jun 2023
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine
  Semantic Re-alignment
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment
Guian Fang
Zutao Jiang
Jianhua Han
Guangsong Lu
Hang Xu
Shengcai Liao
Xiaodan Liang
EGVM
67
1
0
31 May 2023
Translation-Enhanced Multilingual Text-to-Image Generation
Translation-Enhanced Multilingual Text-to-Image Generation
Yaoyiran Li
Ching-Yun Chang
Stephen Rawls
Ivan Vulić
Anna Korhonen
57
8
0
30 May 2023
Gen-L-Video: Multi-Text to Long Video Generation via Temporal
  Co-Denoising
Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Fu Lee Wang
Wenshuo Chen
Guanglu Song
Han-Jia Ye
Yu Liu
Hongsheng Li
VGenDiffM
117
93
0
29 May 2023
TaleCrafter: Interactive Story Visualization with Multiple Characters
TaleCrafter: Interactive Story Visualization with Multiple Characters
Yuan Gong
Youxin Pang
Xiaodong Cun
Menghan Xia
Yingqing He
...
Longyue Wang
Yong Zhang
Xintao Wang
Ying Shan
Yujiu Yang
DiffM
110
50
0
29 May 2023
Text-to-image Editing by Image Information Removal
Text-to-image Editing by Image Information Removal
Zhongping Zhang
Jian Zheng
Jacob Zhiyuan Fang
Bryan A. Plummer
DiffM
97
13
0
27 May 2023
Generating Images with Multimodal Language Models
Generating Images with Multimodal Language Models
Jing Yu Koh
Daniel Fried
Ruslan Salakhutdinov
MLLM
162
259
0
26 May 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
Shihao Zhao
Dongdong Chen
Yen-Chun Chen
Jianmin Bao
Shaozhe Hao
Lu Yuan
Kwan-Yee K. Wong
115
268
0
25 May 2023
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Omri Avrahami
Kfir Aberman
Ohad Fried
Daniel Cohen-Or
Dani Lischinski
VLMDiffM
106
178
0
25 May 2023
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of
  Diffusion Models
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models
Yuxin Zhang
Weiming Dong
Fan Tang
Nisha Huang
Haibin Huang
Chongyang Ma
Tong-Yee Lee
Oliver Deussen
Changsheng Xu
DiffM
99
81
0
25 May 2023
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion
  Models
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
Xingqian Xu
Jiayi Guo
Zhangyang Wang
Gao Huang
Irfan Essa
Humphrey Shi
VLMDiffM
127
61
0
25 May 2023
T2TD: Text-3D Generation Model based on Prior Knowledge Guidance
T2TD: Text-3D Generation Model based on Prior Knowledge Guidance
Weizhi Nie
Ruidong Chen
Weijie Wang
Bruno Lepri
N. Sebe
74
6
0
25 May 2023
Visual Programming for Text-to-Image Generation and Evaluation
Visual Programming for Text-to-Image Generation and Evaluation
Jaemin Cho
Abhaysinh Zala
Joey Tianyi Zhou
MLLM
119
51
0
24 May 2023
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion
  Models
DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models
Sungnyun Kim
Junsoo Lee
Kibeom Hong
Daesik Kim
Namhyuk Ahn
DiffM
88
15
0
24 May 2023
Vision + Language Applications: A Survey
Vision + Language Applications: A Survey
Yutong Zhou
N. Shimada
VLM
117
7
0
24 May 2023
Text-guided 3D Human Generation from 2D Collections
Text-guided 3D Human Generation from 2D Collections
Tsu-Jui Fu
Wenhan Xiong
Yixin Nie
Jingyu Liu
Barlas Ouguz
William Yang Wang
84
2
0
23 May 2023
Not All Image Regions Matter: Masked Vector Quantization for
  Autoregressive Image Generation
Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation
Mengqi Huang
Zhendong Mao
Quang Wang
Yongdong Zhang
VGenDiffM
121
23
0
23 May 2023
Enhancing Detail Preservation for Customized Text-to-Image Generation: A
  Regularization-Free Approach
Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach
Yufan Zhou
Ruiyi Zhang
Tongfei Sun
Jinhui Xu
DiffM
144
40
0
23 May 2023
SneakyPrompt: Jailbreaking Text-to-image Generative Models
SneakyPrompt: Jailbreaking Text-to-image Generative Models
Yuchen Yang
Bo Hui
Haolin Yuan
Neil Zhenqiang Gong
Yinzhi Cao
EGVM
187
93
0
20 May 2023
Data Redaction from Conditional Generative Models
Data Redaction from Conditional Generative Models
Zhifeng Kong
Kamalika Chaudhuri
KELM
77
7
0
18 May 2023
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Wenjing Wang
Huan Yang
Zixi Tuo
Huiguo He
Sitong Su
Jianlong Fu
Jiaying Liu
DiffMVGen
155
117
0
18 May 2023
Fusion-S2iGan: An Efficient and Effective Single-Stage Framework for
  Speech-to-Image Generation
Fusion-S2iGan: An Efficient and Effective Single-Stage Framework for Speech-to-Image Generation
Zhenxing Zhang
Lambert Schomaker
46
3
0
17 May 2023
Interactive Fashion Content Generation Using LLMs and Latent Diffusion
  Models
Interactive Fashion Content Generation Using LLMs and Latent Diffusion Models
Krishna Sri Ipsit Mantri
Nevasini Sasikumar
DiffM
63
2
0
15 May 2023
Vision-Language Models in Remote Sensing: Current Progress and Future
  Trends
Vision-Language Models in Remote Sensing: Current Progress and Future Trends
Xiang Li
Congcong Wen
Yuan Hu
Zhenghang Yuan
Xiao Xiang Zhu
VLM
82
82
0
09 May 2023
Learning to Generate Poetic Chinese Landscape Painting with Calligraphy
Learning to Generate Poetic Chinese Landscape Painting with Calligraphy
Shaozu Yuan
Aijun Dai
Zhiling Yan
Ruixue Liu
Meng Chen
Baoyang Chen
Zhijie Qiu
Xiaodong He
95
7
0
08 May 2023
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven
  Text-to-Image Generation
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation
Hong Chen
Yipeng Zhang
Simin Wu
Xin Eric Wang
Xuguang Duan
Yuwei Zhou
Wenwu Zhu
DiffM
110
51
0
05 May 2023
High-Fidelity 3D Face Generation from Natural Language Descriptions
High-Fidelity 3D Face Generation from Natural Language Descriptions
Menghua Wu
Hao Zhu
Linjiang Huang
Yi Zhuang
Yuanxun Lu
Xun Cao
3DHCVBM
82
32
0
05 May 2023
Multi-Modality Deep Network for JPEG Artifacts Reduction
Multi-Modality Deep Network for JPEG Artifacts Reduction
Xuhao Jiang
Weimin Tan
Qing Lin
Chenxi Ma
Bo Yan
Liquan Shen
82
2
0
04 May 2023
Catch Missing Details: Image Reconstruction with Frequency Augmented
  Variational Autoencoder
Catch Missing Details: Image Reconstruction with Frequency Augmented Variational Autoencoder
Xinmiao Lin
Yikang Li
Jenhao Hsiao
C. Ho
Yu Kong
138
20
0
04 May 2023
Few-shot Domain-Adaptive Visually-fused Event Detection from Text
Few-shot Domain-Adaptive Visually-fused Event Detection from Text
Farhad Moghimifar
Fatemeh Shiri
Van Nguyen
Gholamreza Haffari
Yuanyou Li
VLM
68
2
0
04 May 2023
Multimodal Procedural Planning via Dual Text-Image Prompting
Multimodal Procedural Planning via Dual Text-Image Prompting
Yujie Lu
Pan Lu
Zhiyu Zoey Chen
Wanrong Zhu
Xinze Wang
William Yang Wang
LM&Ro
128
45
0
02 May 2023
Previous
123...567...151617
Next