ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.10485
  4. Cited By
AttnGAN: Fine-Grained Text to Image Generation with Attentional
  Generative Adversarial Networks

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

28 November 2017
Tao Xu
Pengchuan Zhang
Qiuyuan Huang
Han Zhang
Zhe Gan
Xiaolei Huang
Xiaodong He
    GANViT
ArXiv (abs)PDFHTML

Papers citing "AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks"

50 / 822 papers shown
Title
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image
  Quality Assessment
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment
Tianwei Zhou
Songbai Tan
Wei Zhou
Yu Luo
Yuan-Gen Wang
Guanghui Yue
EGVM
101
11
0
23 Apr 2024
Enhancing Prompt Following with Visual Control Through Training-Free
  Mask-Guided Diffusion
Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion
Hongyu Chen
Yi-Meng Gao
Min Zhou
Peng Wang
Xubin Li
Tiezheng Ge
Bo Zheng
DiffM
68
5
0
23 Apr 2024
MultiBooth: Towards Generating All Your Concepts in an Image from Text
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Chenyang Zhu
Kai Li
Yue Ma
Chunming He
Li Xiu
DiffM
241
29
0
22 Apr 2024
Iteratively Prompting Multimodal LLMs to Reproduce Natural and
  AI-Generated Images
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
Ali Naseh
Katherine Thai
Mohit Iyyer
Amir Houmansadr
90
7
0
21 Apr 2024
ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis
ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis
Aashish Anantha Ramakrishnan
Sharon X. Huang
Dongwon Lee
90
0
0
15 Apr 2024
Semantic Approach to Quantifying the Consistency of Diffusion Model
  Image Generation
Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation
Brinnae Bent
67
3
0
12 Apr 2024
StoryImager: A Unified and Efficient Framework for Coherent Story
  Visualization and Completion
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
Ming Tao
Bing-Kun Bao
Hao Tang
Yaowei Wang
Changsheng Xu
DiffM
91
8
0
09 Apr 2024
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion
  Guidance
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance
Dazhong Shen
Guanglu Song
Zeyue Xue
Fu-Yun Wang
Yu Liu
DiffM
91
18
0
08 Apr 2024
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise
  Optimization
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Xiefan Guo
Jinlin Liu
Miaomiao Cui
Jiankai Li
Hongyu Yang
Di Huang
102
38
0
06 Apr 2024
Real, fake and synthetic faces -- does the coin have three sides?
Real, fake and synthetic faces -- does the coin have three sides?
Shahzeb Naeem
Ramzi Al-Sharawi
Muhammad Riyyan Khan
Usman Tariq
Abhinav Dhall
H. Al-Nashash
90
1
0
02 Apr 2024
CLIP-VQDiffusion : Langauge Free Training of Text To Image generation
  using CLIP and vector quantized diffusion model
CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model
S. Han
Joohee Kim
DiffMCLIP
70
2
0
22 Mar 2024
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
Alberto Baldrati
Davide Morelli
Marcella Cornia
Marco Bertini
Rita Cucchiara
DiffM
81
8
0
21 Mar 2024
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific
  Adaptation
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Fu-Yun Wang
Xiaoshi Wu
Zhaoyang Huang
Xiaoyu Shi
Dazhong Shen
Guanglu Song
Yu Liu
Hongsheng Li
DiffM
74
14
0
20 Mar 2024
TiBiX: Leveraging Temporal Information for Bidirectional X-ray and
  Report Generation
TiBiX: Leveraging Temporal Information for Bidirectional X-ray and Report Generation
Santosh Sanjeev
F. Maani
Arsen Abzhanov
Vijay Ram Papineni
Ibrahim Almakky
Bartlomiej W. Papie.z
Mohammad Yaqub
MedIm
85
0
0
20 Mar 2024
Can AI Outperform Human Experts in Creating Social Media Creatives?
Can AI Outperform Human Experts in Creating Social Media Creatives?
Eunkyung Park
Raymond K. Wong
Junbum Kwon
62
0
0
19 Mar 2024
LogicalDefender: Discovering, Extracting, and Utilizing Common-Sense
  Knowledge
LogicalDefender: Discovering, Extracting, and Utilizing Common-Sense Knowledge
Yuhe Liu
Mengxue Kang
Zengchang Qin
Xiangxiang Chu
NAIVLM
56
0
0
18 Mar 2024
Desigen: A Pipeline for Controllable Design Template Generation
Desigen: A Pipeline for Controllable Design Template Generation
Haohan Weng
Danqing Huang
Yu Qiao
Zheng Hu
Chin-Yew Lin
Tong Zhang
Chong Chen
DiffM
67
17
0
14 Mar 2024
Masked Generative Story Transformer with Character Guidance and Caption
  Augmentation
Masked Generative Story Transformer with Character Guidance and Caption Augmentation
Christos Papadimitriou
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
DiffM
141
2
0
13 Mar 2024
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks
Aman Kumar
Khushboo Anand
Shubham Mandloi
Ashutosh Mishra
Avinash Thakur
Neeraj Kasera
Prathosh A P
77
4
0
13 Mar 2024
FaceChain-SuDe: Building Derived Class to Inherit Category Attributes
  for One-shot Subject-Driven Generation
FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation
Pengchong Qiao
Lei Shang
Chang-Shu Liu
Baigui Sun
Xiang Ji
Jie Chen
CVBM
59
3
0
11 Mar 2024
3D-aware Image Generation and Editing with Multi-modal Conditions
3D-aware Image Generation and Editing with Multi-modal Conditions
Bo Li
Yike Li
Zhien He
Bin Liu
Yu-Kun Lai
67
2
0
11 Mar 2024
Towards Effective Usage of Human-Centric Priors in Diffusion Models for
  Text-based Human Image Generation
Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
Junyan Wang
Zhenhong Sun
Zhiyu Tan
Xuanbai Chen
Weihua Chen
Hao Li
Cheng Zhang
Yang Song
96
12
0
08 Mar 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video
  Generation
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVMVGen
104
43
0
08 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Chak Tou Leong
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
87
8
0
07 Mar 2024
Neural Image Compression with Text-guided Encoding for both Pixel-level
  and Perceptual Fidelity
Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity
Hagyeong Lee
Minkyu Kim
Jun-Hyuk Kim
Seungeon Kim
Dokwan Oh
Jaeho Lee
DiffM
90
6
0
05 Mar 2024
Position: Towards Implicit Prompt For Text-To-Image Models
Position: Towards Implicit Prompt For Text-To-Image Models
Yue Yang
Yuqi Lin
Hong Liu
Wenqi Shao
Runjian Chen
Hailong Shang
Yu Wang
Yu Qiao
Kaipeng Zhang
Ping Luo
EGVMVLM
112
2
0
04 Mar 2024
MCA: Moment Channel Attention Networks
MCA: Moment Channel Attention Networks
Yangbo Jiang
Zhiwei Jiang
Le Han
Zenan Huang
Nenggan Zheng
37
3
0
04 Mar 2024
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Supreeth Narasimhaswamy
Uttaran Bhattacharya
Xiang Chen
Ishita Dasgupta
Saayan Mitra
Minh Hoai
DiffM
71
25
0
04 Mar 2024
Text-guided Explorable Image Super-resolution
Text-guided Explorable Image Super-resolution
Kanchana Vaishnavi Gandikota
Paramanand Chandramouli
113
8
0
02 Mar 2024
CustomSketching: Sketch Concept Extraction for Sketch-based Image
  Synthesis and Editing
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing
Chufeng Xiao
Hongbo Fu
DiffM
83
3
0
27 Feb 2024
Social Reward: Evaluating and Enhancing Generative AI through
  Million-User Feedback from an Online Creative Community
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
Arman Isajanyan
Artur Shatveryan
David Kocharyan
Zhangyang Wang
Humphrey Shi
EGVM
131
6
0
15 Feb 2024
AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal
  Conditioning
AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning
W. Para
Abdelrahman Eldesokey
Zhenyu Li
Pradyumna Reddy
Jiankang Deng
Peter Wonka
DiffM
74
0
0
08 Feb 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
Dewei Zhou
You Li
Fan Ma
Zongxin Yang
Yi Yang
DiffM
96
61
0
08 Feb 2024
InstanceDiffusion: Instance-level Control for Image Generation
InstanceDiffusion: Instance-level Control for Image Generation
Xudong Wang
Trevor Darrell
Sai Saketh Rambhatla
Rohit Girdhar
Ishan Misra
VLMDiffM
61
101
0
05 Feb 2024
Spatial-Aware Latent Initialization for Controllable Image Generation
Spatial-Aware Latent Initialization for Controllable Image Generation
Wenqiang Sun
Tengtao Li
Zehong Lin
Jun Zhang
94
11
0
29 Jan 2024
FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion
  Models
FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models
Feihong He
Gang Li
Mengyuan Zhang
Leilei Yan
Hui Xiong
Fanzhang Li
Li Shen
DiffM
88
15
0
28 Jan 2024
Explicitly Representing Syntax Improves Sentence-to-layout Prediction of
  Unexpected Situations
Explicitly Representing Syntax Improves Sentence-to-layout Prediction of Unexpected Situations
Wolf Nuyts
Ruben Cartuyvels
Marie-Francine Moens
119
1
0
25 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
97
4
0
21 Jan 2024
Inflation with Diffusion: Efficient Temporal Adaptation for
  Text-to-Video Super-Resolution
Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution
Xin Yuan
Jinoo Baek
Keyang Xu
Omer Tov
Hongliang Fei
VGen
65
4
0
18 Jan 2024
Instilling Multi-round Thinking to Text-guided Image Generation
Instilling Multi-round Thinking to Text-guided Image Generation
Lidong Zeng
Zhedong Zheng
Yinwei Wei
Tat-Seng Chua
108
5
0
16 Jan 2024
Towards Efficient Diffusion-Based Image Editing with Instant Attention
  Masks
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Siyu Zou
Jiji Tang
Yiyi Zhou
Jing He
Chaoyi Zhao
Rongsheng Zhang
Zhipeng Hu
Xiaoshuai Sun
111
11
0
15 Jan 2024
Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with
  Large Language Models
Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models
Dingning Liu
Xiaoshui Huang
Yuenan Hou
Zhihui Wang
Zhen-fei Yin
Yongshun Gong
Peng Gao
Wanli Ouyang
49
11
0
09 Jan 2024
TIER: Text-Image Encoder-based Regression for AIGC Image Quality
  Assessment
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Jinming Che
Qinyuan Wang
Sen Liang
Wei Ren
Jinlong Lin
Xixin Cao
EGVM
49
1
0
08 Jan 2024
Deep Learning-based Image and Video Inpainting: A Survey
Deep Learning-based Image and Video Inpainting: A Survey
Weize Quan
Jiaxi Chen
Yanli Liu
Dong-Ming Yan
Peter Wonka
3DV
78
38
0
07 Jan 2024
Improving Diffusion-Based Image Synthesis with Context Prediction
Improving Diffusion-Based Image Synthesis with Context Prediction
Ling Yang
Jingwei Liu
Shenda Hong
Zhilong Zhang
Zhilin Huang
Zheming Cai
Wentao Zhang
Tengjiao Wang
DiffM
89
36
0
04 Jan 2024
Semantic Guidance Tuning for Text-To-Image Diffusion Models
Hyun Kang
Dohae Lee
Myungjin Shin
In-Kwon Lee
51
1
0
26 Dec 2023
Semantic Draw Engineering for Text-to-Image Creation
Semantic Draw Engineering for Text-to-Image Creation
Yang Li
Huaqiang Jiang
Yangkai Wu
51
1
0
23 Dec 2023
Tuning-Free Inversion-Enhanced Control for Consistent Image Editing
Tuning-Free Inversion-Enhanced Control for Consistent Image Editing
Xiaoyue Duan
Shuhao Cui
Guoliang Kang
Baochang Zhang
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
54
10
0
22 Dec 2023
Emage: Non-Autoregressive Text-to-Image Generation
Emage: Non-Autoregressive Text-to-Image Generation
Zhangyin Feng
Runyi Hu
Liangxin Liu
Fan Zhang
Duyu Tang
Yong Dai
Xiaocheng Feng
Jiwei Li
Bing Qin
Shuming Shi
DiffMVLM
78
0
0
22 Dec 2023
Controllable 3D Face Generation with Conditional Style Code Diffusion
Controllable 3D Face Generation with Conditional Style Code Diffusion
Xi Shen
Jianxin Ma
Chang Zhou
Zongxin Yang
DiffM
103
11
0
21 Dec 2023
Previous
123456...151617
Next