arXiv: 1711.10485
AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks
28 November 2017
Tao Xu
Pengchuan Zhang
Qiuyuan Huang
Han Zhang
Zhe Gan
Xiaolei Huang
Xiaodong He
GAN
ViT
Papers citing "AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks"
50 / 822 papers shown
Adaptive Mixed-Scale Feature Fusion Network for Blind AI-Generated Image Quality Assessment
Tianwei Zhou
Songbai Tan
Wei Zhou
Yu Luo
Yuan-Gen Wang
Guanghui Yue
EGVM
101
11
0
23 Apr 2024
Enhancing Prompt Following with Visual Control Through Training-Free Mask-Guided Diffusion
Hongyu Chen
Yi-Meng Gao
Min Zhou
Peng Wang
Xubin Li
Tiezheng Ge
Bo Zheng
DiffM
68
5
0
23 Apr 2024
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Chenyang Zhu
Kai Li
Yue Ma
Chunming He
Li Xiu
DiffM
241
29
0
22 Apr 2024
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
Ali Naseh
Katherine Thai
Mohit Iyyer
Amir Houmansadr
90
7
0
21 Apr 2024
ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis
Aashish Anantha Ramakrishnan
Sharon X. Huang
Dongwon Lee
90
0
0
15 Apr 2024
Semantic Approach to Quantifying the Consistency of Diffusion Model Image Generation
Brinnae Bent
67
3
0
12 Apr 2024
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
Ming Tao
Bing-Kun Bao
Hao Tang
Yaowei Wang
Changsheng Xu
DiffM
91
8
0
09 Apr 2024
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance
Dazhong Shen
Guanglu Song
Zeyue Xue
Fu-Yun Wang
Yu Liu
DiffM
91
18
0
08 Apr 2024
InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization
Xiefan Guo
Jinlin Liu
Miaomiao Cui
Jiankai Li
Hongyu Yang
Di Huang
102
38
0
06 Apr 2024
Real, fake and synthetic faces -- does the coin have three sides?
Shahzeb Naeem
Ramzi Al-Sharawi
Muhammad Riyyan Khan
Usman Tariq
Abhinav Dhall
H. Al-Nashash
90
1
0
02 Apr 2024
CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model
S. Han
Joohee Kim
DiffM
CLIP
70
2
0
22 Mar 2024
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
Alberto Baldrati
Davide Morelli
Marcella Cornia
Marco Bertini
Rita Cucchiara
DiffM
81
8
0
21 Mar 2024
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
Fu-Yun Wang
Xiaoshi Wu
Zhaoyang Huang
Xiaoyu Shi
Dazhong Shen
Guanglu Song
Yu Liu
Hongsheng Li
DiffM
74
14
0
20 Mar 2024
TiBiX: Leveraging Temporal Information for Bidirectional X-ray and Report Generation
Santosh Sanjeev
F. Maani
Arsen Abzhanov
Vijay Ram Papineni
Ibrahim Almakky
Bartłomiej W. Papież
Mohammad Yaqub
MedIm
85
0
0
20 Mar 2024
Can AI Outperform Human Experts in Creating Social Media Creatives?
Eunkyung Park
Raymond K. Wong
Junbum Kwon
62
0
0
19 Mar 2024
LogicalDefender: Discovering, Extracting, and Utilizing Common-Sense Knowledge
Yuhe Liu
Mengxue Kang
Zengchang Qin
Xiangxiang Chu
NAI
VLM
56
0
0
18 Mar 2024
Desigen: A Pipeline for Controllable Design Template Generation
Haohan Weng
Danqing Huang
Yu Qiao
Zheng Hu
Chin-Yew Lin
Tong Zhang
Chong Chen
DiffM
67
17
0
14 Mar 2024
Masked Generative Story Transformer with Character Guidance and Caption Augmentation
Christos Papadimitriou
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
DiffM
141
2
0
13 Mar 2024
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks
Aman Kumar
Khushboo Anand
Shubham Mandloi
Ashutosh Mishra
Avinash Thakur
Neeraj Kasera
Prathosh A P
77
4
0
13 Mar 2024
FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation
Pengchong Qiao
Lei Shang
Chang-Shu Liu
Baigui Sun
Xiang Ji
Jie Chen
CVBM
59
3
0
11 Mar 2024
3D-aware Image Generation and Editing with Multi-modal Conditions
Bo Li
Yike Li
Zhien He
Bin Liu
Yu-Kun Lai
67
2
0
11 Mar 2024
Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation
Junyan Wang
Zhenhong Sun
Zhiyu Tan
Xuanbai Chen
Weihua Chen
Hao Li
Cheng Zhang
Yang Song
96
12
0
08 Mar 2024
Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
Joseph Cho
Fachrina Dewi Puspitasari
Sheng Zheng
Jingyao Zheng
Lik-Hang Lee
Tae-Ho Kim
Choong Seon Hong
Chaoning Zhang
EGVM
VGen
104
43
0
08 Mar 2024
Discriminative Probing and Tuning for Text-to-Image Generation
Leigang Qu
Wenjie Wang
Chak Tou Leong
Hanwang Zhang
Liqiang Nie
Tat-Seng Chua
87
8
0
07 Mar 2024
Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity
Hagyeong Lee
Minkyu Kim
Jun-Hyuk Kim
Seungeon Kim
Dokwan Oh
Jaeho Lee
DiffM
90
6
0
05 Mar 2024
Position: Towards Implicit Prompt For Text-To-Image Models
Yue Yang
Yuqi Lin
Hong Liu
Wenqi Shao
Runjian Chen
Hailong Shang
Yu Wang
Yu Qiao
Kaipeng Zhang
Ping Luo
EGVM
VLM
112
2
0
04 Mar 2024
MCA: Moment Channel Attention Networks
Yangbo Jiang
Zhiwei Jiang
Le Han
Zenan Huang
Nenggan Zheng
37
3
0
04 Mar 2024
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Supreeth Narasimhaswamy
Uttaran Bhattacharya
Xiang Chen
Ishita Dasgupta
Saayan Mitra
Minh Hoai
DiffM
71
25
0
04 Mar 2024
Text-guided Explorable Image Super-resolution
Kanchana Vaishnavi Gandikota
Paramanand Chandramouli
113
8
0
02 Mar 2024
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing
Chufeng Xiao
Hongbo Fu
DiffM
83
3
0
27 Feb 2024
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
Arman Isajanyan
Artur Shatveryan
David Kocharyan
Zhangyang Wang
Humphrey Shi
EGVM
131
6
0
15 Feb 2024
AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning
W. Para
Abdelrahman Eldesokey
Zhenyu Li
Pradyumna Reddy
Jiankang Deng
Peter Wonka
DiffM
74
0
0
08 Feb 2024
MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
Dewei Zhou
You Li
Fan Ma
Zongxin Yang
Yi Yang
DiffM
96
61
0
08 Feb 2024
InstanceDiffusion: Instance-level Control for Image Generation
Xudong Wang
Trevor Darrell
Sai Saketh Rambhatla
Rohit Girdhar
Ishan Misra
VLM
DiffM
61
101
0
05 Feb 2024
Spatial-Aware Latent Initialization for Controllable Image Generation
Wenqiang Sun
Tengtao Li
Zehong Lin
Jun Zhang
94
11
0
29 Jan 2024
FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models
Feihong He
Gang Li
Mengyuan Zhang
Leilei Yan
Hui Xiong
Fanzhang Li
Li Shen
DiffM
88
15
0
28 Jan 2024
Explicitly Representing Syntax Improves Sentence-to-layout Prediction of Unexpected Situations
Wolf Nuyts
Ruben Cartuyvels
Marie-Francine Moens
119
1
0
25 Jan 2024
Text-to-Image Cross-Modal Generation: A Systematic Review
Maciej Żelaszczyk
Jacek Mańdziuk
97
4
0
21 Jan 2024
Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution
Xin Yuan
Jinoo Baek
Keyang Xu
Omer Tov
Hongliang Fei
VGen
65
4
0
18 Jan 2024
Instilling Multi-round Thinking to Text-guided Image Generation
Lidong Zeng
Zhedong Zheng
Yinwei Wei
Tat-Seng Chua
108
5
0
16 Jan 2024
Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Siyu Zou
Jiji Tang
Yiyi Zhou
Jing He
Chaoyi Zhao
Rongsheng Zhang
Zhipeng Hu
Xiaoshuai Sun
111
11
0
15 Jan 2024
Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models
Dingning Liu
Xiaoshui Huang
Yuenan Hou
Zhihui Wang
Zhen-fei Yin
Yongshun Gong
Peng Gao
Wanli Ouyang
49
11
0
09 Jan 2024
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Jinming Che
Qinyuan Wang
Sen Liang
Wei Ren
Jinlong Lin
Xixin Cao
EGVM
49
1
0
08 Jan 2024
Deep Learning-based Image and Video Inpainting: A Survey
Weize Quan
Jiaxi Chen
Yanli Liu
Dong-Ming Yan
Peter Wonka
3DV
78
38
0
07 Jan 2024
Improving Diffusion-Based Image Synthesis with Context Prediction
Ling Yang
Jingwei Liu
Shenda Hong
Zhilong Zhang
Zhilin Huang
Zheming Cai
Wentao Zhang
Tengjiao Wang
DiffM
89
36
0
04 Jan 2024
Semantic Guidance Tuning for Text-To-Image Diffusion Models
Hyun Kang
Dohae Lee
Myungjin Shin
In-Kwon Lee
51
1
0
26 Dec 2023
Semantic Draw Engineering for Text-to-Image Creation
Yang Li
Huaqiang Jiang
Yangkai Wu
51
1
0
23 Dec 2023
Tuning-Free Inversion-Enhanced Control for Consistent Image Editing
Xiaoyue Duan
Shuhao Cui
Guoliang Kang
Baochang Zhang
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
54
10
0
22 Dec 2023
Emage: Non-Autoregressive Text-to-Image Generation
Zhangyin Feng
Runyi Hu
Liangxin Liu
Fan Zhang
Duyu Tang
Yong Dai
Xiaocheng Feng
Jiwei Li
Bing Qin
Shuming Shi
DiffM
VLM
78
0
0
22 Dec 2023
Controllable 3D Face Generation with Conditional Style Code Diffusion
Xi Shen
Jianxin Ma
Chang Zhou
Zongxin Yang
DiffM
103
11
0
21 Dec 2023