Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1710.10916
Cited By
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks
19 October 2017
Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris N. Metaxas
GAN
Re-assign community
ArXiv
PDF
HTML
Papers citing
"StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks"
50 / 138 papers shown
Title
RealCustom++: Representing Images as Real-Word for Real-Time Customization
Zhendong Mao
Mengqi Huang
Fei Ding
Mingcong Liu
Qian He
Xiaojun Chang
DiffM
78
6
0
03 Jan 2025
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He
Haodong Li
Wei Yin
Yixun Liang
Leheng Li
Kaiqiang Zhou
Hongbo Zhang
Bingbing Liu
Ying-Cong Chen
DiffM
VLM
46
40
0
26 Sep 2024
HyperFusion: A Hypernetwork Approach to Multimodal Integration of Tabular and Medical Imaging Data for Predictive Modeling
Daniel Duenias
Brennan Nichyporuk
Tal Arbel
Tammy Riklin-Raviv
39
3
0
20 Mar 2024
Tuning-Free Inversion-Enhanced Control for Consistent Image Editing
Xiaoyue Duan
Shuhao Cui
Guoliang Kang
Baochang Zhang
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
31
8
0
22 Dec 2023
Improving Compositional Text-to-image Generation with Large Vision-Language Models
Song Wen
Guian Fang
Renrui Zhang
Peng Gao
Hao Dong
Dimitris N. Metaxas
25
17
0
10 Oct 2023
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiarui Yao
Yifan Liu
Simon S. Du
Shifeng Chen
DiffM
24
24
0
28 Sep 2023
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images
Cuican Yu
Guansong Lu
Yihan Zeng
Jian Sun
Xiaodan Liang
Huibin Li
Zongben Xu
Songcen Xu
Wei Zhang
Hang Xu
44
14
0
31 Aug 2023
Counting Guidance for High Fidelity Text-to-Image Synthesis
Wonjune Kang
Kevin Galim
H. Koo
Nam Ik Cho
DiffM
32
8
0
30 Jun 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
Jialu Li
Joey Tianyi Zhou
DiffM
31
49
0
30 May 2023
ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis
Hongchen Tan
Baocai Yin
Kun Wei
Xiuping Liu
Xin Li
15
16
0
13 Apr 2023
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation
Mayu Otani
Riku Togashi
Yu Sawai
Ryosuke Ishigami
Yuta Nakashima
Esa Rahtu
J. Heikkilä
Shiníchi Satoh
38
62
0
04 Apr 2023
Spatial Latent Representations in Generative Adversarial Networks for Image Generation
Maciej Sypetkowski
GAN
26
1
0
25 Mar 2023
Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis
Jiguo Li
Xiaobin Liu
Lirong Zheng
DRL
19
1
0
24 Mar 2023
TextIR: A Simple Framework for Text-based Editable Image Restoration
Yun-Hao Bai
Cairong Wang
Shuzhao Xie
Chao Dong
Chun Yuan
Zhi Wang
DiffM
30
15
0
28 Feb 2023
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image Synthesis
Haoran Sun
Yang Wang
Haipeng Liu
Biao Qian
26
8
0
17 Feb 2023
Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models
Hyeonho Jeong
Gihyun Kwon
Jong Chul Ye
40
20
0
08 Feb 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffM
VLM
65
101
0
30 Jan 2023
TeTIm-Eval: a novel curated evaluation data set for comparing text-to-image models
Federico A. Galatolo
M. G. Cimino
E. Cogotti
19
4
0
15 Dec 2022
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
Zutao Jiang
Guangsong Lu
Xiaodan Liang
Jihua Zhu
Wei Zhang
Xiaojun Chang
Hang Xu
DiffM
21
8
0
02 Dec 2022
ReCo: Region-Controlled Text-to-Image Generation
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Linjie Li
Kevin Qinghong Lin
...
Nan Duan
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
DiffM
56
140
0
23 Nov 2022
Learning to Model Multimodal Semantic Alignment for Story Visualization
Bowen Li
Thomas Lukasiewicz
DiffM
23
2
0
14 Nov 2022
Cover Reproducible Steganography via Deep Generative Models
Kejiang Chen
Hang Zhou
Yaofei Wang
Meng Li
Weiming Zhang
Neng H. Yu
DiffM
23
9
0
26 Oct 2022
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
Rui Li
Weihua Li
Yi Yang
Hanyu Wei
Jianhua Jiang
Quan-wei Bai
DiffM
27
11
0
18 Oct 2022
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Yi-Chun Zhu
Hongyu Liu
Yibing Song
Ziyang Yuan
Xintong Han
Chun Yuan
Qifeng Chen
Jue Wang
VLM
DiffM
34
30
0
14 Oct 2022
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation
Xintian Wu
Hanbin Zhao
Liangli Zheng
Shouhong Ding
Xi Li
29
13
0
28 Sep 2022
Multi-Task Adversarial Training Algorithm for Multi-Speaker Neural Text-to-Speech
Yusuke Nakai
Yuki Saito
K. Udagawa
Hiroshi Saruwatari
AAML
17
1
0
26 Sep 2022
Cross Modal Compression: Towards Human-comprehensible Semantic Compression
Jiguo Li
Chuanmin Jia
Xinfeng Zhang
Siwei Ma
Wen Gao
19
18
0
06 Sep 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
26
90
0
29 Aug 2022
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks
Qingrong Cheng
Keyu Wen
X. Gu
VLM
EGVM
29
16
0
20 Aug 2022
Text-to-Image Generation via Implicit Visual Guidance and Hypernetwork
Xin Yuan
Zhe-nan Lin
Jason Kuen
Jianming Zhang
John Collomosse
29
5
0
17 Aug 2022
Word-Level Fine-Grained Story Visualization
Bowen Li
Thomas Lukasiewicz
DiffM
3DH
31
24
0
03 Aug 2022
Text to Image Synthesis using Stacked Conditional Variational Autoencoders and Conditional Generative Adversarial Networks
Haileleol Tibebu
Aadin Malik
V. D. Silva
GAN
20
7
0
06 Jul 2022
Transforming Image Generation from Scene Graphs
Renato Sortino
S. Palazzo
C. Spampinato
ViT
29
2
0
01 Jul 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
107
1,062
0
22 Jun 2022
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
56
373
0
06 Jun 2022
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
Ming Tao
Bingkun Bao
Hao Tang
Fei Wu
Longhui Wei
Qi Tian
DiffM
19
15
0
02 Jun 2022
Text-to-Face Generation with StyleGAN2
D. M. A. Ayanthi
Sarasi Munasinghe
CVBM
30
5
0
25 May 2022
GR-GAN: Gradual Refinement Text-to-image Generation
Bo Yang
Fangxiang Feng
Xiaojie Wang
EGVM
8
7
0
23 May 2022
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes
Anton Ratnarajah
Zhenyu Tang
R. Aralikatti
Tianyi Zhou
AI4CE
20
36
0
18 May 2022
Optimized latent-code selection for explainable conditional text-to-image GANs
Zhenxing Zhang
Lambert Schomaker
25
4
0
27 Apr 2022
Recurrent Affine Transformation for Text-to-image Synthesis
Senmao Ye
Fei Liu
Mingkui Tan
22
26
0
22 Apr 2022
DR-GAN: Distribution Regularization for Text-to-Image Generation
Hongchen Tan
Xiuping Liu
Baocai Yin
Xin Li
GAN
26
36
0
17 Apr 2022
StyleT2F: Generating Human Faces from Textual Description Using StyleGAN2
Mohamed Shawky Sabae
Mohamed Ahmed Dardir
Remonda Talaat Eskarous
M. Ebbed
CVBM
35
2
0
17 Apr 2022
DT2I: Dense Text-to-Image Generation from Region Descriptions
Stanislav Frolov
Prateek Bansal
Jörn Hees
Andreas Dengel
VLM
21
5
0
05 Apr 2022
CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters
Paul Gavrikov
J. Keuper
AAML
18
31
0
29 Mar 2022
Cross-View Panorama Image Synthesis
Songsong Wu
Hao Tang
Xiaoyuan Jing
Haifeng Zhao
J. Qian
N. Sebe
Yan Yan
GAN
26
24
0
22 Mar 2022
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP
Zihao Wang
Wei Liu
Qian He
Xin-ru Wu
Zili Yi
CLIP
VLM
193
73
0
01 Mar 2022
Deepfake Network Architecture Attribution
Tianyun Yang
Ziyao Huang
Juan Cao
Lei Li
Xirong Li
60
52
0
28 Feb 2022
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU
Junseok Oh
Donghwee Yoon
Injung Kim
31
1
0
28 Feb 2022
Discovering Multiple and Diverse Directions for Cognitive Image Properties
Umut Kocasari
Alperen Bag
Oğuz Kaan Yüksel
Pinar Yanardag
DiffM
16
0
0
23 Feb 2022
1
2
3
Next