ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1612.03242
  4. Cited By
StackGAN: Text to Photo-realistic Image Synthesis with Stacked
  Generative Adversarial Networks

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

10 December 2016
Han Zhang
Tao Xu
Hongsheng Li
Shaoting Zhang
Xiaogang Wang
Xiaolei Huang
Dimitris N. Metaxas
    GAN
ArXivPDFHTML

Papers citing "StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks"

50 / 394 papers shown
Title
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation
Can Qin
Ning Yu
Chen Xing
Shu Zhen Zhang
Zeyuan Chen
Stefano Ermon
Yun Fu
Caiming Xiong
Ran Xu
DiffM
40
20
0
17 Mar 2023
Unsupervised Traffic Scene Generation with Synthetic 3D Scene Graphs
Unsupervised Traffic Scene Generation with Synthetic 3D Scene Graphs
Artem Savkin
Rachid Ellouze
Nassir Navab
F. Tombari
21
10
0
15 Mar 2023
Graph Transformer GANs for Graph-Constrained House Generation
Graph Transformer GANs for Graph-Constrained House Generation
H. Tang
Zhenyu Zhang
Humphrey Shi
Bo-wen Li
Lin Shao
N. Sebe
Radu Timofte
Luc Van Gool
46
19
0
14 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
29
507
0
07 Mar 2023
Testing the Channels of Convolutional Neural Networks
Testing the Channels of Convolutional Neural Networks
Kang Choi
Donghyun Son
Younghoon Kim
Jiwon Seo
25
1
0
06 Mar 2023
TextIR: A Simple Framework for Text-based Editable Image Restoration
TextIR: A Simple Framework for Text-based Editable Image Restoration
Yun-Hao Bai
Cairong Wang
Shuzhao Xie
Chao Dong
Chun Yuan
Zhi Wang
DiffM
30
15
0
28 Feb 2023
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image
  Synthesis
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image Synthesis
Haoran Sun
Yang Wang
Haipeng Liu
Biao Qian
26
8
0
17 Feb 2023
Zero-shot Generation of Coherent Storybook from Plain Text Story using
  Diffusion Models
Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models
Hyeonho Jeong
Gihyun Kwon
Jong Chul Ye
40
20
0
08 Feb 2023
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Ming Tao
Bingkun Bao
Hao Tang
Changsheng Xu
DiffM
VLM
68
101
0
30 Jan 2023
Open Problems in Applied Deep Learning
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
42
2
0
26 Jan 2023
ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions
ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions
Aashish Anantha Ramakrishnan
Sharon X. Huang
Dongwon Lee
24
5
0
05 Jan 2023
On the causality-preservation capabilities of generative modelling
On the causality-preservation capabilities of generative modelling
Yves-Cédric Bauwelinckx
Jan Dhaene
Tim Verdonck
Milan van den Heuvel
CML
AI4CE
38
0
0
03 Jan 2023
Benchmarking Spatial Relationships in Text-to-Image Generation
Benchmarking Spatial Relationships in Text-to-Image Generation
Tejas Gokhale
Hamid Palangi
Besmira Nushi
Vibhav Vineet
Eric Horvitz
Ece Kamar
Chitta Baral
Yezhou Yang
EGVM
45
66
0
20 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
61
823
0
08 Dec 2022
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation
Zutao Jiang
Guangsong Lu
Xiaodan Liang
Jihua Zhu
Wei Zhang
Xiaojun Chang
Hang Xu
DiffM
21
8
0
02 Dec 2022
Fair Generative Models via Transfer Learning
Fair Generative Models via Transfer Learning
Christopher T. H. Teo
Milad Abdollahzadeh
Ngai-man Cheung
21
24
0
02 Dec 2022
ReCo: Region-Controlled Text-to-Image Generation
ReCo: Region-Controlled Text-to-Image Generation
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Linjie Li
Kevin Qinghong Lin
...
Nan Duan
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
DiffM
56
140
0
23 Nov 2022
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Xichen Pan
Pengda Qin
Yuhong Li
Hui Xue
Wenhu Chen
DiffM
21
62
0
20 Nov 2022
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image
  Generation
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
20
11
0
14 Nov 2022
InstantGroup: Instant Template Generation for Scalable Group of Brain
  MRI Registration
InstantGroup: Instant Template Generation for Scalable Group of Brain MRI Registration
Ziyi He
Albert C. S. Chung
21
1
0
10 Nov 2022
Disentangling Content and Motion for Text-Based Neural Video
  Manipulation
Disentangling Content and Motion for Text-Based Neural Video Manipulation
Levent Karacan
Tolga Kerimouglu
.Ismail .Inan
Tolga Birdal
Erkut Erdem
Aykut Erdem
29
1
0
05 Nov 2022
Cover Reproducible Steganography via Deep Generative Models
Cover Reproducible Steganography via Deep Generative Models
Kejiang Chen
Hang Zhou
Yaofei Wang
Meng Li
Weiming Zhang
Neng H. Yu
DiffM
25
9
0
26 Oct 2022
Character-Centric Story Visualization via Visual Planning and Token
  Alignment
Character-Centric Story Visualization via Visual Planning and Token Alignment
Hong Chen
Rujun Han
Te-Lin Wu
Hideki Nakayama
Nanyun Peng
DiffM
VGen
24
31
0
16 Oct 2022
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation
  with Semantic Modulations
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Yi-Chun Zhu
Hongyu Liu
Yibing Song
Ziyang Yuan
Xintong Han
Chun Yuan
Qifeng Chen
Jue Wang
VLM
DiffM
34
31
0
14 Oct 2022
DE-FAKE: Detection and Attribution of Fake Images Generated by
  Text-to-Image Generation Models
DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models
Zeyang Sha
Zheng Li
Ning Yu
Yang Zhang
DiffM
28
115
0
13 Oct 2022
Music-to-Text Synaesthesia: Generating Descriptive Text from Music
  Recordings
Music-to-Text Synaesthesia: Generating Descriptive Text from Music Recordings
Zhihuan Kuang
Shi Zong
Jianbing Zhang
Jiajun Chen
Hongfu Liu
27
4
0
02 Oct 2022
T2CI-GAN: Text to Compressed Image generation using Generative
  Adversarial Network
T2CI-GAN: Text to Compressed Image generation using Generative Adversarial Network
B. Rajesh
Nandakishore Dusa
M. Javed
S. Dubey
P. Nagabhushan
GAN
19
7
0
01 Oct 2022
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image
  Generation
Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation
Xintian Wu
Hanbin Zhao
Liangli Zheng
Shouhong Ding
Xi Li
29
13
0
28 Sep 2022
Automated Urban Planning aware Spatial Hierarchies and Human Instructions
Dongjie Wang
Kunpeng Liu
Yanyong Huang
Leilei Sun
Bowen Du
Yanjie Fu
AI4CE
33
3
0
26 Sep 2022
Cross Modal Compression: Towards Human-comprehensible Semantic
  Compression
Cross Modal Compression: Towards Human-comprehensible Semantic Compression
Jiguo Li
Chuanmin Jia
Xinfeng Zhang
Siwei Ma
Wen Gao
19
18
0
06 Sep 2022
A Realism Metric for Generated LiDAR Point Clouds
A Realism Metric for Generated LiDAR Point Clouds
Larissa T. Triess
Christoph B. Rist
David Peter
J. Marius Zöllner
3DPC
32
8
0
31 Aug 2022
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
Wanshu Fan
Yen-Chun Chen
Dongdong Chen
Yu Cheng
Lu Yuan
Yu-Chiang Frank Wang
DiffM
29
90
0
29 Aug 2022
Vision-Language Matching for Text-to-Image Synthesis via Generative
  Adversarial Networks
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks
Qingrong Cheng
Keyu Wen
X. Gu
VLM
EGVM
32
16
0
20 Aug 2022
Text-to-Image Generation via Implicit Visual Guidance and Hypernetwork
Text-to-Image Generation via Implicit Visual Guidance and Hypernetwork
Xin Yuan
Zhe-nan Lin
Jason Kuen
Jianming Zhang
John Collomosse
32
5
0
17 Aug 2022
Word-Level Fine-Grained Story Visualization
Word-Level Fine-Grained Story Visualization
Bowen Li
Thomas Lukasiewicz
DiffM
3DH
31
24
0
03 Aug 2022
Explicit Use of Fourier Spectrum in Generative Adversarial Networks
Explicit Use of Fourier Spectrum in Generative Adversarial Networks
Soroush Sheikh Gargar
GAN
OOD
29
0
0
02 Aug 2022
Text to Image Synthesis using Stacked Conditional Variational
  Autoencoders and Conditional Generative Adversarial Networks
Text to Image Synthesis using Stacked Conditional Variational Autoencoders and Conditional Generative Adversarial Networks
Haileleol Tibebu
Aadin Malik
V. D. Silva
GAN
20
7
0
06 Jul 2022
Transforming Image Generation from Scene Graphs
Transforming Image Generation from Scene Graphs
Renato Sortino
S. Palazzo
C. Spampinato
ViT
29
2
0
01 Jul 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
107
1,062
0
22 Jun 2022
Blended Latent Diffusion
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
59
373
0
06 Jun 2022
Compositional Visual Generation with Composable Diffusion Models
Compositional Visual Generation with Composable Diffusion Models
Nan Liu
Shuang Li
Yilun Du
Antonio Torralba
J. Tenenbaum
DiffM
CoGe
37
496
0
03 Jun 2022
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
Ming Tao
Bingkun Bao
Hao Tang
Fei Wu
Longhui Wei
Qi Tian
DiffM
24
15
0
02 Jun 2022
VALHALLA: Visual Hallucination for Machine Translation
VALHALLA: Visual Hallucination for Machine Translation
Yi Li
Rameswar Panda
Yoon Kim
Chun-Fu Chen
Rogerio Feris
David D. Cox
Nuno Vasconcelos
MLLM
40
38
0
31 May 2022
Improved Vector Quantized Diffusion Models
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
181
63
0
31 May 2022
M6-Fashion: High-Fidelity Multi-modal Image Generation and Editing
M6-Fashion: High-Fidelity Multi-modal Image Generation and Editing
Zhikang Li
Huiling Zhou
Shuai Bai
Peike Li
Chang Zhou
Hongxia Yang
34
4
0
24 May 2022
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D
  Scenes
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes
Anton Ratnarajah
Zhenyu Tang
R. Aralikatti
Tianyi Zhou
AI4CE
20
36
0
18 May 2022
A Comprehensive Survey of Few-shot Learning: Evolution, Applications,
  Challenges, and Opportunities
A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities
Yisheng Song
Ting-Yuan Wang
S. Mondal
J. P. Sahoo
SLR
50
344
0
13 May 2022
Transformer-based Cross-Modal Recipe Embeddings with Large Batch
  Training
Transformer-based Cross-Modal Recipe Embeddings with Large Batch Training
Jing Yang
Junwen Chen
Keiji Yanai
ViT
21
5
0
10 May 2022
BlobGAN: Spatially Disentangled Scene Representations
BlobGAN: Spatially Disentangled Scene Representations
Dave Epstein
Taesung Park
Richard Y. Zhang
Eli Shechtman
Alexei A. Efros
GAN
SSL
OCL
32
42
0
05 May 2022
Text to artistic image generation
Text to artistic image generation
Qinghe Tian
Jean-Claude Franchitti
GAN
17
1
0
05 May 2022
Previous
12345678
Next