ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1605.05396
  4. Cited By
Generative Adversarial Text to Image Synthesis

Generative Adversarial Text to Image Synthesis

17 May 2016
Scott E. Reed
Zeynep Akata
Xinchen Yan
Lajanugen Logeswaran
Bernt Schiele
Honglak Lee
    GAN
ArXivPDFHTML

Papers citing "Generative Adversarial Text to Image Synthesis"

50 / 504 papers shown
Title
Target-Free Text-guided Image Manipulation
Target-Free Text-guided Image Manipulation
Wanshu Fan
Cheng Yang
Chiao-An Yang
Yu-Chiang Frank Wang
DiffM
26
2
0
26 Nov 2022
SpaText: Spatio-Textual Representation for Controllable Image Generation
SpaText: Spatio-Textual Representation for Controllable Image Generation
Omri Avrahami
Thomas Hayes
Oran Gafni
Sonal Gupta
Yaniv Taigman
Devi Parikh
Dani Lischinski
Ohad Fried
Xiaoyue Yin
DiffM
40
203
0
25 Nov 2022
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Tanzila Rahman
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Shweta Mahajan
Leonid Sigal
DiffM
19
68
0
23 Nov 2022
ReCo: Region-Controlled Text-to-Image Generation
ReCo: Region-Controlled Text-to-Image Generation
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Linjie Li
Kevin Qinghong Lin
...
Nan Duan
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
DiffM
56
140
0
23 Nov 2022
Tell Me What Happened: Unifying Text-guided Video Completion via
  Multimodal Masked Video Generation
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
Tsu-jui Fu
Licheng Yu
Ning Zhang
Cheng-Yang Fu
Jong-Chyi Su
William Yang Wang
Sean Bell
VGen
56
37
0
23 Nov 2022
PromptTTS: Controllable Text-to-Speech with Text Descriptions
PromptTTS: Controllable Text-to-Speech with Text Descriptions
Zhifang Guo
Yichong Leng
Yihan Wu
Sheng Zhao
Xuejiao Tan
DiffM
19
91
0
22 Nov 2022
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image
  Generation
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
20
11
0
14 Nov 2022
Learning to Model Multimodal Semantic Alignment for Story Visualization
Learning to Model Multimodal Semantic Alignment for Story Visualization
Bowen Li
Thomas Lukasiewicz
DiffM
31
2
0
14 Nov 2022
InstantGroup: Instant Template Generation for Scalable Group of Brain
  MRI Registration
InstantGroup: Instant Template Generation for Scalable Group of Brain MRI Registration
Ziyi He
Albert C. S. Chung
24
1
0
10 Nov 2022
Disentangling Content and Motion for Text-Based Neural Video
  Manipulation
Disentangling Content and Motion for Text-Based Neural Video Manipulation
Levent Karacan
Tolga Kerimouglu
.Ismail .Inan
Tolga Birdal
Erkut Erdem
Aykut Erdem
29
1
0
05 Nov 2022
Cover Reproducible Steganography via Deep Generative Models
Cover Reproducible Steganography via Deep Generative Models
Kejiang Chen
Hang Zhou
Yaofei Wang
Meng Li
Weiming Zhang
Neng H. Yu
DiffM
31
9
0
26 Oct 2022
Language Does More Than Describe: On The Lack Of Figurative Speech in
  Text-To-Image Models
Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models
Ricardo Kleinlein
Cristina Luna Jiménez
Fernando Fernández-Martínez
DiffM
20
3
0
19 Oct 2022
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation
  with Semantic Modulations
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
Yi-Chun Zhu
Hongyu Liu
Yibing Song
Ziyang Yuan
Xintong Han
Chun Yuan
Qifeng Chen
Jue Wang
VLM
DiffM
34
31
0
14 Oct 2022
DE-FAKE: Detection and Attribution of Fake Images Generated by
  Text-to-Image Generation Models
DE-FAKE: Detection and Attribution of Fake Images Generated by Text-to-Image Generation Models
Zeyang Sha
Zheng Li
Ning Yu
Yang Zhang
DiffM
28
116
0
13 Oct 2022
Markup-to-Image Diffusion Models with Scheduled Sampling
Markup-to-Image Diffusion Models with Scheduled Sampling
Yuntian Deng
Noriyuki Kojima
Alexander M. Rush
DiffM
38
4
0
11 Oct 2022
Audio-Visual Face Reenactment
Audio-Visual Face Reenactment
Madhav Agarwal
Rudrabha Mukhopadhyay
Vinay P. Namboodiri
C. V. Jawahar
DiffM
VGen
27
22
0
06 Oct 2022
Fast OT for Latent Domain Adaptation
Fast OT for Latent Domain Adaptation
Siddharth Roheda
Ashkan Panahi
Hamid Krim
OOD
OT
17
1
0
02 Oct 2022
T2CI-GAN: Text to Compressed Image generation using Generative
  Adversarial Network
T2CI-GAN: Text to Compressed Image generation using Generative Adversarial Network
B. Rajesh
Nandakishore Dusa
M. Javed
S. Dubey
P. Nagabhushan
GAN
24
7
0
01 Oct 2022
Vec2Face-v2: Unveil Human Faces from their Blackbox Features via
  Attention-based Network in Face Recognition
Vec2Face-v2: Unveil Human Faces from their Blackbox Features via Attention-based Network in Face Recognition
Thanh-Dat Truong
C. Duong
Ngan Le
Marios Savvides
Khoa Luu
CVBM
72
9
0
11 Sep 2022
Cross Modal Compression: Towards Human-comprehensible Semantic
  Compression
Cross Modal Compression: Towards Human-comprehensible Semantic Compression
Jiguo Li
Chuanmin Jia
Xinfeng Zhang
Siwei Ma
Wen Gao
19
18
0
06 Sep 2022
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey
Yanbei Chen
Massimiliano Mancini
Xiatian Zhu
Zeynep Akata
45
113
0
24 Aug 2022
Large-Scale Traffic Congestion Prediction based on Multimodal Fusion and
  Representation Mapping
Large-Scale Traffic Congestion Prediction based on Multimodal Fusion and Representation Mapping
Bo Zhou
Jiahui Liu
Songyi Cui
Yaping Zhao
26
5
0
23 Aug 2022
Vision-Language Matching for Text-to-Image Synthesis via Generative
  Adversarial Networks
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks
Qingrong Cheng
Keyu Wen
X. Gu
VLM
EGVM
32
16
0
20 Aug 2022
A new way of video compression via forward-referencing using deep
  learning
A new way of video compression via forward-referencing using deep learning
S. Rajin
M. Murshed
M. Paul
S. Teng
J. Ma
19
0
0
13 Aug 2022
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal
  Fashion Design
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design
Xujie Zhang
Yuyang Sha
Michael C. Kampffmeyer
Zhenyu Xie
Zequn Jie
Chengwen Huang
Jianqing Peng
Xiaodan Liang
14
18
0
11 Aug 2022
Word-Level Fine-Grained Story Visualization
Word-Level Fine-Grained Story Visualization
Bowen Li
Thomas Lukasiewicz
DiffM
3DH
31
24
0
03 Aug 2022
Subject-Specific Lesion Generation and Pseudo-Healthy Synthesis for
  Multiple Sclerosis Brain Images
Subject-Specific Lesion Generation and Pseudo-Healthy Synthesis for Multiple Sclerosis Brain Images
Berke Doga Basaran
Mengyun Qiao
Paul M. Matthews
Wenjia Bai
MedIm
27
9
0
03 Aug 2022
TIPS: Text-Induced Pose Synthesis
TIPS: Text-Induced Pose Synthesis
Prasun Roy
Subhankar Ghosh
Saumik Bhattacharya
Umapada Pal
Michael Blumenstein
DiffM
26
13
0
24 Jul 2022
Contrastive Monotonic Pixel-Level Modulation
Contrastive Monotonic Pixel-Level Modulation
Kun Lu
Rongpeng Li
Honggang Zhang
40
3
0
23 Jul 2022
CP3: Unifying Point Cloud Completion by Pretrain-Prompt-Predict Paradigm
CP3: Unifying Point Cloud Completion by Pretrain-Prompt-Predict Paradigm
Mingye Xu
Yali Wang
Yihao Liu
Tong He
Yu Qiao
3DPC
39
17
0
12 Jul 2022
Text to Image Synthesis using Stacked Conditional Variational
  Autoencoders and Conditional Generative Adversarial Networks
Text to Image Synthesis using Stacked Conditional Variational Autoencoders and Conditional Generative Adversarial Networks
Haileleol Tibebu
Aadin Malik
V. D. Silva
GAN
20
7
0
06 Jul 2022
Spatial Transformation for Image Composition via Correspondence Learning
Spatial Transformation for Image Composition via Correspondence Learning
Bo Zhang
Yue Liu
K. Lu
Li Niu
Liqing Zhang
36
3
0
06 Jul 2022
Transforming Image Generation from Scene Graphs
Transforming Image Generation from Scene Graphs
Renato Sortino
S. Palazzo
C. Spampinato
ViT
29
2
0
01 Jul 2022
A Fast Text-Driven Approach for Generating Artistic Content
A Fast Text-Driven Approach for Generating Artistic Content
M. Lupascu
Ryan Murdock
Ionut Mironica
Yijun Li
24
1
0
22 Jun 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
119
1,066
0
22 Jun 2022
Recurrent Transformer Variational Autoencoders for Multi-Action Motion
  Synthesis
Recurrent Transformer Variational Autoencoders for Multi-Action Motion Synthesis
Rania Briq
Chuhang Zou
L. Pishchulin
Christopher Broaddus
Juergen Gall
24
1
0
14 Jun 2022
Intra-agent speech permits zero-shot task acquisition
Intra-agent speech permits zero-shot task acquisition
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
36
9
0
07 Jun 2022
Blended Latent Diffusion
Blended Latent Diffusion
Omri Avrahami
Ohad Fried
Dani Lischinski
DiffM
77
373
0
06 Jun 2022
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
DE-Net: Dynamic Text-guided Image Editing Adversarial Networks
Ming Tao
Bingkun Bao
Hao Tang
Fei Wu
Longhui Wei
Qi Tian
DiffM
24
15
0
02 Jun 2022
Modeling Image Composition for Complex Scene Generation
Modeling Image Composition for Complex Scene Generation
Zuopeng Yang
Daqing Liu
Chaoyue Wang
J. Yang
Dacheng Tao
ViT
36
50
0
02 Jun 2022
VALHALLA: Visual Hallucination for Machine Translation
VALHALLA: Visual Hallucination for Machine Translation
Yi Li
Yikang Shen
Yoon Kim
Chun-Fu Chen
Rogerio Feris
David D. Cox
Nuno Vasconcelos
MLLM
40
38
0
31 May 2022
Improved Vector Quantized Diffusion Models
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
187
63
0
31 May 2022
Text-to-Face Generation with StyleGAN2
Text-to-Face Generation with StyleGAN2
D. M. A. Ayanthi
Sarasi Munasinghe
CVBM
30
5
0
25 May 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
84
5,797
0
23 May 2022
GR-GAN: Gradual Refinement Text-to-image Generation
GR-GAN: Gradual Refinement Text-to-image Generation
Bo Yang
Fangxiang Feng
Xiaojie Wang
EGVM
16
7
0
23 May 2022
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
Fangzhou Hong
Mingyuan Zhang
Liang Pan
Zhongang Cai
Lei Yang
Ziwei Liu
CLIP
98
79
0
17 May 2022
RiCS: A 2D Self-Occlusion Map for Harmonizing Volumetric Objects
RiCS: A 2D Self-Occlusion Map for Harmonizing Volumetric Objects
Yunseok Jang
Ruben Villegas
Jimei Yang
Duygu Ceylan
Xin Sun
Honglak Lee
3DH
32
0
0
14 May 2022
StyLandGAN: A StyleGAN based Landscape Image Synthesis using Depth-map
StyLandGAN: A StyleGAN based Landscape Image Synthesis using Depth-map
Gun-Hee Lee
Jonghwa Yim
Chanran Kim
Min-Jung Kim
GAN
MDE
39
1
0
13 May 2022
High-Resolution UAV Image Generation for Sorghum Panicle Detection
High-Resolution UAV Image Generation for Sorghum Panicle Detection
Enyu Cai
Zhankun Luo
Sriram Baireddy
Jiaqi Guo
Changye Yang
Edward J. Delp
13
2
0
08 May 2022
Synthetic Data -- what, why and how?
Synthetic Data -- what, why and how?
James Jordon
Lukasz Szpruch
F. Houssiau
M. Bottarelli
Giovanni Cherubin
Carsten Maple
Samuel N. Cohen
Adrian Weller
46
109
0
06 May 2022
Previous
123456...91011
Next