ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.10485
  4. Cited By
AttnGAN: Fine-Grained Text to Image Generation with Attentional
  Generative Adversarial Networks

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

28 November 2017
Tao Xu
Pengchuan Zhang
Qiuyuan Huang
Han Zhang
Zhe Gan
Xiaolei Huang
Xiaodong He
    GANViT
ArXiv (abs)PDFHTML

Papers citing "AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks"

50 / 822 papers shown
Title
DreamTuner: Single Image is Enough for Subject-Driven Generation
DreamTuner: Single Image is Enough for Subject-Driven Generation
Miao Hua
Jiawei Liu
Fei Ding
Wei Liu
Jie Wu
Qian He
70
31
0
21 Dec 2023
The Right Losses for the Right Gains: Improving the Semantic Consistency
  of Deep Text-to-Image Generation with Distribution-Sensitive Losses
The Right Losses for the Right Gains: Improving the Semantic Consistency of Deep Text-to-Image Generation with Distribution-Sensitive Losses
Mahmoud Ahmed
Omer Moussa
Ismail Shaheen
Mohamed S. Abdelfattah
Amr Abdalla
Marwan Eid
Hesham M. Eraqi
Mohamed Moustafa
111
0
0
18 Dec 2023
DETER: Detecting Edited Regions for Deterring Generative Manipulations
DETER: Detecting Edited Regions for Deterring Generative Manipulations
Sai Wang
Ye Zhu
Ruoyu Wang
Amaya Dharmasiri
Olga Russakovsky
Yu Wu
68
2
0
16 Dec 2023
Rich Human Feedback for Text-to-Image Generation
Rich Human Feedback for Text-to-Image Generation
Youwei Liang
Junfeng He
Gang Li
Peizhao Li
Arseniy Klimovskiy
...
Yiwen Luo
Yang Li
Kai Kohlhoff
Deepak Ramachandran
Vidhya Navalpakkam
EGVM
83
86
0
15 Dec 2023
Text-Guided Face Recognition using Multi-Granularity Cross-Modal
  Contrastive Learning
Text-Guided Face Recognition using Multi-Granularity Cross-Modal Contrastive Learning
Md Golam Moula Mehedi Hasan
S. Sami
Nasser M. Nasrabadi
65
6
0
14 Dec 2023
A Survey of Generative AI for Intelligent Transportation Systems
A Survey of Generative AI for Intelligent Transportation Systems
Huan Yan
Yong Li
52
9
0
13 Dec 2023
GenHowTo: Learning to Generate Actions and State Transformations from
  Instructional Videos
GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos
Tomávs Souvcek
Dima Damen
Michael Wray
Ivan Laptev
Josef Sivic
VGen
88
21
0
12 Dec 2023
ControlNet-XS: Designing an Efficient and Effective Architecture for
  Controlling Text-to-Image Diffusion Models
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models
Denis Zavadski
Johann-Friedrich Feiden
Carsten Rother
DiffM
81
10
0
11 Dec 2023
CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image
  Diffusion Models
CONFORM: Contrast is All You Need For High-Fidelity Text-to-Image Diffusion Models
Tuna Han Salih Meral
Enis Simsar
Federico Tombari
Pinar Yanardag
DiffMVLM
116
34
0
11 Dec 2023
PSCR: Patches Sampling-based Contrastive Regression for AIGC Image
  Quality Assessment
PSCR: Patches Sampling-based Contrastive Regression for AIGC Image Quality Assessment
Jiquan Yuan
Xinyan Cao
Linjing Cao
Jinlong Lin
Xixin Cao
EGVM
74
11
0
10 Dec 2023
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Cache Me if You Can: Accelerating Diffusion Models through Block Caching
Felix Wimbauer
Bichen Wu
Edgar Schoenfeld
Xiaoliang Dai
Ji Hou
...
Jonas Kohler
Christian Rupprecht
Daniel Cremers
Peter Vajda
Jialiang Wang
DiffM
95
78
0
06 Dec 2023
DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing
DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing
Shao-Yu Chang
Hwann-Tzong Chen
Tyng-Luh Liu
DiffMVGen
99
3
0
05 Dec 2023
ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for
  ControlNet
ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet
Soon Yau Cheong
Armin Mustafa
Andrew Gilbert
DiffM
65
5
0
05 Dec 2023
Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment
Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment
Brian Gordon
Yonatan Bitton
Yonatan Shafir
Roopal Garg
Xi Chen
Dani Lischinski
Daniel Cohen-Or
Idan Szpektor
87
12
0
05 Dec 2023
Customization Assistant for Text-to-image Generation
Customization Assistant for Text-to-image Generation
Yufan Zhou
Ruiyi Zhang
Jiuxiang Gu
Tongfei Sun
DiffM
85
12
0
05 Dec 2023
Foundation Models for Weather and Climate Data Understanding: A
  Comprehensive Survey
Foundation Models for Weather and Climate Data Understanding: A Comprehensive Survey
Shengchao Chen
Guodong Long
Jing Jiang
Dikai Liu
Chengqi Zhang
SyDaAI4CE
129
25
0
05 Dec 2023
Multimodality-guided Image Style Transfer using Cross-modal GAN
  Inversion
Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion
Hanyu Wang
Pengxiang Wu
Kevin Dela Rosa
Chen Wang
Abhinav Shrivastava
113
9
0
04 Dec 2023
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
Sherwin Bahmani
Ivan Skorokhodov
Victor Rong
Gordon Wetzstein
Leonidas Guibas
Peter Wonka
Sergey Tulyakov
Jeong Joon Park
Andrea Tagliasacchi
David B. Lindell
DiffM
143
112
0
29 Nov 2023
Material Palette: Extraction of Materials from a Single Image
Material Palette: Extraction of Materials from a Single Image
Ivan Lopes
Fabio Pizzati
Raoul de Charette
DiffM
75
14
0
28 Nov 2023
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Yutong Feng
Biao Gong
Di Chen
Yujun Shen
Yu Liu
Jingren Zhou
DiffM
119
50
0
28 Nov 2023
Efficient Multimodal Diffusion Models Using Joint Data Infilling with
  Partially Shared U-Net
Efficient Multimodal Diffusion Models Using Joint Data Infilling with Partially Shared U-Net
Zizhao Hu
Shaochong Jia
Mohammad Rostami
DiffMMedIm
50
0
0
28 Nov 2023
Text-Driven Image Editing via Learnable Regions
Text-Driven Image Editing via Learnable Regions
Yuanze Lin
Yi-Wen Chen
Yi-Hsuan Tsai
Lu Jiang
Ming-Hsuan Yang
DiffM
103
20
0
28 Nov 2023
Tell2Design: A Dataset for Language-Guided Floor Plan Generation
Tell2Design: A Dataset for Language-Guided Floor Plan Generation
Sicong Leng
Yangqiaoyu Zhou
Mohammed Haroon Dupty
W. Lee
Sam Joyce
Wei Lu
3DV
67
15
0
27 Nov 2023
Learning Disentangled Identifiers for Action-Customized Text-to-Image
  Generation
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
Siteng Huang
Biao Gong
Yutong Feng
Xi Chen
Yu Fu
Yu Liu
Donglin Wang
DiffM
68
14
0
27 Nov 2023
Check, Locate, Rectify: A Training-Free Layout Calibration System for
  Text-to-Image Generation
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
Biao Gong
Siteng Huang
Yutong Feng
Shiwei Zhang
Yuyuan Li
Yu Liu
DiffM
110
13
0
27 Nov 2023
ChatTraffic: Text-to-Traffic Generation via Diffusion Model
ChatTraffic: Text-to-Traffic Generation via Diffusion Model
Chengyang Zhang
Yong Zhang
Qitan Shao
Bo Li
Yisheng Lv
Xinglin Piao
Baocai Yin
84
7
0
27 Nov 2023
DreamCreature: Crafting Photorealistic Virtual Creatures from
  Imagination
DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination
KamWoh Ng
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
DiffM
47
6
0
27 Nov 2023
CatVersion: Concatenating Embeddings for Diffusion-Based Text-to-Image
  Personalization
CatVersion: Concatenating Embeddings for Diffusion-Based Text-to-Image Personalization
Ruoyu Zhao
Mingrui Zhu
Shiyin Dong
Nannan Wang
Xinbo Gao
DiffM
68
12
0
24 Nov 2023
Soulstyler: Using Large Language Model to Guide Image Style Transfer for
  Target Object
Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
Junhao Chen
Peng Rong
Jingbo Sun
Chao Li
Xiang Li
Hongwu Lv
VLM
59
2
0
22 Nov 2023
The Challenges of Image Generation Models in Generating Multi-Component
  Images
The Challenges of Image Generation Models in Generating Multi-Component Images
Tham Yik Foong
Shashank Kotyan
Poyuan Mao
Danilo Vasconcellos Vargas
EGVM
83
1
0
22 Nov 2023
Steal My Artworks for Fine-tuning? A Watermarking Framework for
  Detecting Art Theft Mimicry in Text-to-Image Models
Steal My Artworks for Fine-tuning? A Watermarking Framework for Detecting Art Theft Mimicry in Text-to-Image Models
Ge Luo
Junqiang Huang
Manman Zhang
Zhenxing Qian
Sheng Li
Xinpeng Zhang
WIGM
65
9
0
22 Nov 2023
EditShield: Protecting Unauthorized Image Editing by Instruction-guided
  Diffusion Models
EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
Ruoxi Chen
Haibo Jin
Yixin Liu
Jinyin Chen
Haohan Wang
Lichao Sun
93
11
0
19 Nov 2023
UFOGen: You Forward Once Large Scale Text-to-Image Generation via
  Diffusion GANs
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu
Yang Zhao
Zhisheng Xiao
Tingbo Hou
217
121
0
14 Nov 2023
Instant3D: Instant Text-to-3D Generation
Instant3D: Instant Text-to-3D Generation
Ming Li
Pan Zhou
Jia-Wei Liu
Jussi Keppo
Min Lin
Shuicheng Yan
Xiangyu Xu
148
34
0
14 Nov 2023
A Chronological Survey of Theoretical Advancements in Generative
  Adversarial Networks for Computer Vision
A Chronological Survey of Theoretical Advancements in Generative Adversarial Networks for Computer Vision
Hrishikesh Sharma
AI4CEEGVM
45
1
0
02 Nov 2023
Transformation vs Tradition: Artificial General Intelligence (AGI) for
  Arts and Humanities
Transformation vs Tradition: Artificial General Intelligence (AGI) for Arts and Humanities
Zheng Liu
Yiwei Li
Qian Cao
Junwen Chen
Tianze Yang
...
John Gibbs
Khaled Rasheed
Ninghao Liu
Gengchen Mai
Tianming Liu
AI4CE
129
10
0
30 Oct 2023
Davidsonian Scene Graph: Improving Reliability in Fine-grained
  Evaluation for Text-to-Image Generation
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation
Jaemin Cho
Yushi Hu
Roopal Garg
Peter Anderson
Ranjay Krishna
Jason Baldridge
Mohit Bansal
Jordi Pont-Tuset
Su Wang
EGVM
84
81
0
27 Oct 2023
A Picture is Worth a Thousand Words: Principled Recaptioning Improves
  Image Generation
A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation
Eyal Segalis
Dani Valevski
Danny Lumen
Yossi Matias
Yaniv Leviathan
DiffM
112
26
0
25 Oct 2023
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image
  Generative Models
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
Shawn Shan
Wenxin Ding
Josephine Passananti
Stanley Wu
Haitao Zheng
Ben Y. Zhao
SILMDiffM
106
53
0
20 Oct 2023
Bridging the Gap between Synthetic and Authentic Images for Multimodal
  Machine Translation
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
Wenyu Guo
Qingkai Fang
Dong Yu
Yang Feng
75
7
0
20 Oct 2023
Object-aware Inversion and Reassembly for Image Editing
Object-aware Inversion and Reassembly for Image Editing
Zhen Yang
Dinggang Gui
Wen Wang
Hao Chen
Bohan Zhuang
Chunhua Shen
DiffM
102
19
0
18 Oct 2023
To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still
  Easy To Generate Unsafe Images ... For Now
To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now
Yimeng Zhang
Jinghan Jia
Xin Chen
Aochuan Chen
Yihua Zhang
Jiancheng Liu
Ke Ding
Sijia Liu
DiffM
177
101
0
18 Oct 2023
GenEval: An Object-Focused Framework for Evaluating Text-to-Image
  Alignment
GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment
Dhruba Ghosh
Hanna Hajishirzi
Ludwig Schmidt
98
202
0
17 Oct 2023
LLM Blueprint: Enabling Text-to-Image Generation with Complex and
  Detailed Prompts
LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts
Hanan Gani
Shariq Farooq Bhat
Muzammal Naseer
Salman Khan
Peter Wonka
DiffM
106
44
0
16 Oct 2023
AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion
AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion
Yitong Jiang
Zhaoyang Zhang
Tianfan Xue
Liang Feng
DiffM
159
46
0
16 Oct 2023
Improving Compositional Text-to-image Generation with Large
  Vision-Language Models
Improving Compositional Text-to-image Generation with Large Vision-Language Models
Song Wen
Guian Fang
Renrui Zhang
Peng Gao
Hao Dong
Dimitris N. Metaxas
85
18
0
10 Oct 2023
AI-Generated Images as Data Source: The Dawn of Synthetic Era
AI-Generated Images as Data Source: The Dawn of Synthetic Era
Zuhao Yang
Fangneng Zhan
Kunhao Liu
Muyu Xu
Shijian Lu
EGVM
107
20
0
03 Oct 2023
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal
  Retrieval
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval
Hao Li
Marie-Jeanne Lesot
Lianli Gao
Xiaosu Zhu
Christophe Marsala
EDL
78
15
0
29 Sep 2023
AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive
  Computation
AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation
Shengkun Tang
Yaqing Wang
Maksim Dzhigil
Yi Liang
Yongbin Li
Dongkuan Xu
62
7
0
29 Sep 2023
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image
  Action Editing
KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing
Jiarui Yao
Yifan Liu
Simon S. Du
Shifeng Chen
DiffM
64
24
0
28 Sep 2023
Previous
12345...151617
Next