Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.09515
Cited By
StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
23 January 2023
Axel Sauer
Tero Karras
S. Laine
Andreas Geiger
Timo Aila
Re-assign community
ArXiv
PDF
HTML
Papers citing
"StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis"
50 / 149 papers shown
Title
Can We Achieve Efficient Diffusion without Self-Attention? Distilling Self-Attention into Convolutions
Ziyi Dong
Chengxing Zhou
Weijian Deng
Pengxu Wei
Xiangyang Ji
Liang Lin
MQ
53
0
0
30 Apr 2025
Autoregressive Distillation of Diffusion Transformers
Yeongmin Kim
Sotiris Anagnostidis
Yuming Du
Edgar Schönfeld
Jonas Kohler
Markos Georgopoulos
Albert Pumarola
Ali K. Thabet
A. Sanakoyeu
28
0
0
15 Apr 2025
Prompting Forgetting: Unlearning in GANs via Textual Guidance
Piyush Nagasubramaniam
Neeraj Karamchandani
Chen Wu
Sencun Zhu
DiffM
AILaw
MU
54
0
0
01 Apr 2025
CODA: Repurposing Continuous VAEs for Discrete Tokenization
Zeyu Liu
Zanlin Ni
Yeguo Hua
Xin Deng
Xiao Ma
Cheng Zhong
Gao Huang
47
0
0
22 Mar 2025
Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking
Ziyi Wang
Songbai Tan
Gang Xu
Xuerui Qiu
Hongbin Xu
Xin Meng
Ming Li
Fei Richard Yu
WIGM
63
0
0
14 Mar 2025
MGHanD: Multi-modal Guidance for authentic Hand Diffusion
Taehyeon Eum
Jieun Choi
Tae-Kyun Kim
52
0
0
11 Mar 2025
Modular Customization of Diffusion Models via Blockwise-Parameterized Low-Rank Adaptation
Mingkang Zhu
Xi Chen
Z. Wang
Bei Yu
Hengshuang Zhao
Jiaya Jia
MoMe
55
0
0
11 Mar 2025
LapLoss: Laplacian Pyramid-based Multiscale loss for Image Translation
Krish Didwania
Ishaan Gakhar
Prakhar Arya
Sanskriti Labroo
58
0
0
07 Mar 2025
A Critical Assessment of Modern Generative Models' Ability to Replicate Artistic Styles
Andrea Asperti
Franky George
Tiberio Marras
Razvan Ciprian Stricescu
Fabio Zanotti
EGVM
46
0
0
21 Feb 2025
LS-GAN: Human Motion Synthesis with Latent-space GANs
Avinash Amballa
Gayathri Akkinapalli
Vinitra Muralikrishnan
36
1
0
30 Dec 2024
GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space
Souhaib Attaiki
Paul Guerrero
Duygu Ceylan
Niloy J. Mitra
M. Ovsjanikov
85
0
0
21 Dec 2024
SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
Yushu Wu
Zhixing Zhang
Yanyu Li
Yanwu Xu
Anil Kag
...
Ju Hu
Dimitris N. Metaxas
Yanzhi Wang
Sergey Tulyakov
Jian Ren
DiffM
VGen
97
4
0
13 Dec 2024
Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects
Weimin Qiu
Jieke Wang
Meng Tang
DiffM
82
0
0
28 Nov 2024
CDI: Copyrighted Data Identification in Diffusion Models
Jan Dubiñski
Antoni Kowalczuk
Franziska Boenisch
Adam Dziedzic
72
1
0
19 Nov 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
Anil Kag
Huseyin Coskun
Jierun Chen
Junli Cao
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
Jian Ren
51
3
0
07 Nov 2024
Kernel Orthogonality does not necessarily imply a Decrease in Feature Map Redundancy in CNNs: Convolutional Similarity Minimization
Zakariae Belmekki
Jun Li
Patrick Reuter
David Antonio Gómez Jáuregui
Karl Jenkins
26
0
0
05 Nov 2024
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Viacheslav Surkov
Chris Wendler
Mikhail Terekhov
Justin Deschenaux
Robert West
Çağlar Gülçehre
VLM
40
13
0
28 Oct 2024
Diff-Instruct*: Towards Human-Preferred One-step Text-to-image Generative Models
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
28
3
0
28 Oct 2024
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
Weijian Luo
EGVM
36
6
0
24 Oct 2024
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
Haonan Lin
Mengmeng Wang
Jiahao Wang
Wenbin An
Yan Chen
Yong Liu
Feng Tian
Guang Dai
Jingdong Wang
Qianying Wang
DiffM
43
9
0
24 Oct 2024
Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Fu-Yun Wang
Ling Yang
Zhaoyang Huang
Mengdi Wang
Hongsheng Li
31
13
0
09 Oct 2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Sihyun Yu
Sangkyung Kwak
Huiwon Jang
Jongheon Jeong
Jonathan Huang
Jinwoo Shin
Saining Xie
OCL
70
64
0
09 Oct 2024
Data Extrapolation for Text-to-image Generation on Small Datasets
Senmao Ye
Fei Liu
33
0
0
02 Oct 2024
I2I-Galip: Unsupervised Medical Image Translation Using Generative Adversarial CLIP
Yilmaz Korkmaz
V. Patel
VLM
MedIm
GAN
33
1
0
19 Sep 2024
OSV: One Step is Enough for High-Quality Image to Video Generation
Xiaofeng Mao
Zhengkai Jiang
Fu-Yun Wang
Wenbing Zhu
Hao Chen
Mingmin Chi
Yabiao Wang
Wenhan Luo
DiffM
VGen
74
8
0
17 Sep 2024
Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization
Vage Egiazarian
Denis Kuznedelev
Anton Voronov
Ruslan Svirschevski
Michael Goin
Daniil Pavlov
Dan Alistarh
Dmitry Baranchuk
MQ
31
0
0
31 Aug 2024
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
Zanlin Ni
Yulin Wang
Renping Zhou
Rui Lu
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Yuan Yao
Gao Huang
29
7
0
31 Aug 2024
SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher
T. Dao
Thuan Hoang Nguyen
T. Le
D. Vu
Khoi Nguyen
Cuong Pham
Anh Tran
DiffM
38
11
0
26 Aug 2024
UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization
Junjie He
Yifeng Geng
Liefeng Bo
DiffM
48
20
0
12 Aug 2024
LogoSticker: Inserting Logos into Diffusion Models for Customized Generation
Mingkang Zhu
Xi Chen
Zhongdao Wang
Hengshuang Zhao
Jiaya Jia
DiffM
42
0
0
18 Jul 2024
Exploring the Potentials and Challenges of Deep Generative Models in Product Design Conception
Phillip Mueller
Lars Mikelsons
AI4CE
41
1
0
15 Jul 2024
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
Guian Fang
Wenbiao Yan
Yuanfan Guo
J. N. Han
Zutao Jiang
Hang Xu
Shengcai Liao
Xiaodan Liang
38
4
0
09 Jul 2024
AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling
Sherry X Chen
Yaron Vaxman
Elad Ben Baruch
David Asulin
Aviad Moreshet
Misha Sra
Pradeep Sen
37
0
0
08 Jul 2024
ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation
Zhiyuan Ma
Yuxiang Wei
Yabin Zhang
Xiangyu Zhu
Zhen Lei
Lei Zhang
DiffM
38
15
0
02 Jul 2024
What If We Recaption Billions of Web Images with LLaMA-3?
Xianhang Li
Haoqin Tu
Mude Hui
Zeyu Wang
Bingchen Zhao
...
Jieru Mei
Qing Liu
Huangjie Zheng
Yuyin Zhou
Cihang Xie
VLM
MLLM
41
35
0
12 Jun 2024
PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences
Daiwei Chen
Yi Chen
Aniket Rege
Ramya Korlakai Vinayak
38
17
0
12 Jun 2024
Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
Yuanhao Zhai
Kevin Lin
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Chung-Ching Lin
David Doermann
Junsong Yuan
Lijuan Wang
VGen
DiffM
41
9
0
11 Jun 2024
SF-V: Single Forward Video Generation Model
Zhixing Zhang
Yanyu Li
Yushu Wu
Yanwu Xu
Anil Kag
...
Aliaksandr Siarohin
Junli Cao
Dimitris N. Metaxas
Sergey Tulyakov
Jian Ren
DiffM
VGen
42
9
0
06 Jun 2024
Diffusion Bridge AutoEncoders for Unsupervised Representation Learning
Yeongmin Kim
Kwanghyeon Lee
Minsang Park
Byeonghu Na
Il-Chul Moon
DiffM
44
2
0
27 May 2024
ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling
F. Babiloni
Alexandros Lattas
Jiankang Deng
S. Zafeiriou
DiffM
35
4
0
26 May 2024
Improved Distribution Matching Distillation for Fast Image Synthesis
Tianwei Yin
Michael Gharbi
Taesung Park
Richard Zhang
Eli Shechtman
Frédo Durand
William T. Freeman
DiffM
42
94
0
23 May 2024
Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation
Sangyeop Yeo
Yoojin Jang
Jaejun Yoo
29
1
0
19 May 2024
Distilling Diffusion Models into Conditional GANs
Minguk Kang
Richard Zhang
Connelly Barnes
Sylvain Paris
Suha Kwak
Jaesik Park
Eli Shechtman
Jun-Yan Zhu
Taesung Park
40
36
0
09 May 2024
Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation
Jonas Kohler
Albert Pumarola
Edgar Schönfeld
A. Sanakoyeu
Roshan Sumbaly
Peter Vajda
Ali K. Thabet
27
21
0
08 May 2024
Customizing Text-to-Image Models with a Single Image Pair
Maxwell Jones
Sheng-Yu Wang
Nupur Kumari
David Bau
Jun-Yan Zhu
DiffM
25
19
0
02 May 2024
Lazy Layers to Make Fine-Tuned Diffusion Models More Traceable
Haozhe Liu
Wentian Zhang
Bing Li
Bernard Ghanem
Jürgen Schmidhuber
DiffM
WIGM
AAML
28
1
0
01 May 2024
Synthetic Image Verification in the Era of Generative AI: What Works and What Isn't There Yet
D. Tariang
Riccardo Corvi
D. Cozzolino
Giovanni Poggi
Koki Nagano
L. Verdoliva
48
8
0
30 Apr 2024
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models
Adrien Le Coz
Houssem Ouertatani
Stéphane Herbin
Faouzi Adjed
26
0
0
26 Apr 2024
An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape
Sifat Muhammad Abdullah
Aravind Cheruvu
Shravya Kanchi
Taejoong Chung
Peng Gao
Murtuza Jadliwala
Bimal Viswanath
AAML
29
11
0
24 Apr 2024
Music Consistency Models
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
48
5
0
20 Apr 2024
1
2
3
Next