Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.10762
Cited By
v1
v2 (latest)
StyleSwin: Transformer-based GAN for High-resolution Image Generation
20 December 2021
Bo Zhang
Shuyang Gu
Bo Zhang
Jianmin Bao
Dong Chen
Fang Wen
Yong Wang
B. Guo
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Github (532★)
Papers citing
"StyleSwin: Transformer-based GAN for High-resolution Image Generation"
50 / 69 papers shown
Title
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models
Runze He
Bo Cheng
Yuhang Ma
Qingxiang Jia
Shanyuan Liu
Ao Ma
Xiaoyu Wu
Liebucha Wu
Dawei Leng
Yuhui Yin
DiffM
VLM
164
0
0
13 Mar 2025
WaveDH: Wavelet Sub-bands Guided ConvNet for Efficient Image Dehazing
Seongmin Hwang
Daeyoung Han
Cheolkon Jung
Moongu Jeon
120
5
0
20 Jan 2025
Model Synthesis for Zero-Shot Model Attribution
Tianyun Yang
Juan Cao
Danding Wang
Chang Xu
WIGM
109
4
0
20 Jan 2025
Global-Local Distillation Network-Based Audio-Visual Speaker Tracking with Incomplete Modalities
Yidi Li
Yihan Li
Yixin Guo
Bin Ren
Zhenhuan Xu
Hao Guo
Hong Liu
N. Sebe
132
0
0
26 Aug 2024
Towards Semantic Equivalence of Tokenization in Multimodal LLM
Shengqiong Wu
Hao Fei
Xiangtai Li
Jiayi Ji
Hanwang Zhang
Tat-Seng Chua
Shuicheng Yan
MLLM
119
34
0
07 Jun 2024
Glauber Generative Model: Discrete Diffusion Models via Binary Classification
Harshit Varma
Dheeraj M. Nagaraj
Karthikeyan Shanmugam
VLM
143
3
0
27 May 2024
Generative Adversarial Networks
Gilad Cohen
Raja Giryes
GAN
280
30,103
0
01 Mar 2022
Can Vision Transformers Perform Convolution?
Shanda Li
Xiangning Chen
Di He
Cho-Jui Hsieh
ViT
73
21
0
02 Nov 2021
Toward Spatially Unbiased Generative Models
Jooyoung Choi
Jungbeom Lee
Yonghyun Jeong
Sungroh Yoon
DiffM
69
17
0
03 Aug 2021
ViTGAN: Training GANs with Vision Transformers
Kwonjoon Lee
Huiwen Chang
Lu Jiang
Han Zhang
Zhuowen Tu
Ce Liu
ViT
68
186
0
09 Jul 2021
CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows
Xiaoyi Dong
Jianmin Bao
Dongdong Chen
Weiming Zhang
Nenghai Yu
Lu Yuan
Dong Chen
B. Guo
ViT
150
982
0
01 Jul 2021
Alias-Free Generative Adversarial Networks
Tero Karras
M. Aittala
S. Laine
Erik Härkönen
Janne Hellsten
J. Lehtinen
Timo Aila
GAN
177
1,596
0
23 Jun 2021
Improved Transformer for High-Resolution GANs
Long Zhao
Zizhao Zhang
Ting Chen
Dimitris N. Metaxas
Han Zhang
ViT
72
96
0
14 Jun 2021
CoAtNet: Marrying Convolution and Attention for All Data Sizes
Zihang Dai
Hanxiao Liu
Quoc V. Le
Mingxing Tan
ViT
123
1,207
0
09 Jun 2021
CvT: Introducing Convolutions to Vision Transformers
Haiping Wu
Bin Xiao
Noel Codella
Mengchen Liu
Xiyang Dai
Lu Yuan
Lei Zhang
ViT
152
1,910
0
29 Mar 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
Ze Liu
Yutong Lin
Yue Cao
Han Hu
Yixuan Wei
Zheng Zhang
Stephen Lin
B. Guo
ViT
453
21,439
0
25 Mar 2021
Scaling Local Self-Attention for Parameter Efficient Visual Backbones
Ashish Vaswani
Prajit Ramachandran
A. Srinivas
Niki Parmar
Blake A. Hechtman
Jonathon Shlens
92
400
0
23 Mar 2021
Transformer in Transformer
Kai Han
An Xiao
Enhua Wu
Jianyuan Guo
Chunjing Xu
Yunhe Wang
ViT
391
1,565
0
27 Feb 2021
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
530
3,724
0
24 Feb 2021
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Li-xin Yuan
Yunpeng Chen
Tao Wang
Weihao Yu
Yujun Shi
Zihang Jiang
Francis E. H. Tay
Jiashi Feng
Shuicheng Yan
ViT
133
1,939
0
28 Jan 2021
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
Fahad Shahbaz Khan
M. Shah
ViT
305
2,516
0
04 Jan 2021
Training data-efficient image transformers & distillation through attention
Hugo Touvron
Matthieu Cord
Matthijs Douze
Francisco Massa
Alexandre Sablayrolles
Hervé Jégou
ViT
387
6,768
0
23 Dec 2020
Taming Transformers for High-Resolution Image Synthesis
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
129
2,962
0
17 Dec 2020
Positional Encoding as Spatial Inductive Bias in GANs
Rui Xu
Xintao Wang
Kai-xiang Chen
Bolei Zhou
Chen Change Loy
GAN
93
89
0
09 Dec 2020
Image Generators with Conditionally-Independent Pixel Synthesis
Ivan Anokhin
K. Demochkin
Taras Khakhulin
Gleb Sterkin
Victor Lempitsky
Denis Korzhenkov
73
158
0
27 Nov 2020
Adversarial Generation of Continuous Images
Ivan Skorokhodov
Savva Ignatyev
Mohamed Elhoseiny
GAN
74
174
0
24 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
657
41,103
0
22 Oct 2020
Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications
Xuan Li
Xun Huang
Jiahui Yu
Ting-Chun Wang
Arun Mallya
GAN
98
155
0
06 Aug 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
648
18,096
0
19 Jun 2020
Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains
Matthew Tancik
Pratul P. Srinivasan
B. Mildenhall
Sara Fridovich-Keil
N. Raghavan
Utkarsh Singhal
R. Ramamoorthi
Jonathan T. Barron
Ren Ng
124
2,421
0
18 Jun 2020
Differentiable Augmentation for Data-Efficient GAN Training
Shengyu Zhao
Zhijian Liu
Ji Lin
Jun-Yan Zhu
Song Han
91
608
0
18 Jun 2020
What makes instance discrimination good for transfer learning?
Nanxuan Zhao
Zhirong Wu
Rynson W. H. Lau
Stephen Lin
SSL
71
170
0
11 Jun 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
811
42,055
0
28 May 2020
GIQA: Generated Image Quality Assessment
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
EGVM
202
82
0
19 Mar 2020
On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location
O. Kayhan
Jan van Gemert
302
236
0
16 Mar 2020
A U-Net Based Discriminator for Generative Adversarial Networks
Edgar Schönfeld
Bernt Schiele
Anna Khoreva
GAN
76
295
0
28 Feb 2020
Decision-Making with Auto-Encoding Variational Bayes
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
422
10,591
0
17 Feb 2020
Improved Consistency Regularization for GANs
Zhengli Zhao
Sameer Singh
Honglak Lee
Zizhao Zhang
Augustus Odena
Han Zhang
58
153
0
11 Feb 2020
How Much Position Information Do Convolutional Neural Networks Encode?
Md. Amirul Islam
Sen Jia
Neil D. B. Bruce
SSL
244
348
0
22 Jan 2020
Analyzing and Improving the Image Quality of StyleGAN
Tero Karras
S. Laine
M. Aittala
Janne Hellsten
J. Lehtinen
Timo Aila
GAN
301
5,815
0
03 Dec 2019
Effectively Unbiased FID and Inception Score and where to find them
Min Jin Chong
David A. Forsyth
EGVM
85
203
0
16 Nov 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
204
12,085
0
13 Nov 2019
On the Relationship between Self-Attention and Convolutional Layers
Jean-Baptiste Cordonnier
Andreas Loukas
Martin Jaggi
112
534
0
08 Nov 2019
Root Mean Square Layer Normalization
Biao Zhang
Rico Sennrich
91
740
0
16 Oct 2019
Local Relation Networks for Image Recognition
Han Hu
Zheng Zhang
Zhenda Xie
Stephen Lin
FAtt
85
501
0
25 Apr 2019
COCO-GAN: Generation by Parts via Conditional Coordinating
Chieh Hubert Lin
Chia-Che Chang
Yu-Sheng Chen
Da-Cheng Juan
Wei Wei
Hwann-Tzong Chen
62
135
0
30 Mar 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
589
10,561
0
12 Dec 2018
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Andrew Brock
Jeff Donahue
Karen Simonyan
262
5,394
0
28 Sep 2018
Glow: Generative Flow with Invertible 1x1 Convolutions
Diederik P. Kingma
Prafulla Dhariwal
BDL
DRL
295
3,134
0
09 Jul 2018
The relativistic discriminator: a key element missing from standard GAN
Alexia Jolicoeur-Martineau
GAN
60
974
0
02 Jul 2018
1
2
Next