1802.05751
Cited By
Image Transformer
15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
ViT
Papers citing "Image Transformer"
50 / 356 papers shown
Title
Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization
T. Nguyen
Richard G. Baraniuk
Robert M. Kirby
Stanley J. Osher
Bao Wang
32
9
0
01 Aug 2022
DnSwin: Toward Real-World Denoising via Continuous Wavelet Sliding-Transformer
Hao Li
Zhijing Yang
Xiaobin Hong
Ziying Zhao
Junyang Chen
Yukai Shi
Jin-shan Pan
DiffM
ViT
43
11
0
28 Jul 2022
EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer
Chenyu Yang
W. He
Yingqing Xu
Yang Gao
DiffM
19
26
0
20 Jul 2022
Vision Transformer for NeRF-Based View Synthesis from a Single Input Image
Kai-En Lin
Yen-Chen Lin
Wei-Sheng Lai
Nayeon Lee
Yichang Shih
R. Ramamoorthi
ViT
27
112
0
12 Jul 2022
Interaction Transformer for Human Reaction Generation
Baptiste Chopin
Hao Tang
N. Otberdout
Mohamed Daoudi
N. Sebe
ViT
38
27
0
04 Jul 2022
CTrGAN: Cycle Transformers GAN for Gait Transfer
Shahar Mahpod
Noam Gaash
Hay Hoffman
Gil Ben-Artzi
ViT
28
1
0
30 Jun 2022
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Qihang Yu
Huiyu Wang
Dahun Kim
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
MedIm
32
90
0
17 Jun 2022
VectorMapNet: End-to-end Vectorized HD Map Learning
Yicheng Liu
Tianyuan Yuan
Yue Wang
Yilun Wang
Hang Zhao
23
188
0
17 Jun 2022
OmniMAE: Single Model Masked Pretraining on Images and Videos
Rohit Girdhar
Alaaeldin El-Nouby
Mannat Singh
Kalyan Vasudev Alwala
Armand Joulin
Ishan Misra
ViT
37
97
0
16 Jun 2022
A smile is all you need: Predicting limiting activity coefficients from SMILES with natural language processing
Benedikt Winter
Clemens Winter
J. Schilling
A. Bardow
33
28
0
15 Jun 2022
Transformer Lesion Tracker
Wen Tang
Han Kang
Haoyue Zhang
Pengxin Yu
C. Arnold
Rongguo Zhang
MedIm
27
6
0
13 Jun 2022
Separable Self-attention for Mobile Vision Transformers
Sachin Mehta
Mohammad Rastegari
ViT
MQ
26
252
0
06 Jun 2022
Towards Improving the Generation Quality of Autoregressive Slot VAEs
Patrick Emami
Pan He
Sanjay Ranka
Anand Rangarajan
OCL
38
1
0
03 Jun 2022
A Survey on Deep Learning for Skin Lesion Segmentation
Z. Mirikharaji
Kumar Abhishek
Alceu Bissoto
Catarina Barata
Sandra Avila
Eduardo Valle
M. Celebi
Ghassan Hamarneh
44
82
0
01 Jun 2022
PAGER: Progressive Attribute-Guided Extendable Robust Image Generation
Zohreh Azizi
C.-C. Jay Kuo
VLM
DiffM
GAN
35
8
0
01 Jun 2022
A review of machine learning approaches, challenges and prospects for computational tumor pathology
Liangrui Pan
Zhichao Feng
Shaoliang Peng
AI4CE
27
7
0
31 May 2022
Chefs' Random Tables: Non-Trigonometric Random Features
Valerii Likhosherstov
K. Choromanski
Kumar Avinava Dubey
Frederick Liu
Tamás Sarlós
Adrian Weller
33
17
0
30 May 2022
Maximum Likelihood Training of Implicit Nonlinear Diffusion Models
Dongjun Kim
Byeonghu Na
S. Kwon
Dongsoo Lee
Wanmo Kang
Il-Chul Moon
DiffM
213
52
0
27 May 2022
Training and Inference on Any-Order Autoregressive Models the Right Way
Andy Shih
Dorsa Sadigh
Stefano Ermon
BDL
TPM
OOD
CML
35
23
0
26 May 2022
Prompt-based Learning for Unpaired Image Captioning
Peipei Zhu
Tianlin Li
Lin Zhu
Zhenglong Sun
Weishi Zheng
Yaowei Wang
Chia-Ju Chen
VLM
27
31
0
26 May 2022
ASSET: Autoregressive Semantic Scene Editing with Transformers at High Resolutions
Difan Liu
Sandesh Shetty
Tobias Hinz
Matthew Fisher
Richard Y. Zhang
Taesung Park
E. Kalogerakis
ViT
27
30
0
24 May 2022
BodyMap: Learning Full-Body Dense Correspondence Map
A. Ianina
N. Sarafianos
Yuanlu Xu
Ignacio Rocco
Tony Tung
3DH
30
14
0
18 May 2022
MulT: An End-to-End Multitask Learning Transformer
Deblina Bhattacharjee
Tong Zhang
Sabine Süsstrunk
Mathieu Salzmann
ViT
42
63
0
17 May 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
44
149
0
27 Apr 2022
ClusterGNN: Cluster-based Coarse-to-Fine Graph Neural Network for Efficient Feature Matching
Yanxing Shi
Junxiong Cai
Yoli Shavit
Tai-Jiang Mu
Wensen Feng
Kai Zhang
GNN
27
77
0
25 Apr 2022
Efficient Linear Attention for Fast and Accurate Keypoint Matching
Suwichaya Suwanwimolkul
S. Komorita
3DPC
3DV
22
11
0
16 Apr 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
26
6
0
11 Apr 2022
Stripformer: Strip Transformer for Fast Image Deblurring
Fu-Jen Tsai
Yan-Tsung Peng
Yen-Yu Lin
Chung-Chi Tsai
Chia-Wen Lin
ViT
21
171
0
10 Apr 2022
Towards An End-to-End Framework for Flow-Guided Video Inpainting
Zerui Li
Cheng Lu
Jia Qin
Chunle Guo
Ming-Ming Cheng
41
149
0
06 Apr 2022
FoV-Net: Field-of-View Extrapolation Using Self-Attention and Uncertainty
Liqian Ma
Stamatios Georgoulis
Xu Jia
Luc Van Gool
32
6
0
04 Apr 2022
Unitail: Detecting, Reading, and Matching in Retail Scene
Fangyi Chen
Han Zhang
Zaiwang Li
Jiachen Dou
Shentong Mo
Hao Chen
Yongxin Zhang
Uzair Ahmed
Chenchen Zhu
Marios Savvides
30
9
0
01 Apr 2022
Domain Invariant Siamese Attention Mask for Small Object Change Detection via Everyday Indoor Robot Navigation
Koji Takeda
Kanji Tanaka
Yoshimasa Nakamura
3DPC
25
3
0
29 Mar 2022
A General Survey on Attention Mechanisms in Deep Learning
Gianni Brauwers
Flavius Frasincar
31
296
0
27 Mar 2022
A World-Self Model Towards Understanding Intelligence
Yutao Yue
32
2
0
25 Mar 2022
Efficient-VDVAE: Less is more
Louay Hazami
Rayhane Mama
Ragavan Thurairatnam
BDL
26
28
0
25 Mar 2022
High-Performance Transformer Tracking
Xin Chen
B. Yan
Jiawen Zhu
Huchuan Lu
Xiang Ruan
D. Wang
ViT
23
33
0
25 Mar 2022
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni
Adam Polyak
Oron Ashual
Shelly Sheynin
Devi Parikh
Yaniv Taigman
DiffM
19
511
0
24 Mar 2022
Linearizing Transformer with Key-Value Memory
Yizhe Zhang
Deng Cai
22
5
0
23 Mar 2022
ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention
Yang Liu
Jiaxiang Liu
L. Chen
Yuxiang Lu
Shi Feng
Zhida Feng
Yu Sun
Hao Tian
Huancheng Wu
Hai-feng Wang
31
9
0
23 Mar 2022
TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
Jie Chen
Tianlang He
Weipeng Zhuo
Li Ma
Sangtae Ha
Shueng-Han Gary Chan
CVBM
21
24
0
20 Mar 2022
ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers
Jonáš Kulhánek
Erik Derner
Torsten Sattler
Robert Babuška
ViT
30
73
0
18 Mar 2022
Image Super-Resolution With Deep Variational Autoencoders
Darius Chira
Ilian Haralampiev
Ole Winther
Andrea Dittadi
Valentin Liévin
DRL
35
32
0
17 Mar 2022
Semantic-aligned Fusion Transformer for One-shot Object Detection
Yizhou Zhao
Xun Guo
Yan Lu
ViT
ObjD
22
21
0
17 Mar 2022
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding
Hanrong Ye
Dan Xu
ViT
21
84
0
15 Mar 2022
Deep Transformers Thirst for Comprehensive-Frequency Data
R. Xia
Chao Xue
Boyu Deng
Fang Wang
Jingchao Wang
ViT
25
0
0
14 Mar 2022
The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy
Tianlong Chen
Zhenyu Zhang
Yu Cheng
Ahmed Hassan Awadallah
Zhangyang Wang
ViT
41
37
0
12 Mar 2022
Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice
Peihao Wang
Wenqing Zheng
Tianlong Chen
Zhangyang Wang
ViT
33
127
0
09 Mar 2022
Signature and Log-signature for the Study of Empirical Distributions Generated with GANs
J. Curtò
I. D. Zarzà
Hong-Mei Yan
Carlos T. Calafate
23
1
0
07 Mar 2022
PanFormer: a Transformer Based Model for Pan-sharpening
Huanyu Zhou
Qingjie Liu
Yunhong Wang
ViT
20
42
0
06 Mar 2022
Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression
A. B. Koyuncu
Han Gao
Atanas Boev
Georgii Gaikov
Elena Alshina
Eckehard Steinbach
ViT
39
68
0
04 Mar 2022