Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05751
Cited By
v1
v2
v3 (latest)
Image Transformer
15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Image Transformer"
50 / 837 papers shown
Title
Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization
T. Nguyen
Richard G. Baraniuk
Robert M. Kirby
Stanley J. Osher
Bao Wang
127
9
0
01 Aug 2022
DnSwin: Toward Real-World Denoising via Continuous Wavelet Sliding-Transformer
Hao Li
Zhijing Yang
Xiaobin Hong
Ziying Zhao
Junyang Chen
Yukai Shi
Jin-shan Pan
DiffM
ViT
89
12
0
28 Jul 2022
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition
Bohan Li
Ye Yuan
Dingkang Liang
Xiao-Chang Liu
Zhilong Ji
Jinfeng Bai
Wenyu Liu
Xiang Bai
88
50
0
23 Jul 2022
Few-shot Image Generation Using Discrete Content Representation
Y. Hong
Li Niu
Jianfu Zhang
Liqing Zhang
DiffM
84
11
0
22 Jul 2022
Pose for Everything: Towards Category-Agnostic Pose Estimation
Lumin Xu
Sheng Jin
Wang Zeng
Wentao Liu
Chao Qian
Wanli Ouyang
Ping Luo
Xiaogang Wang
71
38
0
21 Jul 2022
Diffsound: Discrete Diffusion Model for Text-to-sound Generation
Dongchao Yang
Jianwei Yu
Helin Wang
Wen Wang
Chao Weng
Yuexian Zou
Dong Yu
DiffM
111
306
0
20 Jul 2022
EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer
Chenyu Yang
W. He
Yingqing Xu
Yang Gao
DiffM
71
27
0
20 Jul 2022
Vision Transformer for NeRF-Based View Synthesis from a Single Input Image
Kai-En Lin
Yen-Chen Lin
Wei-Sheng Lai
Nayeon Lee
Yichang Shih
R. Ramamoorthi
ViT
107
114
0
12 Jul 2022
Attention and Self-Attention in Random Forests
Lev V. Utkin
A. Konstantinov
73
7
0
09 Jul 2022
kMaX-DeepLab: k-means Mask Transformer
Qihang Yu
Huiyu Wang
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
183
19
0
08 Jul 2022
Detecting and Recovering Sequential DeepFake Manipulation
Rui Shao
Tianxing Wu
Ziwei Liu
AAML
112
42
0
05 Jul 2022
Swin Deformable Attention U-Net Transformer (SDAUT) for Explainable Fast MRI
Jiahao Huang
Xiaodan Xing
Zhifan Gao
Guang Yang
ViT
MedIm
67
26
0
05 Jul 2022
Transformer based Models for Unsupervised Anomaly Segmentation in Brain MR Images
Ahmed Ghorbel
Ahmed Aldahdooh
Shadi Albarqouni
Neuherberg
ViT
MedIm
75
6
0
05 Jul 2022
Interaction Transformer for Human Reaction Generation
Baptiste Chopin
Hao Tang
N. Otberdout
Mohamed Daoudi
N. Sebe
ViT
99
27
0
04 Jul 2022
CTrGAN: Cycle Transformers GAN for Gait Transfer
Shahar Mahpod
Noam Gaash
Hay Hoffman
Gil Ben-Artzi
ViT
83
1
0
30 Jun 2022
Neural Inverse Transform Sampler
Henry Li
Y. Kluger
57
4
0
22 Jun 2022
I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose Estimation
Yiwei Ding
W. Deng
Yinglin Zheng
Peng Liu
Meihong Wang
Xuan Cheng
Jianmin Bao
Dong Chen
Ming Zeng
3DH
96
13
0
22 Jun 2022
Generative Modelling With Inverse Heat Dissipation
Severi Rissanen
Markus Heinonen
Arno Solin
DiffM
147
121
0
21 Jun 2022
SAViR-T: Spatially Attentive Visual Reasoning with Transformers
Pritish Sahu
Kalliopi Basioti
Vladimir Pavlovic
LRM
68
16
0
18 Jun 2022
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation
Qihang Yu
Huiyu Wang
Dahun Kim
Siyuan Qiao
Maxwell D. Collins
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
MedIm
124
92
0
17 Jun 2022
VectorMapNet: End-to-end Vectorized HD Map Learning
Yicheng Liu
Tianyuan Yuan
Yue Wang
Yilun Wang
Hang Zhao
157
199
0
17 Jun 2022
OmniMAE: Single Model Masked Pretraining on Images and Videos
Rohit Girdhar
Alaaeldin El-Nouby
Mannat Singh
Kalyan Vasudev Alwala
Armand Joulin
Ishan Misra
ViT
120
99
0
16 Jun 2022
Multi-scale Cooperative Multimodal Transformers for Multimodal Sentiment Analysis in Videos
Lianyang Ma
Yu Yao
Tao Liang
Tongliang Liu
57
4
0
16 Jun 2022
A smile is all you need: Predicting limiting activity coefficients from SMILES with natural language processing
Benedikt Winter
Clemens Winter
J. Schilling
A. Bardow
75
28
0
15 Jun 2022
A Projection-Based K-space Transformer Network for Undersampled Radial MRI Reconstruction with Limited Training Subjects
Chang Gao
Shu-Fu Shih
J. Finn
X. Zhong
MedIm
63
5
0
15 Jun 2022
TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer
Jiajun Deng
Zhengyuan Yang
Daqing Liu
Tianlang Chen
Wen-gang Zhou
Yanyong Zhang
Houqiang Li
Wanli Ouyang
ViT
110
57
0
14 Jun 2022
Unsupervised inter-frame motion correction for whole-body dynamic PET using convolutional long short-term memory in a convolutional neural network
Xue-yuan Guo
Bo Zhou
D. Pigg
Bruce Spottiswoode
M. Casey
Chi Liu
Nicha Dvornek
MedIm
48
18
0
13 Jun 2022
Transformer Lesion Tracker
Wen Tang
Han Kang
Haoyue Zhang
Pengxin Yu
C. Arnold
Rongguo Zhang
MedIm
63
6
0
13 Jun 2022
On Neural Architecture Inductive Biases for Relational Tasks
Giancarlo Kerg
Sarthak Mittal
David Rolnick
Yoshua Bengio
Blake A. Richards
Guillaume Lajoie
OOD
104
25
0
09 Jun 2022
Blind Face Restoration: Benchmark Datasets and a Baseline Model
Puyang Zhang
Kaihao Zhang
Wenhan Luo
Changsheng Li
Guoren Wang
CVBM
74
17
0
08 Jun 2022
Separable Self-attention for Mobile Vision Transformers
Sachin Mehta
Mohammad Rastegari
ViT
MQ
111
267
0
06 Jun 2022
EAANet: Efficient Attention Augmented Convolutional Networks
Runqing Zhang
Tianshu Zhu
34
0
0
03 Jun 2022
Towards Improving the Generation Quality of Autoregressive Slot VAEs
Patrick Emami
Pan He
Sanjay Ranka
Anand Rangarajan
OCL
77
1
0
03 Jun 2022
Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling
Jian Hu
Haowen Zhong
Junchi Yan
S. Gong
Guile Wu
Fei Yang
63
11
0
02 Jun 2022
A Survey on Deep Learning for Skin Lesion Segmentation
Z. Mirikharaji
Kumar Abhishek
Alceu Bissoto
Catarina Barata
Sandra Avila
Eduardo Valle
M. Celebi
Ghassan Hamarneh
128
88
0
01 Jun 2022
PAGER: Progressive Attribute-Guided Extendable Robust Image Generation
Zohreh Azizi
C.-C. Jay Kuo
VLM
DiffM
GAN
89
9
0
01 Jun 2022
A review of machine learning approaches, challenges and prospects for computational tumor pathology
Liangrui Pan
Zhichao Feng
Shaoliang Peng
AI4CE
76
7
0
31 May 2022
Chefs' Random Tables: Non-Trigonometric Random Features
Valerii Likhosherstov
K. Choromanski
Kumar Avinava Dubey
Frederick Liu
Tamás Sarlós
Adrian Weller
92
18
0
30 May 2022
Group-level Brain Decoding with Deep Learning
Richard Csaky
M. Es
Oiwi Parker Jones
M. Woolrich
53
12
0
27 May 2022
Maximum Likelihood Training of Implicit Nonlinear Diffusion Models
Dongjun Kim
Byeonghu Na
S. Kwon
Dongsoo Lee
Wanmo Kang
Il-Chul Moon
DiffM
314
53
0
27 May 2022
Training and Inference on Any-Order Autoregressive Models the Right Way
Andy Shih
Dorsa Sadigh
Stefano Ermon
BDL
TPM
OOD
CML
98
28
0
26 May 2022
Prompt-based Learning for Unpaired Image Captioning
Peipei Zhu
Tianlin Li
Lin Zhu
Zhenglong Sun
Weishi Zheng
Yaowei Wang
Chen Chen
VLM
97
33
0
26 May 2022
ASSET: Autoregressive Semantic Scene Editing with Transformers at High Resolutions
Difan Liu
Sandesh Shetty
Tobias Hinz
Matthew Fisher
Richard Y. Zhang
Taesung Park
E. Kalogerakis
ViT
76
32
0
24 May 2022
Combining Contrastive and Supervised Learning for Video Super-Resolution Detection
Viacheslav Meshchaninov
Ivan Molodetskikh
D. Vatolin
56
0
0
20 May 2022
Transformer with Memory Replay
R. Liu
Barzan Mozafari
OffRL
105
4
0
19 May 2022
BodyMap: Learning Full-Body Dense Correspondence Map
A. Ianina
N. Sarafianos
Yuanlu Xu
Ignacio Rocco
Tony Tung
3DH
66
14
0
18 May 2022
Pluralistic Image Completion with Probabilistic Mixture-of-Experts
Xiaobo Xia
Wenhao Yang
Jieyi Ren
Yewen Li
Yibing Zhan
Bo Han
Tongliang Liu
MoE
60
3
0
18 May 2022
MulT: An End-to-End Multitask Learning Transformer
Deblina Bhattacharjee
Tong Zhang
Sabine Süsstrunk
Mathieu Salzmann
ViT
116
68
0
17 May 2022
BBDM: Image-to-image Translation with Brownian Bridge Diffusion Models
Bo Li
Kaitao Xue
Bin Liu
Yunyu Lai
DiffM
207
147
0
16 May 2022
Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation
Gerard Sant
Gerard I. Gállego
Belen Alastruey
Marta R. Costa-jussá
59
4
0
14 May 2022
Previous
1
2
3
...
6
7
8
...
15
16
17
Next