Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05751
Cited By
v1
v2
v3 (latest)
Image Transformer
15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Image Transformer"
50 / 837 papers shown
Title
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu
Weijie Su
Lewei Lu
Bin Li
Xiaogang Wang
Jifeng Dai
ViT
383
5,146
0
08 Oct 2020
Rethinking Attention with Performers
K. Choromanski
Valerii Likhosherstov
David Dohan
Xingyou Song
Andreea Gane
...
Afroz Mohiuddin
Lukasz Kaiser
David Belanger
Lucy J. Colwell
Adrian Weller
202
1,605
0
30 Sep 2020
Attention that does not Explain Away
Nan Ding
Xinjie Fan
Zhenzhong Lan
Dale Schuurmans
Radu Soricut
54
3
0
29 Sep 2020
Knowledge Fusion Transformers for Video Action Recognition
Ganesh Samarth
Sheetal Ojha
Nikhil Pareek
ViT
59
1
0
29 Sep 2020
DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement
S. Iizuka
E. Simo-Serra
153
39
0
18 Sep 2020
Efficient Transformers: A Survey
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
250
1,137
0
14 Sep 2020
SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition
Liangzhi Li
Bowen Wang
Manisha Verma
Yuta Nakashima
R. Kawasaki
Hajime Nagahara
OCL
90
51
0
14 Sep 2020
SPAN: Spatial Pyramid Attention Network forImage Manipulation Localization
Xuefeng Hu
Zhihan Zhang
Zhenye Jiang
Syomantak Chaudhuri
Zhenheng Yang
Ram Nevatia
3DPC
68
199
0
01 Sep 2020
Langevin Cooling for Domain Translation
Vignesh Srinivasan
Klaus-Robert Muller
Wojciech Samek
Shinichi Nakajima
74
1
0
31 Aug 2020
HittER: Hierarchical Transformers for Knowledge Graph Embeddings
Sanxing Chen
Xiaodong Liu
Jianfeng Gao
Jian Jiao
Ruofei Zhang
Yangfeng Ji
93
113
0
28 Aug 2020
Skeleton-based Action Recognition via Spatial and Temporal Transformer Networks
Chiara Plizzari
Marco Cannici
Matteo Matteucci
ViT
MedIm
84
312
0
17 Aug 2020
The Chess Transformer: Mastering Play using Generative Language Models
David Noever
Matt Ciolino
Josh Kalin
116
38
0
02 Aug 2020
HATNet: An End-to-End Holistic Attention Network for Diagnosis of Breast Biopsy Images
Sachin Mehta
Ximing Lu
D. Weaver
J. Elmore
Hannaneh Hajishirzi
Linda G. Shapiro
43
5
0
25 Jul 2020
Conformer-Kernel with Query Term Independence for Document Retrieval
Bhaskar Mitra
Sebastian Hofstatter
Hamed Zamani
Nick Craswell
61
21
0
20 Jul 2020
Kernelized Memory Network for Video Object Segmentation
Hongje Seong
Junhyuk Hyun
Euntai Kim
VOS
75
197
0
16 Jul 2020
Autoregressive Unsupervised Image Segmentation
Yassine Ouali
C´eline Hudelot
Myriam Tami
SSL
99
87
0
16 Jul 2020
Attention as Activation
Yimian Dai
Stefan Oehmcke
Fabian Gieseke
Yiquan Wu
Kobus Barnard
52
9
0
15 Jul 2020
Can neural networks acquire a structural bias from raw linguistic data?
Alex Warstadt
Samuel R. Bowman
AI4CE
62
54
0
14 Jul 2020
NVAE: A Deep Hierarchical Variational Autoencoder
Arash Vahdat
Jan Kautz
BDL
152
919
0
08 Jul 2020
Meta-Learning Stationary Stochastic Process Prediction with Convolutional Neural Processes
Andrew Y. K. Foong
W. Bruinsma
Jonathan Gordon
Yann Dubois
James Requeima
Richard Turner
BDL
93
78
0
02 Jul 2020
Software Engineering Event Modeling using Relative Time in Temporal Knowledge Graphs
Kian Ahrabian
Daniel Tarlow
Hehuimin Cheng
Jin L. C. Guo
60
3
0
02 Jul 2020
Sliced Iterative Normalizing Flows
B. Dai
U. Seljak
89
37
0
01 Jul 2020
Data Movement Is All You Need: A Case Study on Optimizing Transformers
A. Ivanov
Nikoli Dryden
Tal Ben-Nun
Shigang Li
Torsten Hoefler
147
135
0
30 Jun 2020
Tomographic Auto-Encoder: Unsupervised Bayesian Recovery of Corrupted Data
F. Tonolini
Pablo G. Moreno
Andreas C. Damianou
Roderick Murray-Smith
40
1
0
30 Jun 2020
Direct Feedback Alignment Scales to Modern Deep Learning Tasks and Architectures
Julien Launay
Iacopo Poli
Franccois Boniface
Florent Krzakala
127
64
0
23 Jun 2020
Locally Masked Convolution for Autoregressive Models
Ajay Jain
Pieter Abbeel
Deepak Pathak
DiffM
OffRL
120
32
0
22 Jun 2020
Sparse GPU Kernels for Deep Learning
Trevor Gale
Matei A. Zaharia
C. Young
Erich Elsen
99
234
0
18 Jun 2020
MoFlow: An Invertible Flow Model for Generating Molecular Graphs
Chengxi Zang
Fei Wang
BDL
156
297
0
17 Jun 2020
Density of States Estimation for Out-of-Distribution Detection
Warren Morningstar
Cusuh Ham
Andrew Gallagher
Balaji Lakshminarayanan
Alexander A. Alemi
Joshua V. Dillon
OODD
114
85
0
16 Jun 2020
NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity
Sang-gil Lee
Sungwon Kim
Sungroh Yoon
77
17
0
11 Jun 2020
A Survey on Generative Adversarial Networks: Variants, Applications, and Training
Abdul Jabbar
Xi Li
Bourahla Omar
103
276
0
09 Jun 2020
Visual Transformers: Token-based Image Representation and Processing for Computer Vision
Bichen Wu
Chenfeng Xu
Xiaoliang Dai
Alvin Wan
Peizhao Zhang
Zhicheng Yan
Masayoshi Tomizuka
Joseph E. Gonzalez
Kurt Keutzer
Peter Vajda
ViT
111
565
0
05 Jun 2020
An Overview of Neural Network Compression
James OÑeill
AI4CE
160
100
0
05 Jun 2020
Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers
K. Choromanski
Valerii Likhosherstov
David Dohan
Xingyou Song
Andreea Gane
...
Peter Hawkins
Jared Davis
David Belanger
Lucy J. Colwell
Adrian Weller
100
86
0
05 Jun 2020
GMAT: Global Memory Augmentation for Transformers
Ankit Gupta
Jonathan Berant
RALM
81
50
0
05 Jun 2020
milliEgo: Single-chip mmWave Radar Aided Egomotion Estimation via Deep Sensor Fusion
Chris Xiaoxuan Lu
Muhamad Risqi U. Saputra
Peijun Zhao
Yasin Almalioglu
Pedro Porto Buarque de Gusmão
Changhao Chen
Ke Sun
A. Trigoni
Andrew Markham
64
5
0
03 Jun 2020
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
530
13,239
0
26 May 2020
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis
Rafael Valle
Kevin J. Shih
R. Prenger
Bryan Catanzaro
96
121
0
12 May 2020
Attentional Bottleneck: Towards an Interpretable Deep Driving Network
Jinkyu Kim
Mayank Bansal
96
13
0
08 May 2020
Multi-scale Transformer Language Models
Sandeep Subramanian
R. Collobert
MarcÁurelio Ranzato
Y-Lan Boureau
63
13
0
01 May 2020
Progressive Transformers for End-to-End Sign Language Production
Ben Saunders
Necati Cihan Camgöz
Richard Bowden
SLR
79
135
0
30 Apr 2020
Exploring Self-attention for Image Recognition
Hengshuang Zhao
Jiaya Jia
V. Koltun
SSL
100
792
0
28 Apr 2020
A Spatio-temporal Transformer for 3D Human Motion Prediction
Emre Aksan
Manuel Kaufmann
Peng Cao
Otmar Hilliges
ViT
97
231
0
18 Apr 2020
Understanding the Difficulty of Training Transformers
Liyuan Liu
Xiaodong Liu
Jianfeng Gao
Weizhu Chen
Jiawei Han
AI4CE
96
259
0
17 Apr 2020
Highway Transformer: Self-Gating Enhanced Self-Attentive Networks
Yekun Chai
Jin Shuo
Xinwen Hou
48
17
0
17 Apr 2020
Spatially-Attentive Patch-Hierarchical Network for Adaptive Motion Deblurring
Maitreya Suin
Kuldeep Purohit
A. N. Rajagopalan
3DV
78
283
0
11 Apr 2020
Telling BERT's full story: from Local Attention to Global Aggregation
Damian Pascual
Gino Brunner
Roger Wattenhofer
57
19
0
10 Apr 2020
Normalizing Flows with Multi-Scale Autoregressive Priors
Shweta Mahajan
Apratim Bhattacharyya
Mario Fritz
Bernt Schiele
Stefan Roth
BDL
DRL
53
17
0
08 Apr 2020
Variational Transformers for Diverse Response Generation
Zhaojiang Lin
Genta Indra Winata
Peng Xu
Zihan Liu
Pascale Fung
DRL
78
51
0
28 Mar 2020
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
73
182
0
28 Mar 2020
Previous
1
2
3
...
14
15
16
17
Next