Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.05751
Cited By
v1
v2
v3 (latest)
Image Transformer
15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
ViT
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Image Transformer"
50 / 837 papers shown
Title
Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity
Byungseok Roh
Jaewoong Shin
Wuhyun Shin
Saehoon Kim
ViT
59
150
0
29 Nov 2021
A model of semantic completion in generative episodic memory
Zahra Fayyaz
Aya Altamimi
Sen Cheng
Laurenz Wiskott
55
22
0
26 Nov 2021
Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes
Sam Bond-Taylor
P. Hessey
Hiroshi Sasaki
T. Breckon
Chris G. Willcocks
DiffM
126
72
0
24 Nov 2021
Octree Transformer: Autoregressive 3D Shape Generation on Hierarchically Structured Sequences
Moritz Ibing
Gregor Kobsik
Leif Kobbelt
95
37
0
24 Nov 2021
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
Chenfei Wu
Jian Liang
Lei Ji
Fan Yang
Yuejian Fang
Daxin Jiang
Nan Duan
ViT
VGen
88
296
0
24 Nov 2021
Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers
John Guibas
Morteza Mardani
Zong-Yi Li
Andrew Tao
Anima Anandkumar
Bryan Catanzaro
118
246
0
24 Nov 2021
Multi-Person 3D Motion Prediction with Multi-Range Transformers
Jiashun Wang
Huazhe Xu
Medhini Narasimhan
Xiaolong Wang
ViT
112
76
0
23 Nov 2021
Monocular Road Planar Parallax Estimation
Haobo Yuan
Teng Chen
Wei Sui
Jiafeng Xie
Lefei Zhang
Yuan Li
Qian Zhang
54
4
0
22 Nov 2021
CGX: Adaptive System Support for Communication-Efficient Deep Learning
I. Markov
Hamidreza Ramezanikebrya
Dan Alistarh
GNN
82
5
0
16 Nov 2021
Local Multi-Head Channel Self-Attention for Facial Expression Recognition
Roberto Pecoraro
Valerio Basile
Viviana Bono
Sara Gallo
ViT
139
52
0
14 Nov 2021
A Survey of Visual Transformers
Yang Liu
Yao Zhang
Yixin Wang
Feng Hou
Jin Yuan
Jiang Tian
Yang Zhang
Zhongchao Shi
Jianping Fan
Zhiqiang He
3DGS
ViT
207
356
0
11 Nov 2021
Palette: Image-to-Image Diffusion Models
Chitwan Saharia
William Chan
Huiwen Chang
Chris A. Lee
Jonathan Ho
Tim Salimans
David J. Fleet
Mohammad Norouzi
DiffM
VLM
564
1,669
0
10 Nov 2021
Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
Yanhong Zeng
Huan Yang
Hongyang Chao
Jianbo Wang
Jianlong Fu
ViT
161
26
0
05 Nov 2021
Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods
Desi R. Ivanova
Adam Foster
Steven Kleinegesse
Michael U. Gutmann
Tom Rainforth
OffRL
132
48
0
03 Nov 2021
Delayed Propagation Transformer: A Universal Computation Engine towards Practical Control in Cyber-Physical Systems
Wenqing Zheng
Qiangqiang Guo
H. Yang
Peihao Wang
Zhangyang Wang
AI4CE
46
12
0
29 Oct 2021
Resampling Base Distributions of Normalizing Flows
Vincent Stimper
Bernhard Schölkopf
José Miguel Hernández-Lobato
BDL
96
33
0
29 Oct 2021
Scatterbrain: Unifying Sparse and Low-rank Attention Approximation
Beidi Chen
Tri Dao
Eric Winsor
Zhao Song
Atri Rudra
Christopher Ré
88
134
0
28 Oct 2021
FacTeR-Check: Semi-automated fact-checking through Semantic Similarity and Natural Language Inference
Alejandro Martín
Javier Huertas-Tato
Álvaro Huertas-García
Guillermo Villar-Rodríguez
David Camacho
HILM
122
31
0
27 Oct 2021
Hierarchical Transformers Are More Efficient Language Models
Piotr Nawrot
Szymon Tworkowski
Michał Tyrolski
Lukasz Kaiser
Yuhuai Wu
Christian Szegedy
Henryk Michalewski
96
69
0
26 Oct 2021
DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction
Hao Feng
Yuechen Wang
Wen-gang Zhou
Jiajun Deng
Houqiang Li
ViT
104
60
0
25 Oct 2021
Transformer Acceleration with Dynamic Sparse Attention
Liu Liu
Zheng Qu
Zhaodong Chen
Yufei Ding
Yuan Xie
74
22
0
21 Oct 2021
PixelPyramids: Exact Inference Models from Lossless Image Pyramids
Shweta Mahajan
Stefan Roth
TPM
56
2
0
17 Oct 2021
Improving Transformers with Probabilistic Attention Keys
Tam Nguyen
T. Nguyen
Dung D. Le
Duy Khuong Nguyen
Viet-Anh Tran
Richard G. Baraniuk
Nhat Ho
Stanley J. Osher
129
33
0
16 Oct 2021
On Learning the Transformer Kernel
Sankalan Pal Chowdhury
Adamos Solomou
Kumar Avinava Dubey
Mrinmaya Sachan
ViT
131
14
0
15 Oct 2021
How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies
Bao Wang
Hedi Xia
T. Nguyen
Stanley Osher
AI4CE
109
10
0
13 Oct 2021
Leveraging Transformers for StarCraft Macromanagement Prediction
Muhammad Junaid Khan
Shah Hassan
G. Sukthankar
31
5
0
11 Oct 2021
Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning
Chongjian Ge
Youwei Liang
Yibing Song
Jianbo Jiao
Jue Wang
Ping Luo
ViT
74
35
0
11 Oct 2021
The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation
Guillem Brasó
Nikita Kister
Laura Leal-Taixé
3DPC
88
40
0
11 Oct 2021
Vector-quantized Image Modeling with Improved VQGAN
Jiahui Yu
Xin Li
Jing Yu Koh
Han Zhang
Ruoming Pang
James Qin
Alexander Ku
Yuanzhong Xu
Jason Baldridge
Yonghui Wu
ViT
VLM
DRL
205
527
0
09 Oct 2021
Adversarial Token Attacks on Vision Transformers
Ameya Joshi
Gauri Jagatap
Chinmay Hegde
ViT
104
19
0
08 Oct 2021
Token Pooling in Vision Transformers
D. Marin
Jen-Hao Rick Chang
Anurag Ranjan
Anish K. Prabhu
Mohammad Rastegari
Oncel Tuzel
ViT
146
71
0
08 Oct 2021
Design Strategy Network: A deep hierarchical framework to represent generative design strategies in complex action spaces
Ayush Raina
Jonathan Cagan
Christopher McComb
AI4CE
71
13
0
07 Oct 2021
ATISS: Autoregressive Transformers for Indoor Scene Synthesis
Despoina Paschalidou
Amlan Kar
Maria Shugrina
Karsten Kreis
Andreas Geiger
Sanja Fidler
3DV
ViT
143
155
0
07 Oct 2021
Multi-scale speaker embedding-based graph attention networks for speaker diarisation
Youngki Kwon
Hee-Soo Heo
Jee-weon Jung
You Jin Kim
Bong-Jin Lee
Joon Son Chung
96
19
0
07 Oct 2021
Attention is All You Need? Good Embeddings with Statistics are enough:Large Scale Audio Understanding without Transformers/ Convolutions/ BERTs/ Mixers/ Attention/ RNNs or ....
Prateek Verma
AI4TS
102
2
0
07 Oct 2021
Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to CNNs
Philipp Benz
Soomin Ham
Chaoning Zhang
Adil Karjauv
In So Kweon
AAML
ViT
109
80
0
06 Oct 2021
ABC: Attention with Bounded-memory Control
Hao Peng
Jungo Kasai
Nikolaos Pappas
Dani Yogatama
Zhaofeng Wu
Lingpeng Kong
Roy Schwartz
Noah A. Smith
125
22
0
06 Oct 2021
Ripple Attention for Visual Perception with Sub-quadratic Complexity
Lin Zheng
Huijie Pan
Lingpeng Kong
72
3
0
06 Oct 2021
A Study of the Generalizability of Self-Supervised Representations
Atharva Tendle
Mohammad Rashedul Hasan
162
28
0
19 Sep 2021
Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy
Colin B. Clement
Shuai Lu
Xiaoyu Liu
Michele Tufano
Dawn Drain
Nan Duan
Neel Sundaresan
Alexey Svyatkovskiy
99
27
0
17 Sep 2021
Transformer-Unet: Raw Image Processing with Unet
Youyang Sha
Yonghong Zhang
Xuquan Ji
Lei Hu
ViT
MedIm
50
38
0
17 Sep 2021
Expression Snippet Transformer for Robust Video-based Facial Expression Recognition
Yuanyuan Liu
Wenbin Wang
Chuanxu Feng
Haoyu Zhang
Zhe Chen
Yibing Zhan
ViT
79
65
0
17 Sep 2021
From Known to Unknown: Knowledge-guided Transformer for Time-Series Sales Forecasting in Alibaba
Xinyuan Qi
Kai Hou
Tong Liu
Zhongzhong Yu
Sihao Hu
Wenwu Ou
AI4TS
101
20
0
17 Sep 2021
An End-to-End Transformer Model for 3D Object Detection
Ishan Misra
Rohit Girdhar
Armand Joulin
3DPC
ViT
153
489
0
16 Sep 2021
Focus on Impact: Indoor Exploration with Intrinsic Motivation
Roberto Bigazzi
Federico Landi
S. Cascianelli
Lorenzo Baraldi
Marcella Cornia
Rita Cucchiara
OffRL
87
14
0
14 Sep 2021
Single-Read Reconstruction for DNA Data Storage Using Transformers
Yotam Nahum
Eyar Ben-Tolila
Leon Anavy
80
5
0
12 Sep 2021
A Survey on Multi-modal Summarization
Anubhav Jangra
Sourajit Mukherjee
Adam Jatowt
S. Saha
M. Hasanuzzaman
73
63
0
11 Sep 2021
RefineCap: Concept-Aware Refinement for Image Captioning
Yekun Chai
Shuo Jin
Junliang Xing
VLM
27
1
0
08 Sep 2021
Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification
Zhongxing Ma
Yifan Zhao
Jia Li
ViT
83
54
0
08 Sep 2021
Teaching Autoregressive Language Models Complex Tasks By Demonstration
Gabriel Recchia
85
22
0
05 Sep 2021
Previous
1
2
3
...
9
10
11
...
15
16
17
Next