ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05751
  4. Cited By
Image Transformer
v1v2v3 (latest)

Image Transformer

15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
    ViT
ArXiv (abs)PDFHTML

Papers citing "Image Transformer"

50 / 837 papers shown
Title
MCFlow: Monte Carlo Flow Models for Data Imputation
MCFlow: Monte Carlo Flow Models for Data Imputation
Trevor W. Richardson
Wencheng Wu
Lei Lin
Beilei Xu
Edgar A. Bernal
OOD
82
48
0
27 Mar 2020
Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation
Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation
Huiyu Wang
Yukun Zhu
Bradley Green
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
3DPC
138
676
0
17 Mar 2020
Efficient Content-Based Sparse Attention with Routing Transformers
Efficient Content-Based Sparse Attention with Routing Transformers
Aurko Roy
M. Saffar
Ashish Vaswani
David Grangier
MoE
430
607
0
12 Mar 2020
The Impact of Hole Geometry on Relative Robustness of In-Painting
  Networks: An Empirical Study
The Impact of Hole Geometry on Relative Robustness of In-Painting Networks: An Empirical Study
Masood S. Mortazavi
Ning Yan
AAMLOOD
26
0
0
04 Mar 2020
Sparse Sinkhorn Attention
Sparse Sinkhorn Attention
Yi Tay
Dara Bahri
Liu Yang
Donald Metzler
Da-Cheng Juan
102
342
0
26 Feb 2020
Sketchformer: Transformer-based Representation for Sketched Structure
Sketchformer: Transformer-based Representation for Sketched Structure
Leo Sampaio Ferraz Ribeiro
Tu Bui
John Collomosse
M. Ponti
106
128
0
24 Feb 2020
PolyGen: An Autoregressive Generative Model of 3D Meshes
PolyGen: An Autoregressive Generative Model of 3D Meshes
C. Nash
Yaroslav Ganin
A. Eslami
Peter W. Battaglia
AI4CE
118
262
0
23 Feb 2020
Convolutional Tensor-Train LSTM for Spatio-temporal Learning
Convolutional Tensor-Train LSTM for Spatio-temporal Learning
Jiahao Su
Wonmin Byeon
Jean Kossaifi
Furong Huang
Jan Kautz
Anima Anandkumar
AI4TS
76
122
0
21 Feb 2020
Source Separation with Deep Generative Priors
Source Separation with Deep Generative Priors
V. Jayaram
John Thickstun
94
40
0
19 Feb 2020
LocoGAN -- Locally Convolutional GAN
LocoGAN -- Locally Convolutional GAN
Lukasz Struski
Szymon Knop
Jacek Tabor
Wiktor Daniec
Przemysław Spurek
GAN
47
10
0
18 Feb 2020
Robustness Verification for Transformers
Robustness Verification for Transformers
Zhouxing Shi
Huan Zhang
Kai-Wei Chang
Minlie Huang
Cho-Jui Hsieh
AAML
89
109
0
16 Feb 2020
Closing the Dequantization Gap: PixelCNN as a Single-Layer Flow
Closing the Dequantization Gap: PixelCNN as a Single-Layer Flow
Didrik Nielsen
Ole Winther
MQ
235
13
0
06 Feb 2020
Adversarial Code Learning for Image Generation
Adversarial Code Learning for Image Generation
Jiangbo Yuan
Bing Wu
Wanying Ding
Q. Ping
Zhendong Yu
GAN
37
0
0
30 Jan 2020
Auto Completion of User Interface Layout Design Using Transformer-Based
  Tree Decoders
Auto Completion of User Interface Layout Design Using Transformer-Based Tree Decoders
Yang Li
J. Amelot
Xin Zhou
Samy Bengio
Si Si
3DV
40
14
0
14 Jan 2020
Faster Transformer Decoding: N-gram Masked Self-Attention
Faster Transformer Decoding: N-gram Masked Self-Attention
Ciprian Chelba
Mengzhao Chen
Ankur Bapna
Noam M. Shazeer
57
16
0
14 Jan 2020
Reformer: The Efficient Transformer
Reformer: The Efficient Transformer
Nikita Kitaev
Lukasz Kaiser
Anselm Levskaya
VLM
241
2,342
0
13 Jan 2020
Axial Attention in Multidimensional Transformers
Axial Attention in Multidimensional Transformers
Jonathan Ho
Nal Kalchbrenner
Dirk Weissenborn
Tim Salimans
117
535
0
20 Dec 2019
C-Flow: Conditional Generative Flow Models for Images and 3D Point
  Clouds
C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds
Albert Pumarola
S. Popov
Francesc Moreno-Noguer
V. Ferrari
3DPCAI4CE
161
80
0
15 Dec 2019
Encoding Musical Style with Transformer Autoencoders
Encoding Musical Style with Transformer Autoencoders
Kristy Choi
Curtis Hawthorne
Ian Simon
Monica Dinculescu
Jesse Engel
95
90
0
10 Dec 2019
Towards Robust Image Classification Using Sequential Attention Models
Towards Robust Image Classification Using Sequential Attention Models
Daniel Zoran
Mike Chrzanowski
Po-Sen Huang
Sven Gowal
Alex Mott
Pushmeet Kohli
AAML
66
61
0
04 Dec 2019
IENet: Interacting Embranchment One Stage Anchor Free Detector for
  Orientation Aerial Object Detection
IENet: Interacting Embranchment One Stage Anchor Free Detector for Orientation Aerial Object Detection
Youtian Lin
Pengming Feng
Jian Guan
Wenwu Wang
Jonathon Chambers
ObjD
83
86
0
02 Dec 2019
Factorized Multimodal Transformer for Multimodal Sequential Learning
Factorized Multimodal Transformer for Multimodal Sequential Learning
Amir Zadeh
Chengfeng Mao
Kelly Shi
Yiwei Zhang
Paul Pu Liang
Soujanya Poria
Louis-Philippe Morency
69
45
0
22 Nov 2019
Affine Self Convolution
Affine Self Convolution
Nichita Diaconu
Daniel E. Worrall
35
3
0
18 Nov 2019
A Simplified Fully Quantized Transformer for End-to-end Speech
  Recognition
A Simplified Fully Quantized Transformer for End-to-end Speech Recognition
Alex Bie
Bharat Venkitesh
João Monteiro
Md. Akmal Haidar
Mehdi Rezagholizadeh
MQ
139
27
0
09 Nov 2019
Contextual Grounding of Natural Language Entities in Images
Contextual Grounding of Natural Language Entities in Images
Farley Lai
Ning Xie
Derek Doran
Asim Kadav
ObjD
55
6
0
05 Nov 2019
Learning to Fix Build Errors with Graph2Diff Neural Networks
Learning to Fix Build Errors with Graph2Diff Neural Networks
Daniel Tarlow
Subhodeep Moitra
Andrew Rice
Zimin Chen
Pierre-Antoine Manzagol
Charles Sutton
E. Aftandilian
GNN
128
65
0
04 Nov 2019
Image-Conditioned Graph Generation for Road Network Extraction
Image-Conditioned Graph Generation for Road Network Extraction
Davide Belli
Thomas Kipf
GNN
55
40
0
31 Oct 2019
Convolutional Conditional Neural Processes
Convolutional Conditional Neural Processes
Jonathan Gordon
W. Bruinsma
Andrew Y. K. Foong
James Requeima
Yann Dubois
Richard Turner
BDL
104
168
0
29 Oct 2019
LinesToFacePhoto: Face Photo Generation from Lines with Conditional
  Self-Attention Generative Adversarial Network
LinesToFacePhoto: Face Photo Generation from Lines with Conditional Self-Attention Generative Adversarial Network
Yuhang Li
Xiao Chen
Feng Wu
Zhengjun Zha
CVBMGAN
68
66
0
20 Oct 2019
Root Mean Square Layer Normalization
Root Mean Square Layer Normalization
Biao Zhang
Rico Sennrich
130
768
0
16 Oct 2019
On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention
On Recognizing Texts of Arbitrary Shapes with 2D Self-Attention
Junyeop Lee
Sungrae Park
Jeonghun Baek
Seong Joon Oh
Seonghyeon Kim
Hwalsuk Lee
383
124
0
10 Oct 2019
Bootstrapping Conditional GANs for Video Game Level Generation
Bootstrapping Conditional GANs for Video Game Level Generation
R. Torrado
Geetanjali Sharma
M. Green
Gaurav Jaswal
S. Risi
Julian Togelius
GAN
71
86
0
03 Oct 2019
Local block-wise self attention for normal organ segmentation
Local block-wise self attention for normal organ segmentation
Jue Jiang
Elguindi Sharif
Hyemin Um
S. Berry
Harini Veeraraghavan
3DPC
28
6
0
11 Sep 2019
Forecaster: A Graph Transformer for Forecasting Spatial and
  Time-Dependent Data
Forecaster: A Graph Transformer for Forecasting Spatial and Time-Dependent Data
Yongqian Li
J. M. F. Moura
AI4TS
84
31
0
09 Sep 2019
AtLoc: Attention Guided Camera Localization
AtLoc: Attention Guided Camera Localization
Bing Wang
Changhao Chen
Chris Xiaoxuan Lu
Peijun Zhao
A. Trigoni
Andrew Markham
92
158
0
08 Sep 2019
Transformer Dissection: A Unified Understanding of Transformer's
  Attention via the Lens of Kernel
Transformer Dissection: A Unified Understanding of Transformer's Attention via the Lens of Kernel
Yao-Hung Hubert Tsai
Shaojie Bai
M. Yamada
Louis-Philippe Morency
Ruslan Salakhutdinov
185
261
0
30 Aug 2019
Multiresolution Transformer Networks: Recurrence is Not Essential for
  Modeling Hierarchical Structure
Multiresolution Transformer Networks: Recurrence is Not Essential for Modeling Hierarchical Structure
Vikas Garg
Inderjit S. Dhillon
Hsiang-Fu Yu
59
7
0
27 Aug 2019
Attention-based Dropout Layer for Weakly Supervised Object Localization
Attention-based Dropout Layer for Weakly Supervised Object Localization
Junsuk Choe
Hyunjung Shim
WSOL
147
368
0
27 Aug 2019
Latent-Variable Non-Autoregressive Neural Machine Translation with
  Deterministic Inference Using a Delta Posterior
Latent-Variable Non-Autoregressive Neural Machine Translation with Deterministic Inference Using a Delta Posterior
Raphael Shu
Jason D. Lee
Hideki Nakayama
Kyunghyun Cho
BDL
100
117
0
20 Aug 2019
Likelihood Contribution based Multi-scale Architecture for Generative
  Flows
Likelihood Contribution based Multi-scale Architecture for Generative Flows
Hari Prasanna Das
Pieter Abbeel
C. Spanos
DRLAI4CE
61
5
0
05 Aug 2019
GENESIS: Generative Scene Inference and Sampling with Object-Centric
  Latent Representations
GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations
Martin Engelcke
Adam R. Kosiorek
Oiwi Parker Jones
Ingmar Posner
OCL
181
309
0
30 Jul 2019
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer
  on Time Series Forecasting
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting
Shiyang Li
Xiaoyong Jin
Yao Xuan
Xiyou Zhou
Wenhu Chen
Yu Wang
Xifeng Yan
AI4TS
204
1,452
0
29 Jun 2019
Learning Set-equivariant Functions with SWARM Mappings
Learning Set-equivariant Functions with SWARM Mappings
Roland Vollgraf
24
3
0
22 Jun 2019
Stand-Alone Self-Attention in Vision Models
Stand-Alone Self-Attention in Vision Models
Prajit Ramachandran
Niki Parmar
Ashish Vaswani
Irwan Bello
Anselm Levskaya
Jonathon Shlens
VLMSLRViT
184
1,218
0
13 Jun 2019
Toward Interpretable Music Tagging with Self-Attention
Toward Interpretable Music Tagging with Self-Attention
Minz Won
Sanghyuk Chun
Xavier Serra
ViT
74
82
0
12 Jun 2019
Scaling Autoregressive Video Models
Scaling Autoregressive Video Models
Dirk Weissenborn
Oscar Täckström
Jakob Uszkoreit
DiffMVGen
127
204
0
06 Jun 2019
Towards Interpretable Reinforcement Learning Using Attention Augmented
  Agents
Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Alex Mott
Daniel Zoran
Mike Chrzanowski
Daan Wierstra
Danilo Jimenez Rezende
74
192
0
06 Jun 2019
MelNet: A Generative Model for Audio in the Frequency Domain
MelNet: A Generative Model for Audio in the Frequency Domain
Sean Vasquez
M. Lewis
DiffM
93
132
0
04 Jun 2019
SCRAM: Spatially Coherent Randomized Attention Maps
SCRAM: Spatially Coherent Randomized Attention Maps
D. A. Calian
P. Roelants
Jacques Calì
B. Carr
K. Dubba
John E. Reid
Dell Zhang
54
2
0
24 May 2019
Less Memory, Faster Speed: Refining Self-Attention Module for Image
  Reconstruction
Less Memory, Faster Speed: Refining Self-Attention Module for Image Reconstruction
Zheng Wang
Jianwu Li
Ge Song
Tieling Li
23
2
0
20 May 2019
Previous
123...151617
Next