ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05751
  4. Cited By
Image Transformer

Image Transformer

15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
    ViT
ArXivPDFHTML

Papers citing "Image Transformer"

50 / 369 papers shown
Title
Taming Transformers for High-Resolution Image Synthesis
Taming Transformers for High-Resolution Image Synthesis
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
64
2,831
0
17 Dec 2020
End-to-End Human Pose and Mesh Reconstruction with Transformers
End-to-End Human Pose and Mesh Reconstruction with Transformers
Kevin Qinghong Lin
Lijuan Wang
Zicheng Liu
ViT
34
613
0
17 Dec 2020
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
40
527
0
01 Dec 2020
AdaBins: Depth Estimation using Adaptive Bins
AdaBins: Depth Estimation using Adaptive Bins
S. Bhat
Ibraheem Alhashim
Peter Wonka
3DV
MDE
ViT
66
836
0
28 Nov 2020
Generative Layout Modeling using Constraint Graphs
Generative Layout Modeling using Constraint Graphs
W. Para
Paul Guerrero
Tom Kelly
Leonidas J. Guibas
Peter Wonka
31
68
0
26 Nov 2020
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them
  on Images
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
R. Child
BDL
VLM
56
339
0
20 Nov 2020
ConvTransformer: A Convolutional Transformer Network for Video Frame
  Synthesis
ConvTransformer: A Convolutional Transformer Network for Video Frame Synthesis
Zhouyong Liu
S. Luo
Wubin Li
Jingben Lu
Yufan Wu
Shilei Sun
Chunguo Li
Luxi Yang
ViT
33
79
0
20 Nov 2020
Two-Stream Appearance Transfer Network for Person Image Generation
Two-Stream Appearance Transfer Network for Person Image Generation
Chengkang Shen
Peiyan Wang
Wei Tang
3DH
GAN
22
0
0
09 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
41
39,428
0
22 Oct 2020
Bayesian Attention Modules
Bayesian Attention Modules
Xinjie Fan
Shujian Zhang
Bo Chen
Mingyuan Zhou
117
59
0
20 Oct 2020
Rethinking Attention with Performers
Rethinking Attention with Performers
K. Choromanski
Valerii Likhosherstov
David Dohan
Xingyou Song
Andreea Gane
...
Afroz Mohiuddin
Lukasz Kaiser
David Belanger
Lucy J. Colwell
Adrian Weller
60
1,522
0
30 Sep 2020
DeepRemaster: Temporal Source-Reference Attention Networks for
  Comprehensive Video Enhancement
DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement
S. Iizuka
E. Simo-Serra
27
39
0
18 Sep 2020
Efficient Transformers: A Survey
Efficient Transformers: A Survey
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
114
1,103
0
14 Sep 2020
Skeleton-based Action Recognition via Spatial and Temporal Transformer
  Networks
Skeleton-based Action Recognition via Spatial and Temporal Transformer Networks
Chiara Plizzari
Marco Cannici
Matteo Matteucci
ViT
MedIm
25
300
0
17 Aug 2020
Conformer-Kernel with Query Term Independence for Document Retrieval
Conformer-Kernel with Query Term Independence for Document Retrieval
Bhaskar Mitra
Sebastian Hofstatter
Hamed Zamani
Nick Craswell
27
21
0
20 Jul 2020
Kernelized Memory Network for Video Object Segmentation
Kernelized Memory Network for Video Object Segmentation
Hongje Seong
Junhyuk Hyun
Euntai Kim
VOS
19
195
0
16 Jul 2020
Autoregressive Unsupervised Image Segmentation
Autoregressive Unsupervised Image Segmentation
Yassine Ouali
C´eline Hudelot
Myriam Tami
SSL
35
86
0
16 Jul 2020
Can neural networks acquire a structural bias from raw linguistic data?
Can neural networks acquire a structural bias from raw linguistic data?
Alex Warstadt
Samuel R. Bowman
AI4CE
20
53
0
14 Jul 2020
Data Movement Is All You Need: A Case Study on Optimizing Transformers
Data Movement Is All You Need: A Case Study on Optimizing Transformers
A. Ivanov
Nikoli Dryden
Tal Ben-Nun
Shigang Li
Torsten Hoefler
36
131
0
30 Jun 2020
Direct Feedback Alignment Scales to Modern Deep Learning Tasks and
  Architectures
Direct Feedback Alignment Scales to Modern Deep Learning Tasks and Architectures
Julien Launay
Iacopo Poli
Franccois Boniface
Florent Krzakala
41
63
0
23 Jun 2020
Locally Masked Convolution for Autoregressive Models
Locally Masked Convolution for Autoregressive Models
Ajay Jain
Pieter Abbeel
Deepak Pathak
DiffM
OffRL
39
31
0
22 Jun 2020
Sparse GPU Kernels for Deep Learning
Sparse GPU Kernels for Deep Learning
Trevor Gale
Matei A. Zaharia
C. Young
Erich Elsen
17
230
0
18 Jun 2020
MoFlow: An Invertible Flow Model for Generating Molecular Graphs
MoFlow: An Invertible Flow Model for Generating Molecular Graphs
Chengxi Zang
Fei Wang
BDL
28
280
0
17 Jun 2020
Density of States Estimation for Out-of-Distribution Detection
Density of States Estimation for Out-of-Distribution Detection
Warren Morningstar
Cusuh Ham
Andrew Gallagher
Balaji Lakshminarayanan
Alexander A. Alemi
Joshua V. Dillon
OODD
22
83
0
16 Jun 2020
A Survey on Generative Adversarial Networks: Variants, Applications, and
  Training
A Survey on Generative Adversarial Networks: Variants, Applications, and Training
Abdul Jabbar
Xi Li
Bourahla Omar
25
266
0
09 Jun 2020
Visual Transformers: Token-based Image Representation and Processing for
  Computer Vision
Visual Transformers: Token-based Image Representation and Processing for Computer Vision
Bichen Wu
Chenfeng Xu
Xiaoliang Dai
Alvin Wan
Peizhao Zhang
Zhicheng Yan
Masayoshi Tomizuka
Joseph E. Gonzalez
Kurt Keutzer
Peter Vajda
ViT
39
548
0
05 Jun 2020
An Overview of Neural Network Compression
An Overview of Neural Network Compression
James OÑeill
AI4CE
45
98
0
05 Jun 2020
Masked Language Modeling for Proteins via Linearly Scalable Long-Context
  Transformers
Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers
K. Choromanski
Valerii Likhosherstov
David Dohan
Xingyou Song
Andreea Gane
...
Peter Hawkins
Jared Davis
David Belanger
Lucy J. Colwell
Adrian Weller
39
84
0
05 Jun 2020
End-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
125
12,711
0
26 May 2020
Flowtron: an Autoregressive Flow-based Generative Network for
  Text-to-Speech Synthesis
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis
Rafael Valle
Kevin J. Shih
R. Prenger
Bryan Catanzaro
21
119
0
12 May 2020
Attentional Bottleneck: Towards an Interpretable Deep Driving Network
Attentional Bottleneck: Towards an Interpretable Deep Driving Network
Jinkyu Kim
Mayank Bansal
27
13
0
08 May 2020
Progressive Transformers for End-to-End Sign Language Production
Progressive Transformers for End-to-End Sign Language Production
Ben Saunders
Necati Cihan Camgöz
Richard Bowden
SLR
22
128
0
30 Apr 2020
Exploring Self-attention for Image Recognition
Exploring Self-attention for Image Recognition
Hengshuang Zhao
Jiaya Jia
V. Koltun
SSL
52
774
0
28 Apr 2020
A Spatio-temporal Transformer for 3D Human Motion Prediction
A Spatio-temporal Transformer for 3D Human Motion Prediction
Emre Aksan
Manuel Kaufmann
Peng Cao
Otmar Hilliges
ViT
28
224
0
18 Apr 2020
Highway Transformer: Self-Gating Enhanced Self-Attentive Networks
Highway Transformer: Self-Gating Enhanced Self-Attentive Networks
Yekun Chai
Jin Shuo
Xinwen Hou
23
17
0
17 Apr 2020
Spatially-Attentive Patch-Hierarchical Network for Adaptive Motion
  Deblurring
Spatially-Attentive Patch-Hierarchical Network for Adaptive Motion Deblurring
Maitreya Suin
Kuldeep Purohit
A. N. Rajagopalan
3DV
33
280
0
11 Apr 2020
Variational Transformers for Diverse Response Generation
Variational Transformers for Diverse Response Generation
Zhaojiang Lin
Genta Indra Winata
Peng Xu
Zihan Liu
Pascale Fung
DRL
18
51
0
28 Mar 2020
Actor-Transformers for Group Activity Recognition
Actor-Transformers for Group Activity Recognition
Kirill Gavrilyuk
Ryan Sanford
Mehrsan Javan
Cees G. M. Snoek
ViT
19
178
0
28 Mar 2020
Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation
Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation
Huiyu Wang
Yukun Zhu
Bradley Green
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
3DPC
28
658
0
17 Mar 2020
Sparse Sinkhorn Attention
Sparse Sinkhorn Attention
Yi Tay
Dara Bahri
Liu Yang
Donald Metzler
Da-Cheng Juan
23
331
0
26 Feb 2020
Sketchformer: Transformer-based Representation for Sketched Structure
Sketchformer: Transformer-based Representation for Sketched Structure
Leo Sampaio Ferraz Ribeiro
Tu Bui
John Collomosse
M. Ponti
22
127
0
24 Feb 2020
Convolutional Tensor-Train LSTM for Spatio-temporal Learning
Convolutional Tensor-Train LSTM for Spatio-temporal Learning
Jiahao Su
Wonmin Byeon
Jean Kossaifi
Furong Huang
Jan Kautz
Anima Anandkumar
AI4TS
19
119
0
21 Feb 2020
Robustness Verification for Transformers
Robustness Verification for Transformers
Zhouxing Shi
Huan Zhang
Kai-Wei Chang
Minlie Huang
Cho-Jui Hsieh
AAML
24
105
0
16 Feb 2020
Axial Attention in Multidimensional Transformers
Axial Attention in Multidimensional Transformers
Jonathan Ho
Nal Kalchbrenner
Dirk Weissenborn
Tim Salimans
36
519
0
20 Dec 2019
C-Flow: Conditional Generative Flow Models for Images and 3D Point
  Clouds
C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds
Albert Pumarola
S. Popov
Francesc Moreno-Noguer
V. Ferrari
3DPC
AI4CE
31
80
0
15 Dec 2019
Encoding Musical Style with Transformer Autoencoders
Encoding Musical Style with Transformer Autoencoders
Kristy Choi
Curtis Hawthorne
Ian Simon
Monica Dinculescu
Jesse Engel
33
89
0
10 Dec 2019
Factorized Multimodal Transformer for Multimodal Sequential Learning
Factorized Multimodal Transformer for Multimodal Sequential Learning
Amir Zadeh
Chengfeng Mao
Kelly Shi
Yiwei Zhang
Paul Pu Liang
Soujanya Poria
Louis-Philippe Morency
25
44
0
22 Nov 2019
A Simplified Fully Quantized Transformer for End-to-end Speech
  Recognition
A Simplified Fully Quantized Transformer for End-to-end Speech Recognition
Alex Bie
Bharat Venkitesh
João Monteiro
Md. Akmal Haidar
Mehdi Rezagholizadeh
MQ
32
27
0
09 Nov 2019
Convolutional Conditional Neural Processes
Convolutional Conditional Neural Processes
Jonathan Gordon
W. Bruinsma
Andrew Y. K. Foong
James Requeima
Yann Dubois
Richard Turner
BDL
25
162
0
29 Oct 2019
LinesToFacePhoto: Face Photo Generation from Lines with Conditional
  Self-Attention Generative Adversarial Network
LinesToFacePhoto: Face Photo Generation from Lines with Conditional Self-Attention Generative Adversarial Network
Yuhang Li
Xiao Chen
Feng Wu
Zhengjun Zha
CVBM
GAN
27
65
0
20 Oct 2019
Previous
12345678
Next