ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05751
  4. Cited By
Image Transformer
v1v2v3 (latest)

Image Transformer

15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
    ViT
ArXiv (abs)PDFHTML

Papers citing "Image Transformer"

50 / 837 papers shown
Title
CXR-Net: An Artificial Intelligence Pipeline for Quick Covid-19
  Screening of Chest X-Rays
CXR-Net: An Artificial Intelligence Pipeline for Quick Covid-19 Screening of Chest X-Rays
H. Abdulah
B. Huber
Sinan Lal
H. Abdallah
L. Palese
H. Soltanian-Zadeh
D. Gatti
76
4
0
26 Feb 2021
Nested-block self-attention for robust radiotherapy planning
  segmentation
Nested-block self-attention for robust radiotherapy planning segmentation
Harini Veeraraghavan
Jue Jiang
Elguindi Sharif
S. Berry
Ifeanyirochukwu Onochie
Aditya P. Apte
L. Cerviño
Joseph O. Deasy
98
3
0
26 Feb 2021
Conditional Positional Encodings for Vision Transformers
Conditional Positional Encodings for Vision Transformers
Xiangxiang Chu
Zhi Tian
Bo Zhang
Xinlong Wang
Chunhua Shen
ViT
181
626
0
22 Feb 2021
UniT: Multimodal Multitask Learning with a Unified Transformer
UniT: Multimodal Multitask Learning with a Unified Transformer
Ronghang Hu
Amanpreet Singh
ViT
106
301
0
22 Feb 2021
Evolving Attention with Residual Convolutions
Evolving Attention with Residual Convolutions
Yujing Wang
Yaming Yang
Jiangang Bai
Mingliang Zhang
Jing Bai
Jiahao Yu
Ce Zhang
Gao Huang
Yunhai Tong
ViT
112
34
0
20 Feb 2021
Predicting times of waiting on red signals using BERT
Predicting times of waiting on red signals using BERT
Witold Szejgis
Anna Warno
P. Góra
32
1
0
20 Feb 2021
Hard-Attention for Scalable Image Classification
Hard-Attention for Scalable Image Classification
Athanasios Papadopoulos
Pawel Korus
N. Memon
121
25
0
20 Feb 2021
End-to-End Neural Systems for Automatic Children Speech Recognition: An
  Empirical Study
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study
Prashanth Gurunath Shivakumar
Shrikanth Narayanan
55
54
0
19 Feb 2021
Improved Denoising Diffusion Probabilistic Models
Improved Denoising Diffusion Probabilistic Models
Alex Nichol
Prafulla Dhariwal
DiffM
373
3,749
0
18 Feb 2021
Composable Generative Models
Composable Generative Models
Johan Leduc
Nicolas Grislain
SyDa
84
4
0
18 Feb 2021
LambdaNetworks: Modeling Long-Range Interactions Without Attention
LambdaNetworks: Modeling Long-Range Interactions Without Attention
Irwan Bello
359
181
0
17 Feb 2021
TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can
  Scale Up
TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up
Yi Ding
Shiyu Chang
Zhangyang Wang
ViT
163
394
0
14 Feb 2021
On the Regularity of Attention
On the Regularity of Attention
James Vuckovic
A. Baratin
Rémi Tachet des Combes
60
7
0
10 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
448
2,080
0
09 Feb 2021
Colorization Transformer
Colorization Transformer
Manoj Kumar
Dirk Weissenborn
Nal Kalchbrenner
ViT
357
160
0
08 Feb 2021
TransReID: Transformer-based Object Re-Identification
TransReID: Transformer-based Object Re-Identification
Shuting He
Haowen Luo
Pichao Wang
F. Wang
Hao Li
Wei Jiang
ViT
288
828
0
08 Feb 2021
TransUNet: Transformers Make Strong Encoders for Medical Image
  Segmentation
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
Jieneng Chen
Yongyi Lu
Qihang Yu
Xiangde Luo
Ehsan Adeli
Yan Wang
Le Lu
Alan Yuille
Yuyin Zhou
ViTMedIm
234
3,553
0
08 Feb 2021
Relaxed Transformer Decoders for Direct Action Proposal Generation
Relaxed Transformer Decoders for Direct Action Proposal Generation
Jing Tan
Jiaqi Tang
Limin Wang
Gangshan Wu
ViT
156
182
0
03 Feb 2021
Tokens-to-Token ViT: Training Vision Transformers from Scratch on
  ImageNet
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Li-xin Yuan
Yunpeng Chen
Tao Wang
Weihao Yu
Yujun Shi
Zihang Jiang
Francis E. H. Tay
Jiashi Feng
Shuicheng Yan
ViT
219
1,957
0
28 Jan 2021
Investigating Bi-Level Optimization for Learning and Vision from a
  Unified Perspective: A Survey and Beyond
Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond
Risheng Liu
Jiaxin Gao
Jin Zhang
Deyu Meng
Zhouchen Lin
AI4CE
152
229
0
27 Jan 2021
Adversarial Text-to-Image Synthesis: A Review
Adversarial Text-to-Image Synthesis: A Review
Stanislav Frolov
Tobias Hinz
Federico Raue
Jörn Hees
Andreas Dengel
EGVM
88
178
0
25 Jan 2021
Maximum Likelihood Training of Score-Based Diffusion Models
Maximum Likelihood Training of Score-Based Diffusion Models
Yang Song
Conor Durkan
Iain Murray
Stefano Ermon
DiffM
248
676
0
22 Jan 2021
DAF:re: A Challenging, Crowd-Sourced, Large-Scale, Long-Tailed Dataset
  For Anime Character Recognition
DAF:re: A Challenging, Crowd-Sourced, Large-Scale, Long-Tailed Dataset For Anime Character Recognition
Edwin Arkel Rios
Wen-Huang Cheng
Bo-Cheng Lai
CVBM
44
12
0
21 Jan 2021
Activity Graph Transformer for Temporal Action Localization
Activity Graph Transformer for Temporal Action Localization
Megha Nawhal
Greg Mori
133
71
0
21 Jan 2021
UPDeT: Universal Multi-agent Reinforcement Learning via Policy
  Decoupling with Transformers
UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers
Siyi Hu
Fengda Zhu
Xiaojun Chang
Xiaodan Liang
OffRL
94
77
0
20 Jan 2021
BANet: Blur-aware Attention Networks for Dynamic Scene Deblurring
BANet: Blur-aware Attention Networks for Dynamic Scene Deblurring
Fu-Jen Tsai
Yan-Tsung Peng
Yen-Yu Lin
Chung-Chi Tsai
Chia-Wen Lin
83
101
0
19 Jan 2021
TrackFormer: Multi-Object Tracking with Transformers
TrackFormer: Multi-Object Tracking with Transformers
Tim Meinhardt
A. Kirillov
Laura Leal-Taixe
Christoph Feichtenhofer
VOT
344
781
0
07 Jan 2021
Trear: Transformer-based RGB-D Egocentric Action Recognition
Trear: Transformer-based RGB-D Egocentric Action Recognition
Xiangyu Li
Yonghong Hou
Pichao Wang
Zhimin Gao
Mingliang Xu
Wanqing Li
ViT
233
88
0
05 Jan 2021
Transformers in Vision: A Survey
Transformers in Vision: A Survey
Salman Khan
Muzammal Naseer
Munawar Hayat
Syed Waqas Zamir
Fahad Shahbaz Khan
M. Shah
ViT
418
2,570
0
04 Jan 2021
TransPose: Keypoint Localization via Transformer
TransPose: Keypoint Localization via Transformer
Sen Yang
Zhibin Quan
Mu Nie
Wankou Yang
ViT
205
271
0
28 Dec 2020
A Survey on Visual Transformer
A Survey on Visual Transformer
Kai Han
Yunhe Wang
Hanting Chen
Xinghao Chen
Jianyuan Guo
...
Chunjing Xu
Yixing Xu
Zhaohui Yang
Yiman Zhang
Dacheng Tao
ViT
237
2,294
0
23 Dec 2020
Sub-Linear Memory: How to Make Performers SLiM
Sub-Linear Memory: How to Make Performers SLiM
Valerii Likhosherstov
K. Choromanski
Jared Davis
Xingyou Song
Adrian Weller
71
19
0
21 Dec 2020
ShineOn: Illuminating Design Choices for Practical Video-based Virtual
  Clothing Try-on
ShineOn: Illuminating Design Choices for Practical Video-based Virtual Clothing Try-on
Gaurav Kuppa
Andrew Jong
Vera Liu
Ziwei Liu
Teng-Sheng Moh
CVBM
76
21
0
18 Dec 2020
Taming Transformers for High-Resolution Image Synthesis
Taming Transformers for High-Resolution Image Synthesis
Patrick Esser
Robin Rombach
Bjorn Ommer
ViT
148
3,016
0
17 Dec 2020
End-to-End Human Pose and Mesh Reconstruction with Transformers
End-to-End Human Pose and Mesh Reconstruction with Transformers
Kevin Qinghong Lin
Lijuan Wang
Zicheng Liu
ViT
86
628
0
17 Dec 2020
Transformer Guided Geometry Model for Flow-Based Unsupervised Visual
  Odometry
Transformer Guided Geometry Model for Flow-Based Unsupervised Visual Odometry
Xiangyu Li
Yonghong Hou
Pichao Wang
Zhimin Gao
Mingliang Xu
Wanqing Li
ViT
234
27
0
08 Dec 2020
Perfect density models cannot guarantee anomaly detection
Perfect density models cannot guarantee anomaly detection
Charline Le Lan
Laurent Dinh
102
50
0
07 Dec 2020
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
Huiyu Wang
Yukun Zhu
Hartwig Adam
Alan Yuille
Liang-Chieh Chen
ViT
154
531
0
01 Dec 2020
Deeper or Wider Networks of Point Clouds with Self-attention?
Haoxi Ran
Li Lu
3DPC
55
1
0
29 Nov 2020
AdaBins: Depth Estimation using Adaptive Bins
AdaBins: Depth Estimation using Adaptive Bins
S. Bhat
Ibraheem Alhashim
Peter Wonka
3DVMDEViT
203
864
0
28 Nov 2020
General Multi-label Image Classification with Transformers
General Multi-label Image Classification with Transformers
Jack Lanchantin
Tianlu Wang
Vicente Ordonez
Yanjun Qi
ViT
80
268
0
27 Nov 2020
Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection
  in Autonomous Driving
Temporal-Channel Transformer for 3D Lidar-Based Video Object Detection in Autonomous Driving
Zhenxun Yuan
Xiao-yang Song
Lei Bai
Wen-gang Zhou
Zhe Wang
Wanli Ouyang
ViT
90
132
0
27 Nov 2020
Generative Layout Modeling using Constraint Graphs
Generative Layout Modeling using Constraint Graphs
W. Para
Paul Guerrero
Tom Kelly
Leonidas Guibas
Peter Wonka
88
71
0
26 Nov 2020
Cycle-consistent Generative Adversarial Networks for Neural Style
  Transfer using data from ChangÉ-4
Cycle-consistent Generative Adversarial Networks for Neural Style Transfer using data from ChangÉ-4
J. D. Curtó
R. Duvall
GAN
34
3
0
23 Nov 2020
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them
  on Images
Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images
R. Child
BDLVLM
194
353
0
20 Nov 2020
ConvTransformer: A Convolutional Transformer Network for Video Frame
  Synthesis
ConvTransformer: A Convolutional Transformer Network for Video Frame Synthesis
Zhouyong Liu
S. Luo
Wubin Li
Jingben Lu
Yufan Wu
Shilei Sun
Chunguo Li
Luxi Yang
ViT
111
81
0
20 Nov 2020
Two-Stream Appearance Transfer Network for Person Image Generation
Two-Stream Appearance Transfer Network for Person Image Generation
Chengkang Shen
Peiyan Wang
Wei Tang
3DHGAN
68
0
0
09 Nov 2020
Long Range Arena: A Benchmark for Efficient Transformers
Long Range Arena: A Benchmark for Efficient Transformers
Yi Tay
Mostafa Dehghani
Samira Abnar
Songlin Yang
Dara Bahri
Philip Pham
J. Rao
Liu Yang
Sebastian Ruder
Donald Metzler
171
731
0
08 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
795
41,946
0
22 Oct 2020
Bayesian Attention Modules
Bayesian Attention Modules
Xinjie Fan
Shujian Zhang
Bo Chen
Mingyuan Zhou
183
62
0
20 Oct 2020
Previous
123...1314151617
Next