Image Transformer

15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
    ViT

Papers citing "Image Transformer"

50 / 837 papers shown
Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation
Yukun Feng
Feng Li
Ziang Song
Boyuan Zheng
Philipp Koehn
41
19
0
03 May 2022
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Derya Soydaner
3DV
129
183
0
27 Apr 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
116
117
0
25 Apr 2022
ClusterGNN: Cluster-based Coarse-to-Fine Graph Neural Network for Efficient Feature Matching
Yanxing Shi
Junxiong Cai
Yoli Shavit
Tai-Jiang Mu
Wensen Feng
Kai Zhang
GNN
118
80
0
25 Apr 2022
Transformation Invariant Cancerous Tissue Classification Using Spatially Transformed DenseNet
Omar Mahdi
Ali Bou Nassif
MedIm
28
2
0
23 Apr 2022
Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension
Peihan Miao
Wei Su
Gaoang Wang
Xuewei Li
Xi Li
ObjD
82
10
0
21 Apr 2022
Less than Few: Self-Shot Video Instance Segmentation
Pengwan Yang
Yuki M. Asano
Pascal Mettes
Cees G. M. Snoek
SSL
86
2
0
19 Apr 2022
Learning with Signatures
J. Curtò
I. D. Zarzà
Hongfei Yan
Carlos T. Calafate
133
0
0
17 Apr 2022
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Yan Wang
Liujuan Cao
Yongjian Wu
Feiyue Huang
Rongrong Ji
ViT
64
47
0
16 Apr 2022
Visual Attention Methods in Deep Learning: An In-Depth Survey
Mohammed Hassanin
Saeed Anwar
Ibrahim Radwan
Fahad Shahbaz Khan
Ajmal Mian
136
166
0
16 Apr 2022
Efficient Linear Attention for Fast and Accurate Keypoint Matching
Suwichaya Suwanwimolkul
S. Komorita
3DPC 3DV
74
11
0
16 Apr 2022
Glass Segmentation with RGB-Thermal Image Pairs
Dong Huo
Jian Wang
Yiming Qian
Yee-Hong Yang
ISeg
121
41
0
12 Apr 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
104
9
0
11 Apr 2022
Linear Complexity Randomized Self-attention Mechanism
Lin Zheng
Chong-Jun Wang
Lingpeng Kong
84
33
0
10 Apr 2022
Stripformer: Strip Transformer for Fast Image Deblurring
Fu-Jen Tsai
Yan-Tsung Peng
Yen-Yu Lin
Chung-Chi Tsai
Chia-Wen Lin
ViT
105
184
0
10 Apr 2022
Underwater Image Enhancement Using Pre-trained Transformer
Abderrahmene Boudiaf
Yuhang Guo
Adarsh Ghimire
Naoufel Werghi
G. D. Masi
S. Javed
Jorge Dias
ViT
34
7
0
08 Apr 2022
Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution
Mariana-Iuliana Georgescu
Radu Tudor Ionescu
A. Miron
O. Savencu
Nicolae-Cătălin Ristea
N. Verga
Fahad Shahbaz Khan
SupR
65
55
0
08 Apr 2022
Towards An End-to-End Framework for Flow-Guided Video Inpainting
Zerui Li
Cheng Lu
Jia Qin
Chunle Guo
Ming-Ming Cheng
108
153
0
06 Apr 2022
Revisiting Near/Remote Sensing with Geospatial Attention
Scott Workman
M. U. Rafique
Hunter Blanton
Nathan Jacobs
121
17
0
04 Apr 2022
FoV-Net: Field-of-View Extrapolation Using Self-Attention and Uncertainty
Liqian Ma
Stamatios Georgoulis
Xu Jia
Luc Van Gool
67
6
0
04 Apr 2022
Unitail: Detecting, Reading, and Matching in Retail Scene
Fangyi Chen
Han Zhang
Zaiwang Li
Jiachen Dou
Shentong Mo
Hao Chen
Yongxin Zhang
Uzair Ahmed
Chenchen Zhu
Marios Savvides
103
9
0
01 Apr 2022
Domain Invariant Siamese Attention Mask for Small Object Change Detection via Everyday Indoor Robot Navigation
Koji Takeda
Kanji Tanaka
Yoshimasa Nakamura
3DPC
70
3
0
29 Mar 2022
A General Survey on Attention Mechanisms in Deep Learning
Gianni Brauwers
Flavius Frasincar
106
334
0
27 Mar 2022
A World-Self Model Towards Understanding Intelligence
Yutao Yue
61
2
0
25 Mar 2022
Efficient-VDVAE: Less is more
Louay Hazami
Rayhane Mama
Ragavan Thurairatnam
BDL
104
28
0
25 Mar 2022
High-Performance Transformer Tracking
Xin Chen
B. Yan
Jiawen Zhu
Huchuan Lu
Xiang Ruan
D. Wang
ViT
111
34
0
25 Mar 2022
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni
Adam Polyak
Oron Ashual
Shelly Sheynin
Devi Parikh
Yaniv Taigman
DiffM
97
526
0
24 Mar 2022
Linearizing Transformer with Key-Value Memory
Yizhe Zhang
Deng Cai
133
6
0
23 Mar 2022
ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention
Yang Liu
Jiaxiang Liu
L. Chen
Yuxiang Lu
Shi Feng
Zhida Feng
Yu Sun
Hao Tian
Huancheng Wu
Hai-feng Wang
72
10
0
23 Mar 2022
AbductionRules: Training Transformers to Explain Unexpected Inputs
Nathan Young
Qiming Bao
Joshua Bensemann
Michael Witbrock
AI4CE LRM
64
25
0
23 Mar 2022
PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers
Ryan Grainger
Thomas Paniagua
Xi Song
Naresh P. Cuntoor
Mun Wai Lee
Tianfu Wu
ViT
59
11
0
22 Mar 2022
Scalable Video Object Segmentation with Identification Mechanism
Zongxin Yang
Jiaxu Miao
Yunchao Wei
Wenguan Wang
Xiaohan Wang
Yi Yang
VOS
104
25
0
22 Mar 2022
TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing
Jie Chen
Tianlang He
Weipeng Zhuo
Li Ma
Sangtae Ha
Shueng-Han Gary Chan
CVBM
103
25
0
20 Mar 2022
ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers
Jonáš Kulhánek
Erik Derner
Torsten Sattler
Robert Babuška
ViT
110
75
0
18 Mar 2022
SepTr: Separable Transformer for Audio Spectrogram Processing
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
Fahad Shahbaz Khan
ViT
96
32
0
17 Mar 2022
AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation
Paritosh Mittal
Y. Cheng
Maneesh Singh
Shubham Tulsiani
133
230
0
17 Mar 2022
Image Super-Resolution With Deep Variational Autoencoders
Darius Chira
Ilian Haralampiev
Ole Winther
Andrea Dittadi
Valentin Liévin
DRL
83
33
0
17 Mar 2022
Semantic-aligned Fusion Transformer for One-shot Object Detection
Yizhou Zhao
Xun Guo
Yan Lu
ViT ObjD
82
21
0
17 Mar 2022
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction
Chen-Yu Lee
Chun-Liang Li
Timothy Dozat
Vincent Perot
Guolong Su
Nan Hua
Joshua Ainslie
Renshen Wang
Yasuhisa Fujii
Tomas Pfister
96
79
0
16 Mar 2022
Unified Visual Transformer Compression
Shixing Yu
Tianlong Chen
Jiayi Shen
Huan Yuan
Jianchao Tan
Sen Yang
Ji Liu
Zhangyang Wang
ViT
99
94
0
15 Mar 2022
Implicit Feature Decoupling with Depthwise Quantization
Iordanis Fostiropoulos
Barry W. Boehm
52
2
0
15 Mar 2022
InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding
Hanrong Ye
Dan Xu
ViT
71
90
0
15 Mar 2022
Deep Transformers Thirst for Comprehensive-Frequency Data
R. Xia
Chao Xue
Boyu Deng
Fang Wang
Jingchao Wang
ViT
48
0
0
14 Mar 2022
The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy
Tianlong Chen
Zhenyu Zhang
Yu Cheng
Ahmed Hassan Awadallah
Zhangyang Wang
ViT
109
42
0
12 Mar 2022
Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice
Peihao Wang
Wenqing Zheng
Tianlong Chen
Zhangyang Wang
ViT
88
143
0
09 Mar 2022
Signature and Log-signature for the Study of Empirical Distributions Generated with GANs
J. Curtò
I. D. Zarzà
Hong-Mei Yan
Carlos T. Calafate
158
1
0
07 Mar 2022
PanFormer: a Transformer Based Model for Pan-sharpening
Huanyu Zhou
Qingjie Liu
Yunhong Wang
ViT
59
46
0
06 Mar 2022
Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression
A. B. Koyuncu
Han Gao
Atanas Boev
Georgii Gaikov
Elena Alshina
Eckehard Steinbach
ViT
64
67
0
04 Mar 2022
Enhancing Local Feature Learning for 3D Point Cloud Processing using Unary-Pairwise Attention
H. Xiu
Xin Liu
Weimin Wang
Kyoung-Sook Kim
T. Shinohara
Qiong Chang
M. Matsuoka
3DPC
61
5
0
01 Mar 2022
Dynamic N:M Fine-grained Structured Sparse Attention Mechanism
Zhaodong Chen
Yuying Quan
Zheng Qu
Liu Liu
Yufei Ding
Yuan Xie
92
23
0
28 Feb 2022