ResearchTrend.AI
Image Transformer
15 February 2018
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
    ViT

Papers citing "Image Transformer"

50 / 837 papers shown
Title
Image Inpainting via Iteratively Decoupled Probabilistic Modeling
Wenbo Li
Xin Yu
Kun Zhou
Yibing Song
Zhe Lin
Jiaya Jia
DiffM
91
12
0
06 Dec 2022
Lightning Fast Video Anomaly Detection via Adversarial Knowledge Distillation
Florinel-Alin Croitoru
Nicolae-Cătălin Ristea
D. Dascalescu
Radu Tudor Ionescu
Fahad Shahbaz Khan
M. Shah
111
2
0
28 Nov 2022
Deep representation learning: Fundamentals, Perspectives, Applications, and Open Challenges
K. T. Baghaei
Amirreza Payandeh
Pooya Fayyazsanavi
Shahram Rahimi
Zhiqian Chen
Somayeh Bakhtiari Ramezani
FaML AI4TS
71
6
0
27 Nov 2022
DBA: Efficient Transformer with Dynamic Bilinear Low-Rank Attention
Bosheng Qin
Juncheng Li
Siliang Tang
Yueting Zhuang
59
2
0
24 Nov 2022
Extreme Generative Image Compression by Learning Text Embedding from Diffusion Models
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
75
23
0
14 Nov 2022
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
Zhihong Pan
Xiaoxia Zhou
Hao Tian
DiffM
62
12
0
14 Nov 2022
Efficient Image Generation with Variadic Attention Heads
Steven Walton
Ali Hassani
Xingqian Xu
Zhangyang Wang
Humphrey Shi
ViT
89
23
0
10 Nov 2022
Astronomia ex machina: a history, primer, and outlook on neural networks in astronomy
Michael J. Smith
James E. Geach
76
36
0
07 Nov 2022
MogaNet: Multi-order Gated Aggregation Network
Siyuan Li
Zedong Wang
Zicheng Liu
Cheng Tan
Haitao Lin
Di Wu
Zhiyuan Chen
Jiangbin Zheng
Stan Z. Li
109
65
0
07 Nov 2022
A Transformer Architecture for Online Gesture Recognition of Mathematical Expressions
Mirco Ramo
Guénolé Silvestre
50
1
0
04 Nov 2022
Low-Resource Music Genre Classification with Cross-Modal Neural Model Reprogramming
Yun-Ning Hung
Chao-Han Huck Yang
Pin-Yu Chen
Alexander Lerch
100
19
0
02 Nov 2022
M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design
Hanxue Liang
Zhiwen Fan
Rishov Sarkar
Ziyu Jiang
Tianlong Chen
Kai Zou
Yu Cheng
Cong Hao
Zhangyang Wang
MoE
77
88
0
26 Oct 2022
End-to-end Transformer for Compressed Video Quality Enhancement
Li Yu
Wenshuai Chang
Shiyu Wu
Moncef Gabbouj
ViT
69
9
0
25 Oct 2022
Perfectly Secure Steganography Using Minimum Entropy Coupling
Christian Schroeder de Witt
Samuel Sokota
J. Zico Kolter
Jakob N. Foerster
Martin Strohmeier
150
37
0
24 Oct 2022
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding
Yuechen Wang
Wen-gang Zhou
Houqiang Li
AI4TS
63
13
0
21 Oct 2022
Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation
Botao Yu
Peiling Lu
Rui Wang
Wei Hu
Xu Tan
Wei Ye
Shikun Zhang
Tao Qin
Tie-Yan Liu
MGen
104
60
0
19 Oct 2022
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Zongxin Yang
Yi Yang
VOS
110
159
0
18 Oct 2022
DiffGAR: Model-Agnostic Restoration from Generative Artifacts Using Image-to-Image Diffusion Models
Yueqin Yin
Lianghua Huang
Yu Liu
Kaiqiang Huang
DiffM
76
12
0
16 Oct 2022
Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities
Brian Bartoldson
B. Kailkhura
Davis W. Blalock
113
51
0
13 Oct 2022
Understanding Embodied Reference with Touch-Line Transformer
Yongqian Li
Xiaoxue Chen
Hao Zhao
Jiangtao Gong
Guyue Zhou
Federico Rossano
Yixin Zhu
172
17
0
11 Oct 2022
Style-Guided Inference of Transformer for High-resolution Image Synthesis
Jonghwa Yim
Minjae Kim
ViT
103
0
0
11 Oct 2022
Bird-Eye Transformers for Text Generation Models
Lei Sha
Yuhang Song
Yordan Yordanov
Tommaso Salvatori
Thomas Lukasiewicz
57
0
0
08 Oct 2022
Progressive Text-to-Image Generation
Zhengcong Fei
Mingyuan Fan
Li Zhu
Junshi Huang
172
4
0
05 Oct 2022
WavSpA: Wavelet Space Attention for Boosting Transformers' Long Sequence Learning Ability
Yufan Zhuang
Zihan Wang
Fangbo Tao
Jingbo Shang
ViT AI4TS
82
3
0
05 Oct 2022
Implicit Warping for Animation with Image Sets
Arun Mallya
Ting-Chun Wang
Xuan Li
VGen
182
42
0
04 Oct 2022
SPARC: Sparse Render-and-Compare for CAD model alignment in a single RGB image
Florian Langer
Gwangbin Bae
Ignas Budvytis
R. Cipolla
3DPC
90
12
0
03 Oct 2022
Visual Prompt Tuning for Generative Transfer Learning
Kihyuk Sohn
Yuan Hao
José Lezama
Luisa F. Polanía
Huiwen Chang
Han Zhang
Irfan Essa
Lu Jiang
VPVLM VLM
163
89
0
03 Oct 2022
Grouped self-attention mechanism for a memory-efficient Transformer
Bumjun Jung
Yusuke Mukuta
Tatsuya Harada
AI4TS
26
3
0
02 Oct 2022
STGIN: A Spatial Temporal Graph-Informer Network for Long Sequence Traffic Speed Forecasting
Ruikang Luo
Yaofeng Song
Liping Huang
Yicheng Zhang
Rong Su
GNN AI4TS
42
15
0
01 Oct 2022
Dilated Neighborhood Attention Transformer
Ali Hassani
Humphrey Shi
ViT MedIm
116
73
0
29 Sep 2022
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention
Ziyu Guo
Renrui Zhang
Longtian Qiu
Xianzheng Ma
Xupeng Miao
Xuming He
Tengjiao Wang
VLM AAML
113
119
0
28 Sep 2022
Dense-TNT: Efficient Vehicle Type Classification Neural Network Using Satellite Imagery
Ruikang Luo
Yaofeng Song
Haiying Zhao
Yicheng Zhang
Yi Zhang
Nanbin Zhao
Liping Huang
Rong Su
ViT
65
12
0
27 Sep 2022
Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection
Neelu Madan
Nicolae-Cătălin Ristea
Radu Tudor Ionescu
Kamal Nasrollahi
Fahad Shahbaz Khan
T. Moeslund
M. Shah
ViT MedIm
343
71
0
25 Sep 2022
SpeedLimit: Neural Architecture Search for Quantized Transformer Models
Yuji Chai
Luke Bailey
Yunho Jin
Matthew Karle
Glenn G. Ko
David Brooks
Gu-Yeon Wei
H. T. Kung
MQ
48
0
0
25 Sep 2022
UniColor: A Unified Framework for Multi-Modal Colorization with Transformer
Zhitong Huang
Nanxuan Zhao
Jing Liao
ViT
80
16
0
22 Sep 2022
MGTR: End-to-End Mutual Gaze Detection with Transformer
Han Guo
Zhengxi Hu
Jingtai Liu
ViT
46
8
0
22 Sep 2022
Mega: Moving Average Equipped Gated Attention
Xuezhe Ma
Chunting Zhou
Xiang Kong
Junxian He
Liangke Gui
Graham Neubig
Jonathan May
Luke Zettlemoyer
146
185
0
21 Sep 2022
Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?
Yi Wang
Zhiwen Fan
Tianlong Chen
Hehe Fan
Zhangyang Wang
ViT
109
10
0
15 Sep 2022
A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-Rays
Haoqin Ji
Haozhe Liu
Yuexiang Li
Jinheng Xie
Nanjun He
Yawen Huang
Dong Wei
Xinrong Chen
Linlin Shen
Yefeng Zheng
68
0
0
05 Sep 2022
DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation
Mengqi Huang
Zhendong Mao
Penghui Wang
Quang Wang
Yongdong Zhang
70
21
0
03 Sep 2022
Large-Scale Auto-Regressive Modeling Of Street Networks
Michael Birsak
Tom Kelly
W. Para
Peter Wonka
GNN AI4TS
46
6
0
01 Sep 2022
A Circular Window-based Cascade Transformer for Online Action Detection
Shuyuan Cao
Weihua Luo
Bairui Wang
Wei Emma Zhang
Lin Ma
97
6
0
30 Aug 2022
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window
Mocho Go
Hideyuki Tachibana
ViT
68
9
0
24 Aug 2022
Distance-Aware Occlusion Detection with Focused Attention
Yongqian Li
Yucheng Tu
Xiaoxue Chen
Hao Zhao
Guyue Zhou
76
6
0
23 Aug 2022
Accelerating Vision Transformer Training via a Patch Sampling Schedule
Bradley McDanel
C. Huynh
ViT
77
1
0
19 Aug 2022
GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement
Zhi-Qi Cheng
Qianwen Dai
Siyao Li
Teruko Mitamura
Alexander G. Hauptmann
72
37
0
18 Aug 2022
Deep is a Luxury We Don't Have
Ahmed Taha
Yen Nhi Truong Vu
Brent Mombourquette
Thomas P. Matthews
Jason Su
Sadanand Singh
ViT MedIm
66
2
0
11 Aug 2022
Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning
Ting-Li Chen
Ruixiang Zhang
Geoffrey E. Hinton
DiffM
146
313
0
08 Aug 2022
HaloAE: An HaloNet based Local Transformer Auto-Encoder for Anomaly Detection and Localization
É. Mathian
H. Liu
L. Fernandez-Cuesta
Dimitris Samaras
M. Foll
L. Chen
ViT
92
12
0
06 Aug 2022
What Can Transformers Learn In-Context? A Case Study of Simple Function Classes
Shivam Garg
Dimitris Tsipras
Percy Liang
Gregory Valiant
167
514
0
01 Aug 2022